path: root/candle-examples/examples/bert/main.rs
Commit message | Author | Date | Files | Lines
* bert attention mask (#1934) | Zheng Li | 2024-08-01 | 1 | -2/+10
    * bert attention mask
    * Allow for using None as a mask.
    * Revert part of the changes so that the proper default mask applies.
    * Cosmetic change.
    * Another cosmetic tweak.
    Co-authored-by: Laurent <laurent.mazare@gmail.com>
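Note: the gist of an attention mask like the one this change introduces is to add a large negative bias to the scores of padding positions before the softmax. A minimal sketch in current candle terms (the function and variable names are illustrative, not the code from this commit):

    use candle::{DType, Result, Tensor};

    /// Turn a (batch, seq_len) padding mask (1 = real token, 0 = padding) into an
    /// additive bias that broadcasts over (batch, heads, seq_len, seq_len) scores.
    fn attention_bias(padding_mask: &Tensor) -> Result<Tensor> {
        padding_mask
            .to_dtype(DType::F32)?
            // 1 -> 0.0 (keep), 0 -> -1e9 (effectively zero weight after softmax)
            .affine(1e9, -1e9)?
            // (batch, seq_len) -> (batch, 1, 1, seq_len)
            .unsqueeze(1)?
            .unsqueeze(1)
    }

The bias would then be combined with the raw scores via something like scores.broadcast_add(&attention_bias(&mask)?)? before the softmax.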
* Speed up bert with approx gelu (#1410) | Juarez Bochi | 2023-12-06 | 1 | -2/+9
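Note: the speed-up comes from using the tanh-based GELU approximation rather than the exact erf-based definition. A rough sketch of the formula involved (plain Rust for illustration, not the commit's code, which just selects the approximate activation):

    // gelu(x) ~= 0.5 * x * (1 + tanh(sqrt(2/pi) * (x + 0.044715 * x^3)))
    fn gelu_approx(x: f32) -> f32 {
        const SQRT_2_OVER_PI: f32 = 0.797_884_6;
        0.5 * x * (1.0 + (SQRT_2_OVER_PI * (x + 0.044715 * x * x * x)).tanh())
    }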
* Handle LongStorage in pytorch checkpoints. (#1152) | Laurent Mazare | 2023-10-22 | 1 | -27/+20
* Adapt more examples to the updated safetensor api. (#947) | Laurent Mazare | 2023-09-23 | 1 | -3/+2
    * Simplify the safetensor usage.
    * Convert more examples.
    * Move more examples.
    * Adapt stable-diffusion.
* Move some models to candle-transformers so that it's easier to re-use. (#794) | Laurent Mazare | 2023-09-10 | 1 | -2/+1
    * Move some models to candle-transformers so that they can be shared.
    * Also move falcon.
    * Move Llama.
    * Move whisper (partial).
* Upgrading hf-hub. | Nicolas Patry | 2023-08-29 | 1 | -4/+4
* Add a yolo-v3 example. (#528) | Laurent Mazare | 2023-08-20 | 1 | -0/+3
    * Add a couple functions required for yolo.
    * Add the yolo-v3 example.
    * Add minimum and maximum.
    * Use the newly introduced maximum.
    * Cuda support for min/max + add some testing.
    * Allow for more tests to work with accelerate.
    * Fix a typo.
* Add a cuda kernel for upsampling. (#441) | Laurent Mazare | 2023-08-14 | 1 | -1/+4
    * Add a cuda kernel for upsampling.
    * Update for the latest tokenizers version.
* Normalize embeddings in the bert example. (#390) | Laurent Mazare | 2023-08-10 | 1 | -0/+14
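Note: normalizing here means dividing each sentence embedding by its L2 norm, so that the cosine similarities computed later reduce to dot products. Roughly along these lines (a sketch, not necessarily the exact code added by the commit):

    use candle::{Result, Tensor};

    /// L2-normalize each row of a (n_sentences, hidden_size) embedding matrix.
    fn normalize_l2(v: &Tensor) -> Result<Tensor> {
        v.broadcast_div(&v.sqr()?.sum_keepdim(1)?.sqrt()?)
    }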
* Upgrading hf-hub to `0.2.0` (Modified API to not pass the Repo around all the time) | Nicolas Patry | 2023-07-27 | 1 | -3/+4
* Rename the .r functions to .dims so as to be a bit more explicit. (#220) | Laurent Mazare | 2023-07-22 | 1 | -1/+1
* Removing `candle-hub` internal to extract into `hf-hub` standalone. | Nicolas Patry | 2023-07-19 | 1 | -1/+1
* Use mkl to accelerate binary ops. (#190) | Laurent Mazare | 2023-07-18 | 1 | -2/+5
    * Vectorized binary ops with mkl.
    * Improve the binary op mkl support.
    * Push the support for mkl binary ops.
    * Proper vectorization of binary ops.
    * Proper mkl'isation when broadcasting binary ops.
* Bert tracing (#184) | Laurent Mazare | 2023-07-17 | 1 | -461/+20
    * Add some tracing to bert.
    * More tracing.
    * Add a flag for tracing.
* Add some 'cuda-if-available' helper function. (#172) | Laurent Mazare | 2023-07-15 | 1 | -14/+1
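Note: such a helper mostly removes per-example device-selection boilerplate. Roughly what it boils down to (a sketch; the actual helper may handle more cases):

    use candle::{Device, Result};

    fn device(cpu: bool) -> Result<Device> {
        if cpu {
            Ok(Device::Cpu)
        } else {
            // Falls back to the CPU when no CUDA device is available.
            Device::cuda_if_available(0)
        }
    }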
* Removing cuda default. | Nicolas Patry | 2023-07-14 | 1 | -1/+10
    Seems very important for a lot of exploring users, who are usually on laptops without GPUs. Adding more README instructions in a follow-up.
* Simplify the parameters used by sum and sum_keepdim. (#165) | Laurent Mazare | 2023-07-14 | 1 | -1/+1
* Use the same default as pytorch for sum. (#164) | Laurent Mazare | 2023-07-13 | 1 | -4/+4
* Tensor mutability (#154) | Laurent Mazare | 2023-07-13 | 1 | -1/+1
    * Working towards tensor mutability.
    * Use a ref-cell to provide tensor mutability.
* Remove some dead-code pragmas. (#137) | Laurent Mazare | 2023-07-11 | 1 | -10/+2
* VarBuilder path creation (#131) | Laurent Mazare | 2023-07-10 | 1 | -72/+42
    * Use a struct for the safetensor+routing.
    * Group the path and the var-builder together.
    * Fix for the empty path case.
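Note: grouping the path with the var-builder is what lets model code push path segments instead of concatenating key strings by hand. In today's candle_nn API the idea looks roughly like this (a sketch; the layer path and shape are illustrative):

    use candle::{Result, Tensor};
    use candle_nn::VarBuilder;

    // Each `pp` call appends a prefix, so the weight below is looked up as
    // "encoder.layer.0.attention.self.query.weight" in the checkpoint.
    fn load_query_weight(vb: &VarBuilder, hidden: usize) -> Result<Tensor> {
        vb.pp("encoder")
            .pp("layer.0")
            .pp("attention.self.query")
            .get((hidden, hidden), "weight")
    }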
* Move the var-builder in a central place. (#130) | Laurent Mazare | 2023-07-10 | 1 | -61/+3
* [nn] Move the Embedding and Activation parts. (#116) | Laurent Mazare | 2023-07-10 | 1 | -30/+7
    * Share the Embedding and Activation parts.
    * Tweak some activations.
* Sketch the candle-nn crate. (#115) | Laurent Mazare | 2023-07-10 | 1 | -76/+33
    * Sketch the candle-nn crate.
    * Tweak the cuda dependencies.
    * More cuda tweaks.
* Merge pull request #92 from LaurentMazare/sync_hub | Nicolas Patry | 2023-07-07 | 1 | -8/+7
    Creating new sync Api for `candle-hub`.
* Creating new sync Api for `candle-hub`. | Nicolas Patry | 2023-07-06 | 1 | -8/+7
    - `api::Api` -> `api::tokio::api` (And created new `api::sync::Api`).
    - Remove `tokio` from all our examples.
    - Using similar codebase for now instead of ureq (for simplicity).
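Note: the practical effect for the examples is that weights can be fetched without spinning up a tokio runtime. A rough sketch of the sync flow (the repo id and file name below are illustrative):

    use hf_hub::api::sync::Api;

    fn fetch_weights() -> anyhow::Result<std::path::PathBuf> {
        let api = Api::new()?;
        let repo = api.model("bert-base-uncased".to_string());
        // Blocks until the file is downloaded into the local cache.
        Ok(repo.get("model.safetensors")?)
    }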
* Enabling `roberta` for the example (it's the same model as Bert, with just different naming.) | Nicolas Patry | 2023-07-06 | 1 | -8/+26
* Add mkl support for matrix multiply. (#86) | Laurent Mazare | 2023-07-06 | 1 | -0/+4
    * Fix some rebase issues.
    * Use mkl instead.
    * Use mkl in bert.
    * Add the optional mkl feature.
    * Conditional compilation based on the mkl feature.
    * Add more mkl support.
* Add a simpler way to specify the dim index for some ops. | laurent | 2023-07-05 | 1 | -2/+2
* Use avg pooling before the cosine similarity. | laurent | 2023-07-05 | 1 | -3/+4
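Note: average pooling collapses the per-token BERT outputs into one vector per sentence before the pairwise cosine similarities are computed. A sketch of both steps (shapes and names are illustrative, not the commit's exact code):

    use candle::{Result, Tensor};

    /// Mean-pool (n_tokens, hidden) token embeddings into a single sentence vector.
    fn mean_pool(token_embeddings: &Tensor) -> Result<Tensor> {
        let (n_tokens, _hidden) = token_embeddings.dims2()?;
        token_embeddings.sum(0)? / n_tokens as f64
    }

    /// Cosine similarity between two (f32) sentence vectors.
    fn cosine_similarity(a: &Tensor, b: &Tensor) -> Result<f32> {
        let dot = (a * b)?.sum_all()?.to_scalar::<f32>()?;
        let norm_a = (a * a)?.sum_all()?.to_scalar::<f32>()?;
        let norm_b = (b * b)?.sum_all()?.to_scalar::<f32>()?;
        Ok(dot / (norm_a.sqrt() * norm_b.sqrt()))
    }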
* Add some sentence similarity comparison to the bert example. | laurent | 2023-07-05 | 1 | -63/+122
* Some polish. | Nicolas Patry | 2023-07-05 | 1 | -7/+11
* Adding offline mode. | Nicolas Patry | 2023-07-05 | 1 | -11/+41
* Upgrading bert example to work with `bert-base-uncased`. | Nicolas Patry | 2023-07-04 | 1 | -16/+61
    - Always take weights from the hub
    - Optional `model_id` + `revision` to use safetensors version potentially
    - Optional loading for `bert-base-uncased` (`weight` vs `gamma`, see the sketch below).
    - Take the config from the hub.
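Note on the `weight` vs `gamma` point: older BERT checkpoints store the layer-norm parameters as `gamma`/`beta` rather than `weight`/`bias`. A hedged sketch of the kind of fallback involved (the helper name is made up; the real code may differ):

    use candle::{Result, Tensor};
    use candle_nn::VarBuilder;

    // Try the modern name first, then fall back to the legacy one.
    fn get_ln_weight(vb: &VarBuilder, size: usize) -> Result<Tensor> {
        vb.get(size, "weight").or_else(|_| vb.get(size, "gamma"))
    }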
* Add a batch dimension on the bert example. | laurent | 2023-07-04 | 1 | -11/+15
* Allow for batch dimensions in the embedding layer. | laurent | 2023-07-03 | 1 | -6/+14
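Note: supporting batch dimensions in an embedding lookup usually amounts to flattening the ids, gathering the corresponding rows, and restoring the original shape with the hidden size appended. A sketch (illustrative, not this commit's code):

    use candle::{Result, Tensor};

    fn embedding_forward(weights: &Tensor, ids: &Tensor, hidden: usize) -> Result<Tensor> {
        // (d1, ..., dn) ids -> flat ids -> gather -> (d1, ..., dn, hidden)
        let mut dims = ids.dims().to_vec();
        dims.push(hidden);
        weights.index_select(&ids.flatten_all()?, 0)?.reshape(dims)
    }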
* Minor tweaks. | laurent | 2023-07-03 | 1 | -1/+2
* Remove the fixed length hack. | laurent | 2023-07-03 | 1 | -1/+1
* Bugfix for transpose. | laurent | 2023-07-03 | 1 | -3/+4
* Fix the layer norm to properly handle bias. | laurent | 2023-07-03 | 1 | -11/+30
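Note: handling the bias means the normalized activations get both a learned scale and a learned offset. A minimal sketch of such a layer norm over the last dimension (illustrative, not the commit's exact code):

    use candle::{D, Result, Tensor};

    fn layer_norm(x: &Tensor, weight: &Tensor, bias: &Tensor, eps: f64) -> Result<Tensor> {
        let mean = x.mean_keepdim(D::Minus1)?;
        let x = x.broadcast_sub(&mean)?;
        let var = x.sqr()?.mean_keepdim(D::Minus1)?;
        let x = x.broadcast_div(&(var + eps)?.sqrt()?)?;
        // Scale and shift: weight and bias have the size of the last dimension.
        x.broadcast_mul(weight)?.broadcast_add(bias)
    }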
* Get some embeddings out. | laurent | 2023-07-03 | 1 | -14/+23
* Get the tensors to be loaded properly. | laurent | 2023-07-03 | 1 | -11/+72
* Add the varbuilder + check shapes. | laurent | 2023-07-03 | 1 | -62/+156
* Model creation. | laurent | 2023-07-03 | 1 | -6/+134
* Complete (?) the forward pass. | laurent | 2023-07-03 | 1 | -2/+28
* Add more to the forward pass. | laurent | 2023-07-03 | 1 | -9/+48
* Boilerplate code for Bert. | laurent | 2023-07-03 | 1 | -0/+239