Commit log for candle-examples/examples/llama2-c/main.rs
Each entry: commit message, author, date, files changed, lines removed/added.
* Explicit caching in llama2.c. [laurent, 2024-02-22, 1 file, -15/+15]
* Use the tokenizer-output-stream in the llama example. (#1715) [Laurent Mazare, 2024-02-15, 1 file, -7/+6]
* Quantized GGUF style (#1523) [Nicolas Patry, 2024-01-17, 1 file, -4/+4]
* Infer the config for llama2-c. (#1208) [Laurent Mazare, 2023-10-28, 1 file, -2/+12]
* Move the llama2-c model in transformers. (#1205) [Laurent Mazare, 2023-10-28, 1 file, -3/+3]
* Add a quantized variant of llama2.c (#1197) [Laurent Mazare, 2023-10-27, 1 file, -6/+54]
* Implement top_p / nucleus sampling (#819) [Juarez Bochi, 2023-09-12, 1 file, -1/+6]
* Add a repeat penalty to the llama2-c command line example. (#713) [Laurent Mazare, 2023-09-01, 1 file, -0/+18]
* Support the Accelerate BLAS on macOS. (#325) [Laurent Mazare, 2023-08-05, 1 file, -0/+3]
* Add the candle-datasets crate (#322) [Laurent Mazare, 2023-08-05, 1 file, -1/+1]
* Support safetensors weights in llama2.c inference. (#317) [Laurent Mazare, 2023-08-03, 1 file, -6/+17]
* Add training for the llama2.c example (#296) [Laurent Mazare, 2023-08-01, 1 file, -7/+33]
* Move the weight bits in a separate module. (#295) [Laurent Mazare, 2023-08-01, 1 file, -161/+6]
* Add some batcher variants that handle errors. (#294) [Laurent Mazare, 2023-08-01, 1 file, -4/+4]
* Add the batcher. (#293) [Laurent Mazare, 2023-08-01, 1 file, -18/+14]
* Use subcommands in llama2. (#292) [Laurent Mazare, 2023-08-01, 1 file, -100/+90]
* Pre-tokenized evaluation mode for llama2.c. (#291) [Laurent Mazare, 2023-08-01, 1 file, -30/+51]
* Evaluate on the pre-tokenized file. (#290) [Laurent Mazare, 2023-07-31, 1 file, -1/+58]
* Remove the end of text tokens. (#289) [Laurent Mazare, 2023-07-31, 1 file, -1/+2]
* Add an eval mode to llama2-c (#288) [Laurent Mazare, 2023-07-31, 1 file, -20/+75]
* Add a prompt and support more models in llama2-c. (#285) [Laurent Mazare, 2023-07-31, 1 file, -4/+22]
* Use the hub models for llama2.c (#284) [Laurent Mazare, 2023-07-31, 1 file, -25/+37]
* Use the binary decoder for llama2.c. (#230) [Laurent Mazare, 2023-07-24, 1 file, -57/+74]
* Add llama2.c as an example. (#229) [Laurent Mazare, 2023-07-24, 1 file, -0/+240]