forks/candle.git

	Commit message	Author	Age	Files	Lines
*	Add the StarCoder2 model. (#1779)	Laurent Mazare	2024-02-28	1	-1/+1
	* Add the StarCoder2 model.
	* Add the example code and get things to work.
	* And also tweak the readme.
*	Fix token generation in bilingual models (non-English outputs) (#1668)	Guoqing Bao	2024-02-06	1	-1/+1
	Co-authored-by: Guoqing Bao <guoqing.bao@enflame-tech.com>
*	Quantized version of mistral. (#1009)	Laurent Mazare	2023-09-30	1	-2/+14
	* Quantized version of mistral.
	* Integrate the quantized mistral variant.
	* Use the quantized weight files.
	* Tweak the quantization command.
	* Fix the dtype when computing the rotary embeddings.
	* Update the readme with the quantized version.
	* Fix the decoding of the remaining tokens.
*	Streaming mode for reporting the generated tokens (#1007)	Laurent Mazare	2023-09-30	1	-0/+74
	* Token streaming.
	* Use the token output stream.
	* Flush the output.
	* Ensure that the last characters get reported.