forks/candle.git -

	Commit message (Collapse)	Author	Age	Files	Lines
*	Documentation Pass for Models (#2617)	zachcp	2024-11-15	1	-0/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* links in chinese_clip * links for clip model * add mod docs for flux and llava * module doc for MMDIT and MIMI * add docs for a few more modesl * mod docs for bert naser and beit * add module docs for convmixer colpali codegeex and chatglm * add another series of moddocs * add fastvit-llama2_c * module docs mamba -> mobileone * module docs from moondream-phi3 * mod docs for quantized and qwen * update to yi * fix long names * Update llama2_c.rs * Update llama2_c_weights.rs * Fix the link for mimi + tweaks --------- Co-authored-by: Laurent Mazare <laurent.mazare@gmail.com>
*	Use cat for faster MQA computation. (#2043)	Laurent Mazare	2024-04-12	1	-14/+3
\| \| \| \| \| \| \| \| \|	* Use cat for faster MQA computation. * Move the function to utils + use it in mistral. * Use the shared repeat-kv in a few more models. * Fix.
*	Rustfmt fix. (#1788)	Laurent Mazare	2024-03-02	1	-1/+5
\|
*	Update StableLM config (#1787)	Frkri	2024-03-02	1	-3/+3
\|
*	Quantized support for stable-lm2. (#1654)	Laurent Mazare	2024-02-04	1	-4/+9
\| \| \| \| \|	* Quantized support for stable-lm2. * Quantized support for v2-zephyr.
*	Make more models cloneable. (#1203)	Laurent Mazare	2023-10-28	1	-4/+4
\|
*	Tracing for StableLM and quantized StableLM. (#1068)	Laurent Mazare	2023-10-10	1	-0/+12
\|
*	Move the common quantized-nn code to a shared module. (#1063)	Laurent Mazare	2023-10-09	1	-24/+1
\|
*	Quantized version of StableLM. (#1058)	Laurent Mazare	2023-10-08	1	-0/+299
	* Quantized version of StableLM. * Adapt the stable-lm example to support quantizsed. * Use some separate hub repo. * Another repo name tweak.