forks/candle.git

	Commit message	Author	Age	Files	Lines
*	Add the new gemma models. (#2023)	Laurent Mazare	2024-04-06	1	-0/+1
*	Fix the final rmsnorm for quantized-metavoice. (#2021)	Laurent Mazare	2024-04-06	1	-0/+1
*	Faster mask implementation for mixformers. (#2017)	Laurent Mazare	2024-04-05	1	-21/+6
*	Moondream tracing. (#2016)	Laurent Mazare	2024-04-05	2	-13/+48
*	Add the rope THD kernel. (#2014)	Laurent Mazare	2024-04-05	1	-22/+6
*	Use F16 for moondream on cuda. (#2013)	Laurent Mazare	2024-04-04	1	-5/+8
*	Include topk sampling in the quantized example. (#2005)	Laurent Mazare	2024-04-04	1	-1/+25
*	Relax the contiguous check for cuda kernels. (#2000)	Laurent Mazare	2024-04-03	1	-1/+2
*	Improve the handling of matmul with squeezed layouts. (#1998)	Laurent Mazare	2024-04-02	1	-1/+1
*	Match Moondream's latest release (#1997)	Santiago Medina	2024-04-02	1	-1/+1
*	first commit (#1994)	Jorge António	2024-04-02	1	-1/+2
*	Stable diffusion fix. (#1993)	Laurent Mazare	2024-04-02	1	-1/+3
*	Expose the t5 config fields + allow t5-large. (#1987)	Laurent Mazare	2024-04-01	1	-16/+16
*	Quantized moondream implementation and BOS token (#1980)	Santiago Medina	2024-04-01	5	-16/+316
*	Add options to use local files + specify a custom repo or branch. (#1973)	Laurent Mazare	2024-03-31	1	-13/+15
*	Add Moondream transformer implementation and example (#1970)	Santiago Medina	2024-03-31	3	-0/+329
*	Remove some unnecessary calls to contiguous. (#1968)	Laurent Mazare	2024-03-30	1	-4/+10
*	Qwen MoE model. (#1960)	Laurent Mazare	2024-03-28	2	-0/+489
*	Fix clippy lints + minor cleanups. (#1957)	Laurent Mazare	2024-03-28	4	-100/+41
*	CLIP model implementation with example (#1950)	Tigran Zhampeissov	2024-03-28	4	-0/+694
*	add send and sync trait bounds for scheduler config in stable diffusion model...	Jorge António	2024-03-28	1	-1/+1
*	add config for mamba 2.8b model parameter (#1946)	Jorge António	2024-03-27	1	-4/+4
*	Another fix for squeezing. (#1943)	Laurent Mazare	2024-03-26	1	-2/+2
*	Faster repeat penalty (#1940)	Laurent Mazare	2024-03-26	1	-3/+7
*	Use the new rope kernel in mistral. (#1937)	Laurent Mazare	2024-03-25	2	-28/+12
*	Avoid the attention mask where possible. (#1933)	Laurent Mazare	2024-03-25	3	-16/+32
*	Fast kernels for rotary embeddings. (#1928)	Laurent Mazare	2024-03-24	1	-26/+5
*	Also avoid the mask in the llama example.	laurent	2024-03-24	1	-2/+6
*	Avoid using the attn mask when not necessary.	laurent	2024-03-24	1	-5/+19
*	Support more mistral models. (#1927)	Laurent Mazare	2024-03-24	2	-24/+31
*	Allow for arbitrary temperature modifications.	laurent	2024-03-23	1	-1/+7
*	Add topk sampling. (#1923)	Laurent Mazare	2024-03-23	2	-24/+88
*	Avoid broadcasting on the batch dimension for the attention mask. (#1920)	Laurent Mazare	2024-03-23	2	-8/+6
*	Fix loading the gguf files. (#1913)	Laurent Mazare	2024-03-22	1	-1/+1
*	Fix for the llama model. (#1906)	Laurent Mazare	2024-03-21	1	-1/+1
*	Use the fast RmsNorm in the quantized model. (#1904)	Laurent Mazare	2024-03-21	3	-35/+21
*	Avoid copying the data on squeeze and unsqueeze. (#1884)	Laurent Mazare	2024-03-20	2	-2/+2
*	Use a common with_tracing::RmsNorm in a few models. (#1871)	Jani Monoses	2024-03-18	6	-111/+29
*	Expose some helper functions to create quantized models. (#1837)	Laurent Mazare	2024-03-12	3	-0/+15
*	Add some tracing to metavoice. (#1826)	Laurent Mazare	2024-03-09	2	-8/+82
*	Quantized version of the metavoice model. (#1824)	Laurent Mazare	2024-03-09	4	-4/+241
*	Add a flag to select the dtype used in metavoice. (#1805)	Laurent Mazare	2024-03-05	2	-5/+13
*	Speaker embeddings computation for metavoice. (#1800)	Laurent Mazare	2024-03-04	2	-23/+109
*	Add an initial Segformer implementation (#1617)	Jiayu Liu	2024-03-03	2	-0/+706
*	More metavoice tweaks. (#1796)	Laurent Mazare	2024-03-03	1	-1/+1
*	Metavoice - first cut (#1717)	Laurent Mazare	2024-03-02	3	-0/+880
*	Rustfmt fix. (#1788)	Laurent Mazare	2024-03-02	2	-3/+10
*	Update StableLM config (#1787)	Frkri	2024-03-02	2	-12/+12
*	EfficientVit (MSRA) model (#1783)	Jani Monoses	2024-03-01	2	-0/+461
*	add models of rwkv v6 and quantized rwkv v6 (#1781)	Jack Shih	2024-03-01	3	-0/+629