forks/candle.git -

	Commit message (Expand)	Author	Age	Files	Lines
...
*	Add some missing index-select metal kernels. (#2613)	Laurent Mazare	2024-11-12	3	-2/+16
*	Bump the crate version to 0.8.0. (#2612)	Laurent Mazare	2024-11-12	5	-16/+16
*	Update docs (#2553)	zachcp	2024-11-11	10	-0/+48
*	Add some fast Metal MLX SDPA kernels (#2584)	Eric Buehler	2024-11-05	7	-15/+2007
*	Improved launch config for layer-norm/rms-norm. (#2591)	Laurent Mazare	2024-11-04	3	-12/+72
*	Add the SmolLM2 models. (#2595)	Laurent Mazare	2024-11-03	3	-18/+73
*	Support Skip Layer Guidance (SLG) for Stable Diffusion 3.5 Medium (#2590)	Czxck001	2024-11-01	3	-10/+79
*	Lazy upcasting for t5. (#2589)	Laurent Mazare	2024-10-30	3	-34/+59
*	Support sd3.5 medium and MMDiT-X (#2587)	Czxck001	2024-10-30	4	-35/+269
*	Reduce memory usage for sd 3.5. (#2582)	Laurent Mazare	2024-10-28	1	-0/+2
*	Release the mmdit model earlier to reduce memory usage. (#2581)	Laurent Mazare	2024-10-28	1	-16/+17
*	UG metal integration. (#2580)	Laurent Mazare	2024-10-27	8	-18/+92
*	Support for UG kernels. (#2579)	Laurent Mazare	2024-10-27	8	-2/+139
*	Stable diffusion 3.5 support. (#2578)	Laurent Mazare	2024-10-27	5	-85/+209
*	Update README.md (#2577)	sashaphmn	2024-10-26	1	-1/+2
*	Fix the repo name for llama 3.1. (#2576)	Laurent Mazare	2024-10-26	3	-7/+7
*	use softmax_last_dim (metal and cuda kernel) in llama attention layer (#2572)	Zack Angelo	2024-10-23	1	-1/+2
*	ONNX: GatherElements, Xor (#2568)	Anubhab Bandyopadhyay	2024-10-17	2	-0/+582
*	Testcases (#2567)	Anubhab Bandyopadhyay	2024-10-17	2	-3/+278
*	onnx: ReduceMin/Max Ops (#2563)	Anubhab Bandyopadhyay	2024-10-15	2	-1/+1211
*	Enable stable-diffusion 3 on metal. (#2560)	Laurent Mazare	2024-10-14	4	-12/+11
*	Adds support for Stella_en_v5 embedding model - 1.5B variant (#2551)	Anubhab Bandyopadhyay	2024-10-13	4	-0/+804
*	fix: Allow marian configs to deserialize from json. (#2556)	Mikarific	2024-10-13	1	-1/+2
*	Fix the guide to gain access to Stable Diffusion 3 Medium (#2559)	Czxck001	2024-10-13	1	-2/+9
*	Add Stable Diffusion 3 Example (#2558)	Czxck001	2024-10-13	16	-34/+751
*	feat: intergrate chinese clip and add example (#2555)	SethWen	2024-10-10	5	-0/+1358
*	Add BertForMaskedLM to support SPLADE Models (#2550)	Akshay Ballal	2024-10-07	3	-0/+335
*	improve (#2548)	Jorge António	2024-10-07	1	-0/+1
*	Switch to using the MLX matmul by default. (#2547)	Laurent Mazare	2024-10-06	1	-3/+3
*	pyo3 update. (#2545)	Laurent Mazare	2024-10-06	5	-27/+22
*	Tensor tools print all (#2543)	Laurent Mazare	2024-10-05	1	-0/+29
*	Add required feature for whisper example in Readme (#2539)	dengelt	2024-10-04	1	-1/+1
*	Make the RNN configs accessible from the models. (#2541)	Laurent Mazare	2024-10-04	3	-74/+103
*	Fix for cudnn bf16 conv2d. (#2535)	Laurent Mazare	2024-10-02	2	-10/+14
*	Support whisper large-v3 turbo in the whisper-microphone example. (#2533)	Laurent Mazare	2024-10-02	1	-0/+3
*	Add support for cuda streams. (#2532)	Laurent Mazare	2024-10-02	3	-0/+24
*	Add whisper large-v3 turbo to the example. (#2531)	Laurent Mazare	2024-10-02	1	-0/+3
*	Add a seed to the flux example. (#2529)	Laurent Mazare	2024-10-02	1	-3/+10
*	Tweak some metal tests. (#2528)	Laurent Mazare	2024-10-02	2	-62/+23
*	Efficient implementation of `Tensor::ones()` for `metal` (#2512)	Anubhab Bandyopadhyay	2024-10-01	5	-4/+194
*	Cuda quantized mmv bugfix. (#2526)	Laurent Mazare	2024-10-01	1	-1/+25
*	Add ColPali (#2524)	Akshay Ballal	2024-10-01	7	-1/+394
*	Refactor the whisper microphone example. (#2523)	Laurent Mazare	2024-10-01	2	-82/+74
*	Add/lstm direction (#2455)	Justin Sing	2024-09-30	1	-8/+25
*	Yet another cuda qmm padding fix. (#2509)	Laurent Mazare	2024-09-30	1	-25/+55
*	Pixtral polishing. (#2522)	Laurent Mazare	2024-09-30	2	-12/+29
*	Add Pixtral. (#2521)	Laurent Mazare	2024-09-30	9	-19/+822
*	Add PaliGemma. (#2519)	Laurent Mazare	2024-09-29	5	-0/+434
*	Paligemma siglip vision config (#2518)	Laurent Mazare	2024-09-29	1	-0/+54
*	Bump the crate version to 0.7.2. (#2517)	Laurent Mazare	2024-09-29	5	-16/+16