Commit message | Author | Date | Files | Lines
* Clippy fixes for the cuda feature. (#2650) | Laurent Mazare | 2024-11-29 | 2 | -11/+11
* Adds support for stella_en_v5 embedding model -400M variant (#2608) | iskng | 2024-11-29 | 3 | -112/+555
* Lint fixes introduced with Rust 1.83 (#2646) | Anubhab Bandyopadhyay | 2024-11-28 | 19 | -55/+57
* Fix for whisper-microphone example failure if audio isn't chunk aligned (#2645) | Adam Nelson | 2024-11-27 | 1 | -3/+17
* Onnx Support for Sign operation #2641 (#2642) | Ionut Mihalcea | 2024-11-26 | 2 | -0/+47
* Provide a method to allow PTH files with state maps to be loaded. (#2639) | zachcp | 2024-11-26 | 1 | -1/+11
* fix typo (#2606) | Andrei Fajardo | 2024-11-23 | 1 | -1/+1
* Tweak the CI to avoid running out of disk space. (#2630) | Laurent Mazare | 2024-11-19 | 1 | -0/+3
* 20241118 docs (#2629) | zachcp | 2024-11-19 | 27 | -12/+72
* Import the ggml_cuda_dp4a function. (#2628) | Laurent Mazare | 2024-11-19 | 1 | -33/+44
* Fix for clippy. (#2626) | Laurent Mazare | 2024-11-18 | 1 | -1/+1
* Module Docs (#2624) | zachcp | 2024-11-18 | 39 | -115/+170
* More Model Module Docs (#2623) | zachcp | 2024-11-17 | 12 | -72/+291
* Module Docs (#2620) | zachcp | 2024-11-16 | 5 | -10/+126
* Remove some unused macros. (#2618) | Laurent Mazare | 2024-11-15 | 9 | -14/+13
* Documentation Pass for Models (#2617) | zachcp | 2024-11-15 | 94 | -51/+1001
* Add max-all/min-all. (#2616) | Laurent Mazare | 2024-11-14 | 1 | -0/+36
* Add some missing index-select metal kernels. (#2613) | Laurent Mazare | 2024-11-12 | 3 | -2/+16
* Bump the crate version to 0.8.0. (#2612) | Laurent Mazare | 2024-11-12 | 5 | -16/+16
* Update docs (#2553) | zachcp | 2024-11-11 | 10 | -0/+48
* Add some fast Metal MLX SDPA kernels (#2584) | Eric Buehler | 2024-11-05 | 7 | -15/+2007
* Improved launch config for layer-norm/rms-norm. (#2591) | Laurent Mazare | 2024-11-04 | 3 | -12/+72
* Add the SmolLM2 models. (#2595) | Laurent Mazare | 2024-11-03 | 3 | -18/+73
* Support Skip Layer Guidance (SLG) for Stable Diffusion 3.5 Medium (#2590) | Czxck001 | 2024-11-01 | 3 | -10/+79
* Lazy upcasting for t5. (#2589) | Laurent Mazare | 2024-10-30 | 3 | -34/+59
* Support sd3.5 medium and MMDiT-X (#2587) | Czxck001 | 2024-10-30 | 4 | -35/+269
* Reduce memory usage for sd 3.5. (#2582) | Laurent Mazare | 2024-10-28 | 1 | -0/+2
* Release the mmdit model earlier to reduce memory usage. (#2581) | Laurent Mazare | 2024-10-28 | 1 | -16/+17
* UG metal integration. (#2580) | Laurent Mazare | 2024-10-27 | 8 | -18/+92
* Support for UG kernels. (#2579) | Laurent Mazare | 2024-10-27 | 8 | -2/+139
* Stable diffusion 3.5 support. (#2578) | Laurent Mazare | 2024-10-27 | 5 | -85/+209
* Update README.md (#2577) | sashaphmn | 2024-10-26 | 1 | -1/+2
* Fix the repo name for llama 3.1. (#2576) | Laurent Mazare | 2024-10-26 | 3 | -7/+7
* use softmax_last_dim (metal and cuda kernel) in llama attention layer (#2572) | Zack Angelo | 2024-10-23 | 1 | -1/+2
* ONNX: GatherElements, Xor (#2568) | Anubhab Bandyopadhyay | 2024-10-17 | 2 | -0/+582
* Testcases (#2567) | Anubhab Bandyopadhyay | 2024-10-17 | 2 | -3/+278
* onnx: ReduceMin/Max Ops (#2563) | Anubhab Bandyopadhyay | 2024-10-15 | 2 | -1/+1211
* Enable stable-diffusion 3 on metal. (#2560) | Laurent Mazare | 2024-10-14 | 4 | -12/+11
* Adds support for Stella_en_v5 embedding model - 1.5B variant (#2551) | Anubhab Bandyopadhyay | 2024-10-13 | 4 | -0/+804
* fix: Allow marian configs to deserialize from json. (#2556) | Mikarific | 2024-10-13 | 1 | -1/+2
* Fix the guide to gain access to Stable Diffusion 3 Medium (#2559) | Czxck001 | 2024-10-13 | 1 | -2/+9
* Add Stable Diffusion 3 Example (#2558) | Czxck001 | 2024-10-13 | 16 | -34/+751
* feat: intergrate chinese clip and add example (#2555) | SethWen | 2024-10-10 | 5 | -0/+1358
* Add BertForMaskedLM to support SPLADE Models (#2550) | Akshay Ballal | 2024-10-07 | 3 | -0/+335
* improve (#2548) | Jorge António | 2024-10-07 | 1 | -0/+1
* Switch to using the MLX matmul by default. (#2547) | Laurent Mazare | 2024-10-06 | 1 | -3/+3
* pyo3 update. (#2545) | Laurent Mazare | 2024-10-06 | 5 | -27/+22
* Tensor tools print all (#2543) | Laurent Mazare | 2024-10-05 | 1 | -0/+29
* Add required feature for whisper example in Readme (#2539) | dengelt | 2024-10-04 | 1 | -1/+1
* Make the RNN configs accessible from the models. (#2541) | Laurent Mazare | 2024-10-04 | 3 | -74/+103