summaryrefslogtreecommitdiff
path: root/candle-transformers/src/models/quantized_mixformer.rs
Commit message (Expand)AuthorAgeFilesLines
* Quantized moondream implementation and BOS token (#1980)Santiago Medina2024-04-011-0/+24
* Quantized GGUF style (#1523)Nicolas Patry2024-01-171-2/+2
* Quantized version for phi-v2. (#1430)Laurent Mazare2023-12-131-0/+18
* More model cloning. (#1126)Laurent Mazare2023-10-181-7/+7
* Move the common quantized-nn code to a shared module. (#1063)Laurent Mazare2023-10-091-34/+3
* Use softmax-last-dim where possible. (#1057)Laurent Mazare2023-10-081-1/+1
* Expose a function to clear the KV cache on mixformers. (#964)Laurent Mazare2023-09-261-0/+12
* Add the quantized mixformer model. (#953)Laurent Mazare2023-09-241-0/+344