summaryrefslogtreecommitdiff
path: root/candle-core/src/quantized/metal.rs
Commit message (Expand)AuthorAgeFilesLines
* Handle multiple dimensions in metal QMM + two fixes. (#2097)Laurent Mazare2024-04-201-15/+20
* Handle the batch dimension in quantized MMV on metal. (#2022)Laurent Mazare2024-04-061-1/+4
* Improve metal buffer usage (#1807)ivarflakstad2024-03-071-2/+7
* Cuda acceleration for quantized model. (#1754)Laurent Mazare2024-02-251-35/+18
* Qmetal tweaks (#1704)Laurent Mazare2024-02-131-9/+86
* Fixing quantized llama demo on metal. (#1703)Nicolas Patry2024-02-131-0/+4
* Quantized GGUF style (#1523)Nicolas Patry2024-01-171-0/+153