summaryrefslogtreecommitdiff
path: root/candle-core/src/quantized/ggml_file.rs
Commit message (Expand)AuthorAgeFilesLines
* 20241118 docs (#2629)zachcp2024-11-191-1/+1
* Cuda acceleration for quantized model. (#1754)Laurent Mazare2024-02-251-9/+2
* Fixing quantized llama demo on metal. (#1703)Nicolas Patry2024-02-131-0/+3
* Quantized GGUF style (#1523)Nicolas Patry2024-01-171-23/+61
* Avoid some overflows on wasm32. (#968)Laurent Mazare2023-09-261-1/+7
* Tensor -> QTensor conversion (#496)Laurent Mazare2023-08-181-1/+1
* Get the ggml based llama to generate some text. (#464)Laurent Mazare2023-08-161-4/+14
* Add quantized tensors. (#458)Laurent Mazare2023-08-151-105/+26
* Split out the quantized file. (#456)Laurent Mazare2023-08-151-0/+294