path: root/candle-transformers/src/quantized_nn.rs
Commit message | Author | Age | Files | Lines
* 20241118 docs (#2629) | zachcp | 2024-11-19 | 1 | -0/+6
    * module docs
    * varbuilder gguf docs
    * add a link to gguf files
    * small additional mod doc titles
    * safetensor docs
    * more core docs
    * more module docs in candle_core
    * 2 more link fixes
* Use the fast RmsNorm in the quantized model. (#1904) | Laurent Mazare | 2024-03-21 | 1 | -8/+12
* Expose some helper functions to create quantized models. (#1837) | Laurent Mazare | 2024-03-12 | 1 | -0/+8
* Quantized version of the metavoice model. (#1824) | Laurent Mazare | 2024-03-09 | 1 | -0/+10
    * Quantized version of the metavoice model.
    * Integrate the quantized version of metavoice.
* Add a quantized blip model. (#1155) | Laurent Mazare | 2023-10-22 | 1 | -0/+6
    * Add a quantized blip model.
    * Integrate the quantized blip model to the actual example.
* More model cloning. (#1126) | Laurent Mazare | 2023-10-18 | 1 | -3/+3
    * More model cloning.
    * More cloning on quantized models.
* Add the quantized mpt model. (#1123) | Laurent Mazare | 2023-10-18 | 1 | -0/+5
    * Add the quantized mpt model.
    * Support the quantized model for replit-code.
* Move the common quantized-nn code to a shared module. (#1063) | Laurent Mazare | 2023-10-09 | 1 | -0/+87