Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | 20241118 docs (#2629) | zachcp | 2024-11-19 | 1 | -0/+6 |
| | | | | | | | | | | | | | | | | | * module docs * varbuilder gguf docs * add a link to gguf files * small additonal mod doc titles * safetensor docs * more core docs * more module docs in canlde_core * 2 more link fixes | ||||
* | Use the fast RmsNorm in the quantized model. (#1904) | Laurent Mazare | 2024-03-21 | 1 | -8/+12 |
| | |||||
* | Expose some helper functions to create quantized models. (#1837) | Laurent Mazare | 2024-03-12 | 1 | -0/+8 |
| | |||||
* | Quantized version of the metavoice model. (#1824) | Laurent Mazare | 2024-03-09 | 1 | -0/+10 |
| | | | | | * Quantized version of the metavoice model. * Integrate the quantized version of metavoice. | ||||
* | Add a quantized blip model. (#1155) | Laurent Mazare | 2023-10-22 | 1 | -0/+6 |
| | | | | | * Add a quantized blip model. * Integrate the quantized blip model to the actual example. | ||||
* | More model cloning. (#1126) | Laurent Mazare | 2023-10-18 | 1 | -3/+3 |
| | | | | | * More model cloning. * More cloning on quantized models. | ||||
* | Add the quantized mpt model. (#1123) | Laurent Mazare | 2023-10-18 | 1 | -0/+5 |
| | | | | | * Add the quantized mpt model. * Support the quantized model for replit-code. | ||||
* | Move the common quantized-nn code to a shared module. (#1063) | Laurent Mazare | 2023-10-09 | 1 | -0/+87 |