Commit message (Expand) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | 20241118 docs (#2629) | zachcp | 2024-11-19 | 1 | -1/+1 |
* | Cuda acceleration for quantized model. (#1754) | Laurent Mazare | 2024-02-25 | 1 | -9/+2 |
* | Fixing quantized llama demo on metal. (#1703) | Nicolas Patry | 2024-02-13 | 1 | -0/+3 |
* | Quantized GGUF style (#1523) | Nicolas Patry | 2024-01-17 | 1 | -23/+61 |
* | Avoid some overflows on wasm32. (#968) | Laurent Mazare | 2023-09-26 | 1 | -1/+7 |
* | Tensor -> QTensor conversion (#496) | Laurent Mazare | 2023-08-18 | 1 | -1/+1 |
* | Get the ggml based llama to generate some text. (#464) | Laurent Mazare | 2023-08-16 | 1 | -4/+14 |
* | Add quantized tensors. (#458) | Laurent Mazare | 2023-08-15 | 1 | -105/+26 |
* | Split out the quantized file. (#456) | Laurent Mazare | 2023-08-15 | 1 | -0/+294 |