Commit message (Expand) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Documentation Pass for Models (#2617) | zachcp | 2024-11-15 | 1 | -0/+15 |
* | Use cat for faster MQA computation. (#2043) | Laurent Mazare | 2024-04-12 | 1 | -14/+3 |
* | Rustfmt fix. (#1788) | Laurent Mazare | 2024-03-02 | 1 | -1/+5 |
* | Update StableLM config (#1787) | Frkri | 2024-03-02 | 1 | -3/+3 |
* | Quantized support for stable-lm2. (#1654) | Laurent Mazare | 2024-02-04 | 1 | -4/+9 |
* | Make more models cloneable. (#1203) | Laurent Mazare | 2023-10-28 | 1 | -4/+4 |
* | Tracing for StableLM and quantized StableLM. (#1068) | Laurent Mazare | 2023-10-10 | 1 | -0/+12 |
* | Move the common quantized-nn code to a shared module. (#1063) | Laurent Mazare | 2023-10-09 | 1 | -24/+1 |
* | Quantized version of StableLM. (#1058) | Laurent Mazare | 2023-10-08 | 1 | -0/+299 |