Commit message (Expand) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Simplify the KvCache api. (#2207) | Laurent Mazare | 2024-05-23 | 1 | -1/+0 |
* | Support flash-attn in quantized phi3. (#2194) | Laurent Mazare | 2024-05-18 | 1 | -1/+10 |
* | Add a slice_set op. (#2193) | Laurent Mazare | 2024-05-18 | 1 | -1/+1 |
* | Separate quantized phi-3 implementation. (#2157) | Laurent Mazare | 2024-05-04 | 1 | -3/+15 |
* | Pin the version used for the quantized phi 3 gguf file. (#2156) | Laurent Mazare | 2024-05-03 | 1 | -4/+9 |
* | Add the phi-v3 quantized model. (#2118) | Laurent Mazare | 2024-04-24 | 1 | -4/+31 |
* | Updated quantized phi model (#2099) | Laurent Mazare | 2024-04-21 | 1 | -0/+273 |