summaryrefslogtreecommitdiff
path: root/candle-examples/examples/quantized-phi
Commit message (Expand)AuthorAgeFilesLines
* Simplify the KvCache api. (#2207)Laurent Mazare2024-05-231-1/+0
* Support flash-attn in quantized phi3. (#2194)Laurent Mazare2024-05-181-1/+10
* Add a slice_set op. (#2193)Laurent Mazare2024-05-181-1/+1
* Separate quantized phi-3 implementation. (#2157)Laurent Mazare2024-05-041-3/+15
* Pin the version used for the quantized phi 3 gguf file. (#2156)Laurent Mazare2024-05-031-4/+9
* Add the phi-v3 quantized model. (#2118)Laurent Mazare2024-04-241-4/+31
* Updated quantized phi model (#2099)Laurent Mazare2024-04-211-0/+273