Commit message (Expand) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Documentation Pass for Models (#2617) | zachcp | 2024-11-15 | 1 | -0/+15 |
* | Avoid a contiguous call in the quantized phi 3 model. (#2209) | Laurent Mazare | 2024-05-23 | 1 | -1/+1 |
* | Simplify the KvCache api. (#2207) | Laurent Mazare | 2024-05-23 | 1 | -7/+1 |
* | Support flash-attn in quantized phi3. (#2194) | Laurent Mazare | 2024-05-18 | 1 | -10/+40 |
* | Add a slice_set op. (#2193) | Laurent Mazare | 2024-05-18 | 1 | -22/+19 |
* | Separate quantized phi-3 implementation. (#2157) | Laurent Mazare | 2024-05-04 | 1 | -0/+301 |