| | Commit message | Author | Date | Files | Lines |
---|---|---|---|---|---|
* | Update docs (#2553) Details: Add module docs for candle-core. Doc each of the candle-nn modules and add the links to the doc page. | zachcp | 2024-11-11 | 1 | -0/+2 |
* | Add a RotatingKVCache. (#2493) Details: Add some KvCache tests. Test the reset too. More kv-cache testing. More tests for the rotating kv-cache. Improve the api for the rotating cache so that the whole src tensor gets returned when it's overlarge. Handle contiguity + bugfix + use in mimi. Add a way to test the mimi streaming mode. Mimi streaming fixes. More rotating kv-cache. Fix the attn mask generation. Handle the abs case. Add some tests for the generated mask. | Laurent Mazare | 2024-09-23 | 1 | -1/+223 |
* | Simplify the KvCache api. (#2207) | Laurent Mazare | 2024-05-23 | 1 | -36/+53 |
* | Add a couple kv-cache helper functions. (#2206) | Laurent Mazare | 2024-05-23 | 1 | -0/+29 |
* | Add a slice_set op. (#2193) Details: Add some testing. Add the dedicated kv-cache module. Derive debug and clone. Expose more kv-cache functions. Return the current data when appending. Use the new cache in the quantized phi3 model. | Laurent Mazare | 2024-05-18 | 1 | -0/+101 |
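
The notes for #2493 say the rotating cache returns the whole source tensor when the input is overlarge, and the notes for #2193 say appending returns the current data. A minimal sketch of those two behaviours follows, assuming a ring buffer over a flat `Vec<f32>` rather than candle tensors. `RingCache`, `new`, `append`, and `reset` are hypothetical names used only for illustration; this is not the candle-nn API, and the real implementation may differ.

```rust
/// Hypothetical fixed-capacity cache over a 1-D sequence (illustration only,
/// not the candle-nn type).
#[derive(Debug, Clone)]
struct RingCache {
    data: Vec<f32>,  // backing storage, never longer than `capacity`
    capacity: usize, // maximum number of retained positions
    offset: usize,   // next write position once the storage is full
}

impl RingCache {
    fn new(capacity: usize) -> Self {
        assert!(capacity > 0, "capacity must be non-zero");
        Self { data: Vec::with_capacity(capacity), capacity, offset: 0 }
    }

    /// Appends `src` and returns the values visible after the append.
    /// If `src` alone is at least as long as the capacity, all of `src` is
    /// returned while only its last `capacity` elements are retained.
    fn append(&mut self, src: &[f32]) -> Vec<f32> {
        if src.len() >= self.capacity {
            self.data = src[src.len() - self.capacity..].to_vec();
            self.offset = 0;
            return src.to_vec();
        }
        for &x in src {
            if self.data.len() < self.capacity {
                // Still filling the storage for the first time.
                self.data.push(x);
            } else {
                // Storage is full: overwrite the oldest slot.
                self.data[self.offset] = x;
            }
            self.offset = (self.offset + 1) % self.capacity;
        }
        // Returned in storage order, which is not chronological after a wrap.
        self.data.clone()
    }

    /// Drops all cached values.
    fn reset(&mut self) {
        self.data.clear();
        self.offset = 0;
    }
}
```

With a capacity of 4, appending three values and then two more overwrites the oldest slot, so the second call returns the values in storage order rather than chronological order. Whether the real cache orders its output this way is not stated in the log.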