path: root/candle-nn/src/kv_cache.rs
Commit message | Author | Age | Files | Lines

* Update docs (#2553) | zachcp | 2024-11-11 | 1 | -0/+2
    * add module docs for candle-core
    * doc each of the candle-nn modules and add the links to the doc page
* Add a RotatingKVCache. (#2493) | Laurent Mazare | 2024-09-23 | 1 | -1/+223
    * Add a RotatingKVCache.
    * Add some KvCache tests.
    * Test the reset too.
    * More kv-cache testing.
    * More tests for the rotating kv-cache.
    * Improve the api for the rotating cache so that the whole src tensor gets returned when it's overlarge.
    * Handle contiguity + bugfix + use in mimi.
    * Add a way to test the mimi streaming mode.
    * Mimi streaming fixes.
    * More rotating kv-cache.
    * Fix the attn mask generation.
    * Handle the abs case.
    * Add some tests for the generated mask.
* Simplify the KvCache api. (#2207) | Laurent Mazare | 2024-05-23 | 1 | -36/+53
* Add a couple kv-cache helper functions. (#2206) | Laurent Mazare | 2024-05-23 | 1 | -0/+29
* Add a slice_set op. (#2193) | Laurent Mazare | 2024-05-18 | 1 | -0/+101
    * Add a slice_set op.
    * Add some testing.
    * Add the dedicated kv-cache module.
    * Derive debug and clone.
    * Expose more kv-cache functions.
    * Return the current data when appending.
    * Use the new cache in the quantized phi3 model.