author | Laurent Mazare <laurent.mazare@gmail.com> | 2024-05-23 21:24:55 +0200 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-05-23 21:24:55 +0200 |
commit | d54e02d73de3391b34d4511aa7add32f9cffd4f0 (patch) | |
tree | 7391d5b7693e2ee0ac8bea7c573f018d6647b203 /candle-kernels | |
parent | 45e235a7473d473df5c1e50f55504a97e28be822 (diff) | |
Avoid a contiguous call in the quantized phi 3 model. (#2209)
* Simplify the KvCache api.
* Avoid a contiguous call in the quantized phi3 model.
Diffstat (limited to 'candle-kernels')
0 files changed, 0 insertions, 0 deletions