diff options
author | Laurent Mazare <laurent.mazare@gmail.com> | 2023-07-20 07:39:04 +0200 |
---|---|---|
committer | GitHub <noreply@github.com> | 2023-07-20 06:39:04 +0100 |
commit | 12d6dc018dc7c0c698162cb19e9459bb915c5b84 (patch) | |
tree | 14669bc66d429ce6d8b796abae56c5c5af5c7bc1 /README.md | |
parent | c34f932319f9b25f9e87e1376fa80a427576903b (diff) | |
download | candle-12d6dc018dc7c0c698162cb19e9459bb915c5b84.tar.gz candle-12d6dc018dc7c0c698162cb19e9459bb915c5b84.tar.bz2 candle-12d6dc018dc7c0c698162cb19e9459bb915c5b84.zip |
Support for MQA for llama v2. (#205)
* Support for MQA for llama v2.
* More llama-v2.
* Move the rotary embedding precomputation in the cache.
* Add a v2 flag.
* Use the hf model.
Diffstat (limited to 'README.md')
-rw-r--r-- | README.md | 2 |
1 files changed, 1 insertions, 1 deletions
@@ -13,7 +13,7 @@ let c = a.matmul(&b)?; Check out our [examples](./candle-examples/examples/): - [Whisper](./candle-examples/examples/whisper/) -- [Llama](./candle-examples/examples/llama/) +- [Llama and Llama-v2](./candle-examples/examples/llama/) - [Bert](./candle-examples/examples/bert/) (Useful for sentence embeddings) - [Falcon](./candle-examples/examples/falcon/) |