summaryrefslogtreecommitdiff
path: root/README.md
diff options
context:
space:
mode:
authorLaurent Mazare <laurent.mazare@gmail.com>2023-07-20 07:39:04 +0200
committerGitHub <noreply@github.com>2023-07-20 06:39:04 +0100
commit12d6dc018dc7c0c698162cb19e9459bb915c5b84 (patch)
tree14669bc66d429ce6d8b796abae56c5c5af5c7bc1 /README.md
parentc34f932319f9b25f9e87e1376fa80a427576903b (diff)
downloadcandle-12d6dc018dc7c0c698162cb19e9459bb915c5b84.tar.gz
candle-12d6dc018dc7c0c698162cb19e9459bb915c5b84.tar.bz2
candle-12d6dc018dc7c0c698162cb19e9459bb915c5b84.zip
Support for MQA for llama v2. (#205)
* Support for MQA for llama v2. * More llama-v2. * Move the rotary embedding precomputation in the cache. * Add a v2 flag. * Use the hf model.
Diffstat (limited to 'README.md')
-rw-r--r--README.md2
1 files changed, 1 insertions, 1 deletions
diff --git a/README.md b/README.md
index 108dac6e..2409613f 100644
--- a/README.md
+++ b/README.md
@@ -13,7 +13,7 @@ let c = a.matmul(&b)?;
Check out our [examples](./candle-examples/examples/):
- [Whisper](./candle-examples/examples/whisper/)
-- [Llama](./candle-examples/examples/llama/)
+- [Llama and Llama-v2](./candle-examples/examples/llama/)
- [Bert](./candle-examples/examples/bert/) (Useful for sentence embeddings)
- [Falcon](./candle-examples/examples/falcon/)