Add support for gemma-2. (#2425)

* Add gemma-2.
* Support a couple more models.
* Sliding window support.
* Example + readme updates.
* Update the main readme.
author: Laurent Mazare <laurent.mazare@gmail.com> 2024-08-17 19:31:23 +0100
committer: GitHub <noreply@github.com> 2024-08-17 20:31:23 +0200
commit: c1b9e07e3549574659b189389975c1152b0776f5 (patch)
tree: 25f30a3c0dc483a86f920e609c5dd1e52594855d /README.md
parent: 69fdcfe96ac05213b3b166140774f38a99de0b54 (diff)
download: candle-c1b9e07e3549574659b189389975c1152b0776f5.tar.gz
candle-c1b9e07e3549574659b189389975c1152b0776f5.tar.bz2
candle-c1b9e07e3549574659b189389975c1152b0776f5.zip
1 files changed, 2 insertions, 2 deletions
diff --git a/README.md b/README.md
index 543b9ca8..06d3104e 100644
--- a/README.md
+++ b/README.md
@@ -65,7 +65,7 @@ We also provide a some command line based examples using state of the art models
 - [Falcon](./candle-examples/examples/falcon/): general LLM.
 - [Codegeex4](./candle-examples/examples/codegeex4-9b/): Code completion,code interpreter,web search,fuction calling,repository-level
 - [GLM4](./candle-examples/examples/glm4/): Open Multilingual Multimodal Chat LMs by THUDM
-- [Gemma](./candle-examples/examples/gemma/): 2b and 7b general LLMs from Google Deepmind.
+- [Gemma v1 and v2](./candle-examples/examples/gemma/): 2b and 7b+/9b general LLMs from Google Deepmind.
 - [RecurrentGemma](./candle-examples/examples/recurrent-gemma/): 2b and 7b
   Griffin based models from Google that mix attention with a RNN like state.
 - [Phi-1, Phi-1.5, Phi-2, and Phi-3](./candle-examples/examples/phi/): 1.3b,
@@ -208,7 +208,7 @@ If you have an addition to this list, please submit a pull request.
         - StarCoder, StarCoder2.
         - Phi 1, 1.5, 2, and 3.
         - Mamba, Minimal Mamba
-        - Gemma 2b and 7b.
+        - Gemma v1 2b and 7b+, v2 2b and 9b.
         - Mistral 7b v0.1.
         - Mixtral 8x7b v0.1.
         - StableLM-3B-4E1T, StableLM-2-1.6B, Stable-Code-3B.