summaryrefslogtreecommitdiff
path: root/README.md
diff options
context:
space:
mode:
authorLaurent Mazare <laurent.mazare@gmail.com>2024-08-17 19:31:23 +0100
committerGitHub <noreply@github.com>2024-08-17 20:31:23 +0200
commitc1b9e07e3549574659b189389975c1152b0776f5 (patch)
tree25f30a3c0dc483a86f920e609c5dd1e52594855d /README.md
parent69fdcfe96ac05213b3b166140774f38a99de0b54 (diff)
downloadcandle-c1b9e07e3549574659b189389975c1152b0776f5.tar.gz
candle-c1b9e07e3549574659b189389975c1152b0776f5.tar.bz2
candle-c1b9e07e3549574659b189389975c1152b0776f5.zip
Add support for gemma-2. (#2425)
* Add gemma-2. * Support a couple more models. * Sliding window support. * Example + readme updates. * Update the main readme.
Diffstat (limited to 'README.md')
-rw-r--r--README.md4
1 files changed, 2 insertions, 2 deletions
diff --git a/README.md b/README.md
index 543b9ca8..06d3104e 100644
--- a/README.md
+++ b/README.md
@@ -65,7 +65,7 @@ We also provide a some command line based examples using state of the art models
- [Falcon](./candle-examples/examples/falcon/): general LLM.
- [Codegeex4](./candle-examples/examples/codegeex4-9b/): Code completion,code interpreter,web search,fuction calling,repository-level
- [GLM4](./candle-examples/examples/glm4/): Open Multilingual Multimodal Chat LMs by THUDM
-- [Gemma](./candle-examples/examples/gemma/): 2b and 7b general LLMs from Google Deepmind.
+- [Gemma v1 and v2](./candle-examples/examples/gemma/): 2b and 7b+/9b general LLMs from Google Deepmind.
- [RecurrentGemma](./candle-examples/examples/recurrent-gemma/): 2b and 7b
Griffin based models from Google that mix attention with a RNN like state.
- [Phi-1, Phi-1.5, Phi-2, and Phi-3](./candle-examples/examples/phi/): 1.3b,
@@ -208,7 +208,7 @@ If you have an addition to this list, please submit a pull request.
- StarCoder, StarCoder2.
- Phi 1, 1.5, 2, and 3.
- Mamba, Minimal Mamba
- - Gemma 2b and 7b.
+ - Gemma v1 2b and 7b+, v2 2b and 9b.
- Mistral 7b v0.1.
- Mixtral 8x7b v0.1.
- StableLM-3B-4E1T, StableLM-2-1.6B, Stable-Code-3B.