diff options
author | Laurent Mazare <laurent.mazare@gmail.com> | 2024-08-17 19:31:23 +0100 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-08-17 20:31:23 +0200 |
commit | c1b9e07e3549574659b189389975c1152b0776f5 (patch) | |
tree | 25f30a3c0dc483a86f920e609c5dd1e52594855d /README.md | |
parent | 69fdcfe96ac05213b3b166140774f38a99de0b54 (diff) | |
download | candle-c1b9e07e3549574659b189389975c1152b0776f5.tar.gz candle-c1b9e07e3549574659b189389975c1152b0776f5.tar.bz2 candle-c1b9e07e3549574659b189389975c1152b0776f5.zip |
Add support for gemma-2. (#2425)
* Add gemma-2.
* Support a couple more models.
* Sliding window support.
* Example + readme updates.
* Update the main readme.
Diffstat (limited to 'README.md')
-rw-r--r-- | README.md | 4 |
1 files changed, 2 insertions, 2 deletions
@@ -65,7 +65,7 @@ We also provide a some command line based examples using state of the art models - [Falcon](./candle-examples/examples/falcon/): general LLM. - [Codegeex4](./candle-examples/examples/codegeex4-9b/): Code completion,code interpreter,web search,fuction calling,repository-level - [GLM4](./candle-examples/examples/glm4/): Open Multilingual Multimodal Chat LMs by THUDM -- [Gemma](./candle-examples/examples/gemma/): 2b and 7b general LLMs from Google Deepmind. +- [Gemma v1 and v2](./candle-examples/examples/gemma/): 2b and 7b+/9b general LLMs from Google Deepmind. - [RecurrentGemma](./candle-examples/examples/recurrent-gemma/): 2b and 7b Griffin based models from Google that mix attention with a RNN like state. - [Phi-1, Phi-1.5, Phi-2, and Phi-3](./candle-examples/examples/phi/): 1.3b, @@ -208,7 +208,7 @@ If you have an addition to this list, please submit a pull request. - StarCoder, StarCoder2. - Phi 1, 1.5, 2, and 3. - Mamba, Minimal Mamba - - Gemma 2b and 7b. + - Gemma v1 2b and 7b+, v2 2b and 9b. - Mistral 7b v0.1. - Mixtral 8x7b v0.1. - StableLM-3B-4E1T, StableLM-2-1.6B, Stable-Code-3B. |