author | Laurent Mazare <laurent.mazare@gmail.com> | 2024-03-28 23:10:57 +0100 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-03-28 23:10:57 +0100 |
commit | 708e422456e8ed783923dd05ca7f4922099eda8c (patch) | |
tree | c3d7cc91ad2c29d59123c958822eca0b5625972a /README.md | |
parent | c5092f2c2977dbb0b45d16a869d22f4c2790a1e2 (diff) | |
Qwen MoE model. (#1960)
* Qwen MoE model.
* Add the MoE model to the example.
* Fix the scaling.
* Readme updates.
* Readme tweaks.
Diffstat (limited to 'README.md')
-rw-r--r-- | README.md | 4 |
1 files changed, 3 insertions, 1 deletions
@@ -125,6 +125,8 @@ We also provide a some command line based examples using state of the art models
 [RepVGG](./candle-examples/examples/repvgg): computer vision models.
 - [BLIP](./candle-examples/examples/blip/): image to text model, can be used to
   generate captions for an image.
+- [CLIP](./candle-examples/examples/clip/): multi-model vision and language
+  model.
 - [TrOCR](./candle-examples/examples/trocr/): a transformer OCR model, with
   dedicated submodels for hand-writing and printed recognition.
 - [Marian-MT](./candle-examples/examples/marian-mt/): neural machine translation
@@ -206,7 +208,7 @@ If you have an addition to this list, please submit a pull request.
 - Replit-code-v1.5-3B.
 - Bert.
 - Yi-6B and Yi-34B.
-- Qwen1.5.
+- Qwen1.5, Qwen1.5 MoE.
 - RWKV v5 and v6.
 - Quantized LLMs.
 - Llama 7b, 13b, 70b, as well as the chat and code variants.