summaryrefslogtreecommitdiff
path: root/candle-transformers/src/models/yi.rs
Commit message (Collapse)AuthorAgeFilesLines
* Module Docs (#2624)zachcp2024-11-181-5/+7
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * update whisper * update llama2c * update t5 * update phi and t5 * add a blip model * qlamma doc * add two new docs * add docs and emoji * additional models * openclip * pixtral * edits on the model docs * update yu * update a fe wmore models * add persimmon * add model-level doc * names * update module doc * links in heira * remove empty URL * update more hyperlinks * updated hyperlinks * more links * Update mod.rs --------- Co-authored-by: Laurent Mazare <laurent.mazare@gmail.com>
* Documentation Pass for Models (#2617)zachcp2024-11-151-1/+15
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * links in chinese_clip * links for clip model * add mod docs for flux and llava * module doc for MMDIT and MIMI * add docs for a few more modesl * mod docs for bert naser and beit * add module docs for convmixer colpali codegeex and chatglm * add another series of moddocs * add fastvit-llama2_c * module docs mamba -> mobileone * module docs from moondream-phi3 * mod docs for quantized and qwen * update to yi * fix long names * Update llama2_c.rs * Update llama2_c_weights.rs * Fix the link for mimi + tweaks --------- Co-authored-by: Laurent Mazare <laurent.mazare@gmail.com>
* Use cat for faster MQA computation. (#2043)Laurent Mazare2024-04-121-14/+2
| | | | | | | | | * Use cat for faster MQA computation. * Move the function to utils + use it in mistral. * Use the shared repeat-kv in a few more models. * Fix.
* Use a common with_tracing::RmsNorm in a few models. (#1871)Jani Monoses2024-03-181-22/+1
| | | | | * Add RmsNorm with tracing. * Use with_tracing::RmsNorm in some models.
* Use the llama weight names for the Yi example. (#1381)Laurent Mazare2023-11-271-2/+6
|
* Add the Yi-6b and Yi-34b models. (#1320)Laurent Mazare2023-11-111-0/+377
* Add the Yi-6b model. * Add the 34b model. * Add the yi example. * Fix the weight file names.