summaryrefslogtreecommitdiff
path: root/candle-transformers/src/models/quantized_stable_lm.rs
Commit message (Collapse)AuthorAgeFilesLines
* Documentation Pass for Models (#2617)zachcp2024-11-151-0/+15
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * links in chinese_clip * links for clip model * add mod docs for flux and llava * module doc for MMDIT and MIMI * add docs for a few more modesl * mod docs for bert naser and beit * add module docs for convmixer colpali codegeex and chatglm * add another series of moddocs * add fastvit-llama2_c * module docs mamba -> mobileone * module docs from moondream-phi3 * mod docs for quantized and qwen * update to yi * fix long names * Update llama2_c.rs * Update llama2_c_weights.rs * Fix the link for mimi + tweaks --------- Co-authored-by: Laurent Mazare <laurent.mazare@gmail.com>
* Use cat for faster MQA computation. (#2043)Laurent Mazare2024-04-121-14/+3
| | | | | | | | | * Use cat for faster MQA computation. * Move the function to utils + use it in mistral. * Use the shared repeat-kv in a few more models. * Fix.
* Rustfmt fix. (#1788)Laurent Mazare2024-03-021-1/+5
|
* Update StableLM config (#1787)Frkri2024-03-021-3/+3
|
* Quantized support for stable-lm2. (#1654)Laurent Mazare2024-02-041-4/+9
| | | | | * Quantized support for stable-lm2. * Quantized support for v2-zephyr.
* Make more models cloneable. (#1203)Laurent Mazare2023-10-281-4/+4
|
* Tracing for StableLM and quantized StableLM. (#1068)Laurent Mazare2023-10-101-0/+12
|
* Move the common quantized-nn code to a shared module. (#1063)Laurent Mazare2023-10-091-24/+1
|
* Quantized version of StableLM. (#1058)Laurent Mazare2023-10-081-0/+299
* Quantized version of StableLM. * Adapt the stable-lm example to support quantizsed. * Use some separate hub repo. * Another repo name tweak.