summaryrefslogtreecommitdiff
path: root/candle-core/src/quantized/simd128.rs
diff options
context:
space:
mode:
authorLaurent Mazare <laurent.mazare@gmail.com>2023-09-30 19:25:47 +0200
committerGitHub <noreply@github.com>2023-09-30 18:25:47 +0100
commitdeee7612da7dcda1aa1cfd4237f4858d9f5ed8c7 (patch)
tree4c67080d778fb502ee665eb036b35b8c72a10103 /candle-core/src/quantized/simd128.rs
parent06207332bc58e20680dd1925b7d90bac51f4f21c (diff)
downloadcandle-deee7612da7dcda1aa1cfd4237f4858d9f5ed8c7.tar.gz
candle-deee7612da7dcda1aa1cfd4237f4858d9f5ed8c7.tar.bz2
candle-deee7612da7dcda1aa1cfd4237f4858d9f5ed8c7.zip
Quantized version of mistral. (#1009)
* Quantized version of mistral. * Integrate the quantized mistral variant. * Use the quantized weight files. * Tweak the quantization command. * Fix the dtype when computing the rotary embeddings. * Update the readme with the quantized version. * Fix the decoding of the remaining tokens.
Diffstat (limited to 'candle-core/src/quantized/simd128.rs')
0 files changed, 0 insertions, 0 deletions