index
:
forks/candle.git
main
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
candle-core
/
src
/
quantized
/
k_quants.rs
Commit message (
Expand
)
Author
Age
Files
Lines
*
Bugfix for dequantizing q5k layers. (#1569)
Laurent Mazare
2024-01-11
1
-4
/
+4
*
No need for the even constraint on vecdot-q40-q80. (#1202)
Laurent Mazare
2023-10-28
1
-5
/
+0
*
Simd128 optimized q8k vecdot. (#1026)
Laurent Mazare
2023-10-03
1
-0
/
+3
*
AVX optimized q8k vecdot. (#1024)
Laurent Mazare
2023-10-03
1
-0
/
+3
*
neon optimized q8k multiplication. (#1021)
Laurent Mazare
2023-10-02
1
-3
/
+7
*
Add the q8k vec-dot multiplication. (#1019)
Laurent Mazare
2023-10-02
1
-2
/
+18
*
Improve the testing of the optimized quantized vec-dot ops (#1016)
Laurent Mazare
2023-10-02
1
-2
/
+60
*
Simd128 version of q6k vec-dot. (#1015)
Laurent Mazare
2023-10-01
1
-0
/
+3
*
Simd128 version of the q2k-q8k vecdot product. (#1011)
Laurent Mazare
2023-09-30
1
-5
/
+3
*
Simd128 q2k vecdot (#982)
Laurent Mazare
2023-09-28
1
-0
/
+3
*
Sketch a simd128 optimized q4k vecdot. (#977)
Laurent Mazare
2023-09-27
1
-0
/
+3
*
Simd128 vec-dot for q4_0. (#974)
Laurent Mazare
2023-09-27
1
-0
/
+3
*
simd128 optimized q8_0 vecdot (#972)
Laurent Mazare
2023-09-27
1
-0
/
+3
*
Fix some errors about BlockQ8_1 (#776)
zmlcc
2023-09-08
1
-3
/
+5
*
Neon optimized vecdot (#666)
Laurent Mazare
2023-08-29
1
-0
/
+9
*
Add `avx` implemenetations of `q2k`, `q3k` and `q5k` vec-dot functions (#654)
Lukas Kreussel
2023-08-29
1
-0
/
+12
*
AVX version of the q4k vecdot. (#651)
Laurent Mazare
2023-08-29
1
-0
/
+3
*
Neon optimized version of the q4k vecdot product. (#632)
Laurent Mazare
2023-08-27
1
-0
/
+4
*
Fix for q5_1 quantization. (#617)
Laurent Mazare
2023-08-27
1
-1
/
+1
*
Quantization tests + fix some issues. (#616)
Laurent Mazare
2023-08-27
1
-6
/
+6
*
More missing quantized bits. (#615)
Laurent Mazare
2023-08-27
1
-7
/
+94
*
Missing quants ops (#611)
Laurent Mazare
2023-08-26
1
-13
/
+123
*
Another transmute tweak. (#610)
Laurent Mazare
2023-08-26
1
-20
/
+19
*
Avoid using tmp values. (#609)
Laurent Mazare
2023-08-26
1
-20
/
+8
*
Add reference implementation for `q4k` and `q5k` (#586)
Lukas Kreussel
2023-08-26
1
-4
/
+177
*
Avoid some transmutes. (#607)
Laurent Mazare
2023-08-25
1
-10
/
+5
*
Neon intrinsics for the q8_0 vecdot. (#604)
Laurent Mazare
2023-08-25
1
-0
/
+3
*
AVX version for the q8-0 multiplications. (#598)
Laurent Mazare
2023-08-25
1
-0
/
+4
*
Generic implementation of vecdot for q80. (#596)
Laurent Mazare
2023-08-25
1
-2
/
+18
*
Referenze implementations of `q2k` and `q3k` vec-dot functions (#580)
Lukas Kreussel
2023-08-24
1
-7
/
+179
*
Avoid some mutable variables (take 2). (#554)
Laurent Mazare
2023-08-22
1
-23
/
+15
*
Revert "Avoid some mut in quantized functions. (#550)" (#552)
Laurent Mazare
2023-08-22
1
-16
/
+25
*
Avoid some mut in quantized functions. (#550)
Laurent Mazare
2023-08-22
1
-25
/
+16
*
Add quantization support for `q2k`, `q3k`, `q4k` and `q5k` (#524)
Lukas Kreussel
2023-08-22
1
-399
/
+574
*
Neon support for quantization. (#519)
Laurent Mazare
2023-08-19
1
-0
/
+6
*
Basic `qmatmul` parallelization (#492)
Lukas Kreussel
2023-08-18
1
-5
/
+15
*
Q6K quantization (#495)
Laurent Mazare
2023-08-17
1
-2
/
+207
*
AVX version of the q6k vec-dot. (#493)
Laurent Mazare
2023-08-17
1
-0
/
+4
*
Move the avx specific bits to a separate file. (#481)
Laurent Mazare
2023-08-17
1
-116
/
+45
*
AVX version of the vecdot for q4_0. (#474)
Laurent Mazare
2023-08-17
1
-0
/
+75
*
Add vecdot for q6k-q8k. (#476)
Laurent Mazare
2023-08-16
1
-2
/
+56
*
Use a zipped iterator. (#475)
Laurent Mazare
2023-08-16
1
-11
/
+54
*
Add a kv-cache to the quantized llama example. (#466)
Laurent Mazare
2023-08-16
1
-4
/
+4
*
Get the ggml based llama to generate some text. (#464)
Laurent Mazare
2023-08-16
1
-6
/
+7
*
Quantized support for f16 and f32 (#457)
Laurent Mazare
2023-08-15
1
-0
/
+74
*
Split out the quantized file. (#456)
Laurent Mazare
2023-08-15
1
-0
/
+728