index
:
forks/candle.git
main
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
candle-core
/
src
/
quantized
Commit message (
Expand
)
Author
Age
Files
Lines
*
Simpler repro for the neon optimization issue + bugfix (#1544)
Laurent Mazare
2024-01-07
1
-152
/
+56
*
Fix the quantized mistral example. (#1478)
Laurent Mazare
2023-12-25
1
-1
/
+1
*
Fix a couple typos (#1451)
Laurent Mazare
2023-12-17
2
-3
/
+3
*
Implement the module trait directly for QMatMul. (#1372)
Laurent Mazare
2023-11-25
1
-2
/
+2
*
Allow using gguf-v3 files. (#1262)
Laurent Mazare
2023-11-03
1
-5
/
+15
*
No need for the even constraint on vecdot-q40-q80. (#1202)
Laurent Mazare
2023-10-28
4
-41
/
+2
*
Add a quantized variant of llama2.c (#1197)
Laurent Mazare
2023-10-27
2
-28
/
+2
*
Better control on the optional dequantization in QMatMul (#1049)
Laurent Mazare
2023-10-07
1
-7
/
+28
*
Simd128 optimized q8k vecdot. (#1026)
Laurent Mazare
2023-10-03
2
-0
/
+33
*
AVX optimized q8k vecdot. (#1024)
Laurent Mazare
2023-10-03
2
-0
/
+35
*
neon optimized q8k multiplication. (#1021)
Laurent Mazare
2023-10-02
2
-3
/
+36
*
Add the q8k vec-dot multiplication. (#1019)
Laurent Mazare
2023-10-02
1
-2
/
+18
*
Improve the quantized whisper setup. (#1018)
Laurent Mazare
2023-10-02
1
-10
/
+19
*
Improve the testing of the optimized quantized vec-dot ops (#1016)
Laurent Mazare
2023-10-02
1
-2
/
+60
*
Simd128 version of q6k vec-dot. (#1015)
Laurent Mazare
2023-10-01
2
-1
/
+127
*
Simd128 version of the q2k-q8k vecdot product. (#1011)
Laurent Mazare
2023-09-30
2
-45
/
+75
*
Simd128 q2k vecdot (#982)
Laurent Mazare
2023-09-28
2
-4
/
+57
*
Sketch a simd128 optimized q4k vecdot. (#977)
Laurent Mazare
2023-09-27
2
-1
/
+97
*
Simd128 vec-dot for q4_0. (#974)
Laurent Mazare
2023-09-27
2
-1
/
+54
*
simd128 optimized q8_0 vecdot (#972)
Laurent Mazare
2023-09-27
3
-0
/
+54
*
Use the gelu-erf activation. (#969)
Laurent Mazare
2023-09-26
1
-3
/
+3
*
Avoid some overflows on wasm32. (#968)
Laurent Mazare
2023-09-26
2
-3
/
+14
*
Add a quantized version of the t5 model. (#921)
Laurent Mazare
2023-09-21
1
-1
/
+1
*
Fix some errors about BlockQ8_1 (#776)
zmlcc
2023-09-08
1
-3
/
+5
*
Add `ggufv2` support (#725)
Lukas Kreussel
2023-09-03
1
-21
/
+97
*
Support for quantized tensors in the python api. (#706)
Laurent Mazare
2023-09-01
1
-3
/
+11
*
Small cleanups (avoid some possible mutations) (#670)
Laurent Mazare
2023-08-30
1
-99
/
+59
*
Neon optimized vecdot (#666)
Laurent Mazare
2023-08-29
2
-8
/
+369
*
Add `avx` implemenetations of `q2k`, `q3k` and `q5k` vec-dot functions (#654)
Lukas Kreussel
2023-08-29
2
-8
/
+403
*
AVX version of the q4k vecdot. (#651)
Laurent Mazare
2023-08-29
2
-9
/
+120
*
Neon optimized version of the q4k vecdot product. (#632)
Laurent Mazare
2023-08-27
2
-1
/
+99
*
Llama quantization. (#625)
Laurent Mazare
2023-08-27
1
-0
/
+4
*
Add the quantize command. (#624)
Laurent Mazare
2023-08-27
1
-1
/
+2
*
Fix for q5_1 quantization. (#617)
Laurent Mazare
2023-08-27
1
-1
/
+1
*
Quantization tests + fix some issues. (#616)
Laurent Mazare
2023-08-27
1
-6
/
+6
*
More missing quantized bits. (#615)
Laurent Mazare
2023-08-27
1
-7
/
+94
*
Missing quants ops (#611)
Laurent Mazare
2023-08-26
1
-13
/
+123
*
Another transmute tweak. (#610)
Laurent Mazare
2023-08-26
1
-20
/
+19
*
Avoid using tmp values. (#609)
Laurent Mazare
2023-08-26
1
-20
/
+8
*
Add reference implementation for `q4k` and `q5k` (#586)
Lukas Kreussel
2023-08-26
1
-4
/
+177
*
Avoid some transmutes. (#607)
Laurent Mazare
2023-08-25
1
-10
/
+5
*
Neon intrinsics for the q8_0 vecdot. (#604)
Laurent Mazare
2023-08-25
2
-0
/
+64
*
AVX version for the q8-0 multiplications. (#598)
Laurent Mazare
2023-08-25
2
-1
/
+23
*
Generic implementation of vecdot for q80. (#596)
Laurent Mazare
2023-08-25
1
-2
/
+18
*
Add a function to write gguf files. (#585)
Laurent Mazare
2023-08-24
2
-4
/
+163
*
Referenze implementations of `q2k` and `q3k` vec-dot functions (#580)
Lukas Kreussel
2023-08-24
1
-7
/
+179
*
GGUF support in the quantized model. (#559)
Laurent Mazare
2023-08-23
1
-2
/
+88
*
Handle GGUF files in tensor-tools. (#558)
Laurent Mazare
2023-08-23
1
-2
/
+10
*
Preliminary GGUF support. (#557)
Laurent Mazare
2023-08-23
2
-0
/
+221
*
Avoid some mutable variables (take 2). (#554)
Laurent Mazare
2023-08-22
2
-37
/
+29
[next]