index
:
forks/candle.git
main
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
candle-core
/
tests
/
quantized_tests.rs
Commit message (
Expand
)
Author
Age
Files
Lines
*
Add the cuda dequantize f16 kernels. (#2137)
Laurent Mazare
2024-04-28
1
-1
/
+120
*
Add more QMMV cuda kernels. (#2077)
Laurent Mazare
2024-04-18
1
-7
/
+15
*
Add the mmv kernels for small batch sizes. (#2075)
Laurent Mazare
2024-04-16
1
-1
/
+35
*
Fix for the batch dim in the quantized matmul example. (#2073)
Laurent Mazare
2024-04-15
1
-36
/
+36
*
Add a cuda kernel for dequantizing q8_0. (#1804)
Laurent Mazare
2024-03-05
1
-4
/
+0
*
Handle Q5_0 and Q5_1 quants in cuda.
laurent
2024-02-29
1
-8
/
+0
*
Fix the block size for some cuda kernels. (#1767)
Laurent Mazare
2024-02-27
1
-32
/
+0
*
Quantized GGUF style (#1523)
Nicolas Patry
2024-01-17
1
-143
/
+430
*
Bugfix for dequantizing q5k layers. (#1569)
Laurent Mazare
2024-01-11
1
-1
/
+1
*
Simpler repro for the neon optimization issue + bugfix (#1544)
Laurent Mazare
2024-01-07
1
-16
/
+41
*
Implement the module trait directly for QMatMul. (#1372)
Laurent Mazare
2023-11-25
1
-1
/
+1
*
Add the q8k vec-dot multiplication. (#1019)
Laurent Mazare
2023-10-02
1
-0
/
+28
*
Improve the quantized whisper setup. (#1018)
Laurent Mazare
2023-10-02
1
-7
/
+7
*
Improve the testing of the optimized quantized vec-dot ops (#1016)
Laurent Mazare
2023-10-02
1
-3
/
+8
*
Simd128 version of the q2k-q8k vecdot product. (#1011)
Laurent Mazare
2023-09-30
1
-1
/
+1
*
Move the test-utils bits to a shared place. (#619)
Laurent Mazare
2023-08-27
1
-2
/
+1
*
Fix for q5_1 quantization. (#617)
Laurent Mazare
2023-08-27
1
-48
/
+27
*
Quantization tests + fix some issues. (#616)
Laurent Mazare
2023-08-27
1
-0
/
+93
*
Add reference implementation for `q4k` and `q5k` (#586)
Lukas Kreussel
2023-08-26
1
-1
/
+93
*
Referenze implementations of `q2k` and `q3k` vec-dot functions (#580)
Lukas Kreussel
2023-08-24
1
-0
/
+54
*
Cosmetic tweaks. (#570)
Laurent Mazare
2023-08-23
1
-29
/
+24
*
Mirror GGML's unit tests (#569)
Lukas Kreussel
2023-08-23
1
-16
/
+124
*
Add quantization support for `q2k`, `q3k`, `q4k` and `q5k` (#524)
Lukas Kreussel
2023-08-22
1
-21
/
+174
*
Tensor -> QTensor conversion (#496)
Laurent Mazare
2023-08-18
1
-2
/
+45
*
Q6K quantization (#495)
Laurent Mazare
2023-08-17
1
-0
/
+26
*
AVX version of the vecdot for q4_0. (#474)
Laurent Mazare
2023-08-17
1
-10
/
+10
*
Add vecdot for q6k-q8k. (#476)
Laurent Mazare
2023-08-16
1
-0
/
+22
*
Add a quantized test that use negative values. (#470)
Laurent Mazare
2023-08-16
1
-0
/
+50
*
Get the ggml based llama to generate some text. (#464)
Laurent Mazare
2023-08-16
1
-3
/
+32
*
Add a test for qmatmul. (#459)
Laurent Mazare
2023-08-16
1
-0
/
+13
*
Split out the quantized file. (#456)
Laurent Mazare
2023-08-15
1
-0
/
+33