forks/candle.git

	Commit message	Author	Age	Files	Lines
*	Simpler repro for the neon optimization issue + bugfix (#1544)	Laurent Mazare	2024-01-07	1	-152/+56
*	Fix the quantized mistral example. (#1478)	Laurent Mazare	2023-12-25	1	-1/+1
*	Fix a couple typos (#1451)	Laurent Mazare	2023-12-17	2	-3/+3
*	Implement the module trait directly for QMatMul. (#1372)	Laurent Mazare	2023-11-25	1	-2/+2
*	Allow using gguf-v3 files. (#1262)	Laurent Mazare	2023-11-03	1	-5/+15
*	No need for the even constraint on vecdot-q40-q80. (#1202)	Laurent Mazare	2023-10-28	4	-41/+2
*	Add a quantized variant of llama2.c (#1197)	Laurent Mazare	2023-10-27	2	-28/+2
*	Better control on the optional dequantization in QMatMul (#1049)	Laurent Mazare	2023-10-07	1	-7/+28
*	Simd128 optimized q8k vecdot. (#1026)	Laurent Mazare	2023-10-03	2	-0/+33
*	AVX optimized q8k vecdot. (#1024)	Laurent Mazare	2023-10-03	2	-0/+35
*	neon optimized q8k multiplication. (#1021)	Laurent Mazare	2023-10-02	2	-3/+36
*	Add the q8k vec-dot multiplication. (#1019)	Laurent Mazare	2023-10-02	1	-2/+18
*	Improve the quantized whisper setup. (#1018)	Laurent Mazare	2023-10-02	1	-10/+19
*	Improve the testing of the optimized quantized vec-dot ops (#1016)	Laurent Mazare	2023-10-02	1	-2/+60
*	Simd128 version of q6k vec-dot. (#1015)	Laurent Mazare	2023-10-01	2	-1/+127
*	Simd128 version of the q2k-q8k vecdot product. (#1011)	Laurent Mazare	2023-09-30	2	-45/+75
*	Simd128 q2k vecdot (#982)	Laurent Mazare	2023-09-28	2	-4/+57
*	Sketch a simd128 optimized q4k vecdot. (#977)	Laurent Mazare	2023-09-27	2	-1/+97
*	Simd128 vec-dot for q4_0. (#974)	Laurent Mazare	2023-09-27	2	-1/+54
*	simd128 optimized q8_0 vecdot (#972)	Laurent Mazare	2023-09-27	3	-0/+54
*	Use the gelu-erf activation. (#969)	Laurent Mazare	2023-09-26	1	-3/+3
*	Avoid some overflows on wasm32. (#968)	Laurent Mazare	2023-09-26	2	-3/+14
*	Add a quantized version of the t5 model. (#921)	Laurent Mazare	2023-09-21	1	-1/+1
*	Fix some errors about BlockQ8_1 (#776)	zmlcc	2023-09-08	1	-3/+5
*	Add `ggufv2` support (#725)	Lukas Kreussel	2023-09-03	1	-21/+97
*	Support for quantized tensors in the python api. (#706)	Laurent Mazare	2023-09-01	1	-3/+11
*	Small cleanups (avoid some possible mutations) (#670)	Laurent Mazare	2023-08-30	1	-99/+59
*	Neon optimized vecdot (#666)	Laurent Mazare	2023-08-29	2	-8/+369
*	Add `avx` implementations of `q2k`, `q3k` and `q5k` vec-dot functions (#654)	Lukas Kreussel	2023-08-29	2	-8/+403
*	AVX version of the q4k vecdot. (#651)	Laurent Mazare	2023-08-29	2	-9/+120
*	Neon optimized version of the q4k vecdot product. (#632)	Laurent Mazare	2023-08-27	2	-1/+99
*	Llama quantization. (#625)	Laurent Mazare	2023-08-27	1	-0/+4
*	Add the quantize command. (#624)	Laurent Mazare	2023-08-27	1	-1/+2
*	Fix for q5_1 quantization. (#617)	Laurent Mazare	2023-08-27	1	-1/+1
*	Quantization tests + fix some issues. (#616)	Laurent Mazare	2023-08-27	1	-6/+6
*	More missing quantized bits. (#615)	Laurent Mazare	2023-08-27	1	-7/+94
*	Missing quants ops (#611)	Laurent Mazare	2023-08-26	1	-13/+123
*	Another transmute tweak. (#610)	Laurent Mazare	2023-08-26	1	-20/+19
*	Avoid using tmp values. (#609)	Laurent Mazare	2023-08-26	1	-20/+8
*	Add reference implementation for `q4k` and `q5k` (#586)	Lukas Kreussel	2023-08-26	1	-4/+177
*	Avoid some transmutes. (#607)	Laurent Mazare	2023-08-25	1	-10/+5
*	Neon intrinsics for the q8_0 vecdot. (#604)	Laurent Mazare	2023-08-25	2	-0/+64
*	AVX version for the q8-0 multiplications. (#598)	Laurent Mazare	2023-08-25	2	-1/+23
*	Generic implementation of vecdot for q80. (#596)	Laurent Mazare	2023-08-25	1	-2/+18
*	Add a function to write gguf files. (#585)	Laurent Mazare	2023-08-24	2	-4/+163
*	Reference implementations of `q2k` and `q3k` vec-dot functions (#580)	Lukas Kreussel	2023-08-24	1	-7/+179
*	GGUF support in the quantized model. (#559)	Laurent Mazare	2023-08-23	1	-2/+88
*	Handle GGUF files in tensor-tools. (#558)	Laurent Mazare	2023-08-23	1	-2/+10
*	Preliminary GGUF support. (#557)	Laurent Mazare	2023-08-23	2	-0/+221
*	Avoid some mutable variables (take 2). (#554)	Laurent Mazare	2023-08-22	2	-37/+29