forks/candle.git -

	Commit message (Collapse)	Author	Age	Files	Lines
*	Clippy fixes. (#2415)	Laurent Mazare	2024-08-14	2	-8/+6
\| \| \| \| \|	* Clippy fixes. * Bump the web_sys required version.
*	Remove the deprecated wav crate in favor of hound. (#2202)	Laurent Mazare	2024-05-21	2	-9/+12
\|
*	Bump the version number to 0.5.1. (#2155)	Laurent Mazare	2024-05-03	1	-21/+0
\| \| \| \| \| \| \|	* Bump the version number to 0.5.1. * Fix clippy lints for 1.78. * More clippy fixes.
*	Quantized GGUF style (#1523)	Nicolas Patry	2024-01-17	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* Metal quantized modifications proposal. - Add a device param, wherever needed. - Create new QMetal storage thing that implements QuantizedType. - Update everywhere needed. Fix Python. Fixing examples. Fix: fmt + clippy + stub. Moving everything around. Only missing the actual implems. Fixing everything + adding dequantized kernels. More work. Fixing matmul. Fmt + Clippy Some clippy fixes. Working state. Q2K Metal -> Bugged (also present in GGML). Q4K CPU -> Bugged (present previously, new test catch it). Q5K CPU -> Bugged (present previously). Q8_1 Both -> Never really implemented it seems Q8K metal -> Never implemented in metal Fixing Q2K bug (present in ggml). * Cleanup. * Fix the rebase. * Removing the fences speeds everything up and is correct this time... * Cleanup the fence. * After rebase. * Bad code removal. * Rebase after phi2 merge + fix replit default to CPU. * Making the CI happy. * More happy tests. --------- Co-authored-by: Nicolas Patry <nicolas@Nicolass-MacBook-Pro.local>
*	Update gloo requirement from 0.8 to 0.11 (#1558)	dependabot[bot]	2024-01-10	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Updates the requirements on [gloo](https://github.com/rustwasm/gloo) to permit the latest version. - [Release notes](https://github.com/rustwasm/gloo/releases) - [Changelog](https://github.com/rustwasm/gloo/blob/master/CHANGELOG.md) - [Commits](https://github.com/rustwasm/gloo/commits) --- updated-dependencies: - dependency-name: gloo dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
*	Simplifying our internal cargo dependencies. (#1529)	Nicolas Patry	2024-01-07	1	-3/+3
\|
*	Fix lints for clippy 1.75. (#1494)	Laurent Mazare	2023-12-28	1	-9/+7
\|
*	Bump the crate version to 0.3.3. (#1490)	Laurent Mazare	2023-12-28	1	-3/+3
\|
*	Bump the crate version to 0.3.2. (#1452)	Laurent Mazare	2023-12-17	1	-3/+3
\|
*	Fix a couple typos (#1451)	Laurent Mazare	2023-12-17	2	-2/+2
\| \| \| \| \|	* Mixtral quantized instruct. * Fix a couple typos.
*	Use the whisper-v3 tokenizer now that it has been added. (#1337)	Laurent Mazare	2023-11-16	1	-1/+7
\| \| \| \| \|	* Use the whisper-v3 tokenizer now that it has been added. * Use the appropriate nospeech token.
*	fix: address clippy 0.1.74 issues (#1336)	drbh	2023-11-16	1	-2/+1
\| \| \| \|	- clippy::needless-borrows-for-generic-args - clippy::reserve-after-initialization
*	Update for 0.3.1. (#1324)	Laurent Mazare	2023-11-11	1	-3/+3
\|
*	Preliminary support for whisper v3. (#1294)	Laurent Mazare	2023-11-08	2	-3/+5
\| \| \| \| \|	* Preliminary support for whisper v3. * Add the missing files.
*	add distil-whisper link (#1261)	Radamés Ajna	2023-11-03	1	-35/+49
\|
*	Remove some unusued bits. (#1067)	Laurent Mazare	2023-10-09	1	-1/+0
\|
*	Whisper quantized wasm (#1028)	Radamés Ajna	2023-10-04	12	-596/+539
\| \| \| \| \| \| \| \| \| \| \| \| \|	* [Whisper] Update to use quantized model * [whisper] add language detection * [whisper] change assets location * [whisper] adapt js example with quantized models * [whisper] better task parsing * [whisper] minor fixes
*	Bump the version to 0.3.0. (#1014)	Laurent Mazare	2023-10-01	1	-2/+2
\| \| \| \| \|	* Bump the version to 0.3.0. * Changelog update.
*	Pass directly the buffer ownership. (#949)	Laurent Mazare	2023-09-24	1	-2/+1
\|
*	Bump the crate versions to v0.2.3. (#886)	Laurent Mazare	2023-09-18	1	-2/+2
\| \| \| \| \|	* Bump the crate version. * Also update the python bindings.
*	minor UI fixes (#856)	Radamés Ajna	2023-09-15	1	-3/+6
\| \| \| \| \| \| \|	* fixes * remove listener * remove event listener
*	Bump the crate version + update the changelog. (#822)	Laurent Mazare	2023-09-12	1	-2/+2
\|
*	force model cache (#751)	Radamés Ajna	2023-09-06	1	-9/+10
\|
*	Minor WASM UI improvements (#748)	Radamés Ajna	2023-09-05	1	-6/+11
\| \| \| \| \| \| \|	* add stats * random seed btn * minor ui improvoments
*	Remove unnecessary file. (#710)	Laurent Mazare	2023-09-01	1	-0/+0
\|
*	Add a repeat penalty to the llama2.c wasm example. (#709)	Laurent Mazare	2023-09-01	1	-0/+0
\|
*	Add the kv-cache to the whisper wasm version. (#689)	Laurent Mazare	2023-08-31	3	-40/+95
\| \| \| \| \|	* Add the kv-cache to the whisper wasm version. * Improve the handling of special tokens.
*	Improve Whisper WASM UI example (#669)	Radamés Ajna	2023-08-30	7	-3/+487
\| \| \| \| \| \| \| \| \| \| \|	* wip add module and js worker example * params * clean up, send error * final UI with whisper webworker * add simple instructions
*	Add some documentation. (#673)	Laurent Mazare	2023-08-30	1	-2/+2
\| \| \| \| \|	* Add some documentation. * Bump the crate version.
*	Dilated convolutions (#657)	Laurent Mazare	2023-08-29	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* Add the dilation parameter. * Restore the basic optimizer example. * Dilation support in cudnn. * Use the dilation parameter in the cpu backend. * More dilation support. * No support for dilation in transposed convolutions. * Add dilation to a test. * Remove a print. * Helper function.
*	Remove some dead-code annotations. (#629)	Laurent Mazare	2023-08-27	3	-39/+0
\| \| \| \| \| \| \| \| \|	* Remove some dead-code annotations. * More dead code removal. * One more. * CI fix.
*	Bump the crate version + update CHANGELOG. (#628)	Laurent Mazare	2023-08-27	1	-2/+2
\|
*	Add some group parameter to convolutions. (#566)	Laurent Mazare	2023-08-23	2	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \| \|	* Add some group parameter to convolutions. * Avoid some unnecessary groups checks. * Move the tensor convolution bits. * Properh handling of groups. * Bump the crate version. * And add a changelog.
*	Bump the crates version to 0.1.2. (#522)	Laurent Mazare	2023-08-20	1	-2/+2
\|
*	Add a simple Module trait and implement it for the various nn layers (#500)	Laurent Mazare	2023-08-18	1	-1/+1
\| \| \| \| \| \| \|	* Start adding the module trait. * Use the module trait. * Implement module for qmatmul.
*	Rename vec-dot to vec-ops. (#449)	Laurent Mazare	2023-08-15	1	-2/+2
\| \| \| \| \| \| \|	* Rename vec-dot to vec-ops. * Also bump the crate version. * Add a currently empty readme.
*	Add a cuda kernel for upsampling. (#441)	Laurent Mazare	2023-08-14	1	-4/+1
\| \| \| \| \|	* Add a cuda kernel for upsampling. * Update for the latest tokenizers version.
*	Update the repo location. (#305)	Laurent Mazare	2023-08-02	1	-9/+7
\|
*	Add version numbers for all the candle crates (#303)	Laurent Mazare	2023-08-02	1	-2/+2
\| \| \| \| \|	* Switch to candle-gemm for the time being. * Add the missing versions.
*	Rename the candle crate to candle-core (#301)	Laurent Mazare	2023-08-02	1	-1/+1
\| \| \| \| \|	* Rename to candle-core. * More candle-core renaming.
*	Softmax numerical stability. (#267)	Laurent Mazare	2023-07-28	2	-8/+5
\| \| \| \| \|	* Softmax numerical stability. * Fix the flash-attn test.
*	Removing inner dependency on safetensors.	Nicolas Patry	2023-07-27	1	-1/+1
\|
*	TP sharding v2	Nicolas Patry	2023-07-27	2	-2/+3
\|
*	Micro-cleanup. (#256)	Laurent Mazare	2023-07-27	1	-4/+2
\|
*	Simple QOL.	Nicolas Patry	2023-07-26	1	-2/+10
\| \| \| \| \| \| \|	- Add ms/token on llama2.c (15ms/token on my personal machine) - Hide `Run` buttons while models are not ready - Add dummy `progress` while weights are downloading (I briefly looked at putting a real progressbar.. and nothing easy enough came up.)
*	Re-organize the wasm examples (#231)	Laurent Mazare	2023-07-24	10	-0/+1342
	* Move the whisper example. * More renaming. * Add llama2 as a new wasm example. * Live generation. * More of the llama wasm example. * Formatting.