path: root/candle-flash-attn/Cargo.toml
Commit message (#PR) · Author · Date · Files changed · Lines (-/+)
* Bump the crate version to 0.8.2. (#2703) · Laurent Mazare · 2025-01-07 · 1 file · -2/+2
* Bump the crate version to 0.8.1. (#2662) · Laurent Mazare · 2024-12-07 · 1 file · -2/+2
* Bump the crate version to 0.8.0. (#2612) · Laurent Mazare · 2024-11-12 · 1 file · -2/+2
* Bump the crate version to 0.7.2. (#2517) · Laurent Mazare · 2024-09-29 · 1 file · -2/+2
* Move the candle version to 0.7.1. (#2495) · Laurent Mazare · 2024-09-22 · 1 file · -2/+2
* Bump the crate version. (#2491) · Laurent Mazare · 2024-09-21 · 1 file · -2/+2
* Bump the version to 0.6.1. (#2438) · Laurent Mazare · 2024-08-22 · 1 file · -2/+2
* Bump the crate version. (#2248) · Laurent Mazare · 2024-06-05 · 1 file · -2/+2
* Bump the version number to 0.5.1. (#2155) · Laurent Mazare · 2024-05-03 · 1 file · -2/+2
  - Bump the version number to 0.5.1.
  - Fix clippy lints for 1.78.
  - More clippy fixes.
* Bump the version number to 0.5.0. (#2009) · Laurent Mazare · 2024-04-04 · 1 file · -2/+2
* Bump the crate versions to 0.4.2. (#1821) · Laurent Mazare · 2024-03-08 · 1 file · -2/+2
* Bump the version number to 0.4.1. (#1768) · Laurent Mazare · 2024-02-27 · 1 file · -2/+2
  - Fix the block size for some cuda kernels.
  - Bump the version number to 0.4.1.
* Bump the crate version to 0.4.0. (#1658) · Laurent Mazare · 2024-02-04 · 1 file · -2/+2
* Explicit version for packages that are not in the workspace. (#1642) · Laurent Mazare · 2024-01-31 · 1 file · -1/+1
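The entry above touches a single manifest line: in Cargo, a dependency declared only by `path` works for local builds but is stripped on `cargo publish`, so a crate that should be publishable also needs an explicit `version`. A minimal sketch of the pattern (crate name and version number are illustrative, not taken from this commit):

```toml
[dependencies]
# `path` is used for local workspace builds; `version` is what
# crates.io resolves against once the crate is published.
candle-core = { path = "../candle-core", version = "0.3.3" }
```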
* Moving to a proper build crate `bindgen_cuda`. (#1531) · Nicolas Patry · 2024-01-07 · 1 file · -2/+2
  - Moving to a proper build crate `bindgen_cuda`.
  - Fmt.
* Unpin more of the workspace relative dependencies. (#1535) · Laurent Mazare · 2024-01-07 · 1 file · -2/+2
* Bump the crate version to 0.3.3. (#1490) · Laurent Mazare · 2023-12-28 · 1 file · -3/+3
* Bump the crate version to 0.3.2. (#1452) · Laurent Mazare · 2023-12-17 · 1 file · -3/+3
* Update for 0.3.1. (#1324) · Laurent Mazare · 2023-11-11 · 1 file · -3/+3
* Bump the version to 0.3.0. (#1014) · Laurent Mazare · 2023-10-01 · 1 file · -3/+3
  - Bump the version to 0.3.0.
  - Changelog update.
* Bump the crate versions to v0.2.3. (#886) · Laurent Mazare · 2023-09-18 · 1 file · -3/+3
  - Bump the crate version.
  - Also update the python bindings.
* Bump the crate version + update the changelog. (#822) · Laurent Mazare · 2023-09-12 · 1 file · -3/+3
* Add some documentation. (#673) · Laurent Mazare · 2023-08-30 · 1 file · -3/+3
  - Add some documentation.
  - Bump the crate version.
* Bump the crate version + update CHANGELOG. (#628) · Laurent Mazare · 2023-08-27 · 1 file · -3/+3
* Add some group parameter to convolutions. (#566) · Laurent Mazare · 2023-08-23 · 1 file · -3/+3
  - Add some group parameter to convolutions.
  - Avoid some unnecessary groups checks.
  - Move the tensor convolution bits.
  - Proper handling of groups.
  - Bump the crate version.
  - And add a changelog.
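The `groups` parameter added above splits the input channels into independent chunks, each convolved with its own slice of filters. A naive, self-contained sketch of the semantics (function name, layout, and API here are illustrative, not candle's actual implementation):

```rust
/// Naive grouped 1-D convolution (no padding, stride 1).
/// input:  [c_in][len], weight: [c_out][c_in / groups][k].
fn conv1d_grouped(
    input: &[Vec<f32>],
    weight: &[Vec<Vec<f32>>],
    groups: usize,
) -> Vec<Vec<f32>> {
    let c_in = input.len();
    let c_out = weight.len();
    let k = weight[0][0].len();
    let len_out = input[0].len() - k + 1;
    let in_per_group = c_in / groups;
    let out_per_group = c_out / groups;
    let mut out = vec![vec![0.0; len_out]; c_out];
    for oc in 0..c_out {
        // Each output channel only sees the input channels of its group.
        let g = oc / out_per_group;
        for (ic_w, ic) in (g * in_per_group..(g + 1) * in_per_group).enumerate() {
            for t in 0..len_out {
                for j in 0..k {
                    out[oc][t] += input[ic][t + j] * weight[oc][ic_w][j];
                }
            }
        }
    }
    out
}

fn main() {
    // With groups = 2 and identity 1-tap filters, input channel 0 feeds
    // only output channel 0, and channel 1 only output channel 1.
    let input = vec![vec![1.0, 2.0, 3.0], vec![10.0, 20.0, 30.0]];
    let weight = vec![vec![vec![1.0]], vec![vec![1.0]]];
    println!("{:?}", conv1d_grouped(&input, &weight, 2));
}
```

With `groups = 1` this reduces to an ordinary convolution; with `groups = c_in` it behaves like a depthwise convolution.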
* Bump the crates version to 0.1.2. (#522) · Laurent Mazare · 2023-08-20 · 1 file · -3/+3
* Rename vec-dot to vec-ops. (#449) · Laurent Mazare · 2023-08-15 · 1 file · -3/+3
  - Rename vec-dot to vec-ops.
  - Also bump the crate version.
  - Add a currently empty readme.
* Add the license files. (#335) · Laurent Mazare · 2023-08-07 · 1 file · -1/+1
* Update the repo location. (#305) · Laurent Mazare · 2023-08-02 · 1 file · -1/+1
* Add version numbers for all the candle crates. (#303) · Laurent Mazare · 2023-08-02 · 1 file · -2/+2
  - Switch to candle-gemm for the time being.
  - Add the missing versions.
* Rename the candle crate to candle-core. (#301) · Laurent Mazare · 2023-08-02 · 1 file · -1/+1
  - Rename to candle-core.
  - More candle-core renaming.
* Softmax numerical stability. (#267) · Laurent Mazare · 2023-07-28 · 1 file · -0/+1
  - Softmax numerical stability.
  - Fix the flash-attn test.
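The numerical-stability fix above refers to the standard max-subtraction trick: subtracting the row maximum before exponentiating keeps every exponent at or below zero, so large logits cannot overflow to infinity. A standalone sketch of the technique (this is not candle's actual implementation):

```rust
/// Numerically stable softmax over a slice of logits.
fn softmax(xs: &[f32]) -> Vec<f32> {
    // Shift by the maximum so the largest exponent is exp(0) = 1.
    let max = xs.iter().cloned().fold(f32::NEG_INFINITY, f32::max);
    let exps: Vec<f32> = xs.iter().map(|&x| (x - max).exp()).collect();
    let sum: f32 = exps.iter().sum();
    exps.iter().map(|&e| e / sum).collect()
}

fn main() {
    // Without the shift, exp(1000.0_f32) overflows to infinity and the
    // division would yield NaN; with it, each probability is 0.5.
    println!("{:?}", softmax(&[1000.0, 1000.0]));
}
```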
* Add some flash attn test. (#253) · Laurent Mazare · 2023-07-26 · 1 file · -0/+3
  - Add some flash-attn test.
  - Add the cpu test.
  - Fail when the head is not a multiple of 8.
  - Polish the flash attention test.
* Again set a few extra params in flash-attn. (#245) · Laurent Mazare · 2023-07-26 · 1 file · -0/+2
  - Again set a few extra params.
  - Use the appropriate kernel sizes.
  - Add all the kernel sizes.
  - Parallel compiling.
  - Reduce the amount of parallelism.
  - Add the missing kernel.
  - Fix a typo.
  - Remove bf16 support for now.
* Add flash attention. (#241) · Laurent Mazare · 2023-07-26 · 1 file · -0/+18
  - Add some flash-attn kernel, import the code for flash-attn v2 from Dao-AILab.
  - More flash attn.
  - Set up the flash attn parameters.
  - Get things to compile locally.
  - Move the flash attention files in a different directory.
  - Build the static C library with nvcc.
  - Add more flash attention.
  - Update the build part.
  - Better caching.
  - Exclude flash attention from the default workspace.
  - Put flash-attn behind a feature gate.
  - Get the flash attn kernel to run.
  - Move the flags to a more appropriate place.
  - Enable flash attention in llama.
  - Use flash attention in llama.
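Two of the steps above ("Exclude flash attention from the default workspace" and "Put flash-attn behind a feature gate") follow the usual Cargo pattern: the CUDA-only crate becomes an optional dependency that a named feature pulls in on demand, so default builds need no CUDA toolchain. A minimal sketch of the pattern in a consuming crate's manifest (the exact manifest lines are illustrative, not copied from candle):

```toml
[dependencies]
candle-flash-attn = { path = "../candle-flash-attn", optional = true }

[features]
# Default builds compile without CUDA; `--features flash-attn` opts in.
default = []
flash-attn = ["dep:candle-flash-attn"]
```

The `dep:` prefix (Cargo 1.60+) enables the optional dependency without also exposing an implicit feature of the same name.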