forks/candle.git -

	Commit message	Author	Age	Files	Lines
*	Flash-Attn upgrade / SoftCap Candle-FlashAttn [3/n] (#2690)	Michael Feil	2024-12-31	1	-0/+2
	* update flash-attn v1 * restore: hdim224 * add 224 flash_fwd_template * remove whitespace * softcap is working, including test and api. * make softcap test case better * unpadded lse added
*	Flash-Attn upgrade / SoftCap Candle-FlashAttn [2/n] (#2689)	Michael Feil	2024-12-31	1	-3/+13
	* update flash-attn v1 * restore: hdim224 * add 224 flash_fwd_template * remove whitespace * softcap is working, including test and api. * make softcap test case better --------- Co-authored-by: laurent <laurent.mazare@gmail.com>
*	Update the flash attn kernels. (#2333)	Laurent Mazare	2024-07-15	1	-10/+10
*	chore: update flash attention kernels (#1518)	OlivierDehaene	2024-01-05	1	-14/+26
	* chore: update flash attention kernels * fmt * remove unused kernels * force f32 * correct stride
*	Add back the bf16 flash-attn kernels. (#730)	Laurent Mazare	2023-09-04	1	-13/+13
*	Flash attention without padding (varlen). (#281)	Laurent Mazare	2023-07-31	1	-3/+6
	* Expose the seqlen variable for flash-attn without padding. * Fix the batched call. * Adapt for the varlen variant. * No need to set the batch strides when in varlen mode. * Add a test (disabled at the moment). * Get the test to work properly.
*	Again set a few extra params in flash-attn. (#245)	Laurent Mazare	2023-07-26	1	-0/+109
	* Again set a few extra params. * Use the appropriate kernel sizes. * Add all the kernel sizes. * Parallel compiling. * Reduce the amount of parallelism. * Add the missing kernel. * Fix a typo. * Remove bf16 support for now.