summaryrefslogtreecommitdiff
path: root/candle-flash-attn/src
Commit message (Expand)AuthorAgeFilesLines
* Flash-Attn upgrade / SoftCap Candle-FlashAttn [3/n] (#2690)Michael Feil2024-12-312-4/+5
* Flash-Attn upgrade / SoftCap Candle-FlashAttn [2/n] (#2689)Michael Feil2024-12-312-0/+117
* Use flash-attn in gemma. (#2195)Laurent Mazare2024-05-181-1/+3
* chore: update flash attention kernels (#1518)OlivierDehaene2024-01-052-14/+428
* Fix for flash-attn. (#1310)Laurent Mazare2023-11-101-2/+2
* Properly set the is_bf16 flag. (#738)Laurent Mazare2023-09-041-6/+10
* BF16 support for flash-attn. (#737)Laurent Mazare2023-09-041-41/+81
* Add back the bf16 flash-attn kernels. (#730)Laurent Mazare2023-09-042-0/+3
* Relax the requirements on CustomOp. (#486)Laurent Mazare2023-08-171-2/+2
* Fix the flash-attention function names. (#282)Laurent Mazare2023-07-311-2/+2
* Flash attention without padding (varlen). (#281)Laurent Mazare2023-07-312-1/+232
* Add some flash attn test (#253)Laurent Mazare2023-07-261-10/+24
* Use bail rather than wrapping a string where possible. (#249)Laurent Mazare2023-07-261-2/+2
* Lining up the flash attn version with the non-flash one. (#248)Laurent Mazare2023-07-261-1/+18
* Again set a few extra params in flash-attn. (#245)Laurent Mazare2023-07-261-8/+8
* Proper flash-attn parameters. (#244)Laurent Mazare2023-07-262-7/+100
* Add flash attention (#241)Laurent Mazare2023-07-262-0/+94