| Commit message | Author | Age | Files | Lines |
* | feat: parse Cuda compute cap from env (#1066) | OlivierDehaene | 2023-10-16 | 1 | -36/+52 |
* | Bump the version to 0.3.0. (#1014) | Laurent Mazare | 2023-10-01 | 1 | -3/+3 |
* | Bump the crate versions to v0.2.3. (#886) | Laurent Mazare | 2023-09-18 | 1 | -3/+3 |
* | Bump the crate version + update the changelog. (#822) | Laurent Mazare | 2023-09-12 | 1 | -3/+3 |
* | Shape with holes (#770) | Laurent Mazare | 2023-09-08 | 1 | -3/+6 |
* | Add small customization to the build (#768) | Zsombor | 2023-09-08 | 1 | -4/+20 |
* | Properly set the is_bf16 flag. (#738) | Laurent Mazare | 2023-09-04 | 1 | -6/+10 |
* | BF16 support for flash-attn. (#737) | Laurent Mazare | 2023-09-04 | 1 | -41/+81 |
* | Add back the bf16 flash-attn kernels. (#730) | Laurent Mazare | 2023-09-04 | 4 | -22/+25 |
* | Add some documentation. (#673) | Laurent Mazare | 2023-08-30 | 1 | -3/+3 |
* | Bump the crate version + update CHANGELOG. (#628) | Laurent Mazare | 2023-08-27 | 1 | -3/+3 |
* | Add some group parameter to convolutions. (#566) | Laurent Mazare | 2023-08-23 | 1 | -3/+3 |
* | Bump the crates version to 0.1.2. (#522) | Laurent Mazare | 2023-08-20 | 1 | -3/+3 |
* | Relax the requirements on CustomOp. (#486) | Laurent Mazare | 2023-08-17 | 1 | -2/+2 |
* | add c++17 flags (#452) | Chengxu Yang | 2023-08-15 | 1 | -0/+1 |
* | Rename vec-dot to vec-ops. (#449) | Laurent Mazare | 2023-08-15 | 1 | -3/+3 |
* | Add the license files. (#335) | Laurent Mazare | 2023-08-07 | 1 | -1/+1 |
* | Update the repo location. (#305) | Laurent Mazare | 2023-08-02 | 1 | -1/+1 |
* | Add some missing readme files. (#304) | Laurent Mazare | 2023-08-02 | 1 | -0/+1 |
* | Add version numbers for all the candle crates (#303) | Laurent Mazare | 2023-08-02 | 1 | -2/+2 |
* | Rename the candle crate to candle-core (#301) | Laurent Mazare | 2023-08-02 | 1 | -1/+1 |
* | Fix the flash-attention function names. (#282) | Laurent Mazare | 2023-07-31 | 1 | -2/+2 |
* | Flash attention without padding (varlen). (#281) | Laurent Mazare | 2023-07-31 | 4 | -4/+283 |
* | Softmax numerical stability. (#267) | Laurent Mazare | 2023-07-28 | 2 | -1/+2 |
* | Add some flash attn test (#253) | Laurent Mazare | 2023-07-26 | 4 | -12/+123 |
* | Use bail rather than wrapping a string where possible. (#249) | Laurent Mazare | 2023-07-26 | 1 | -2/+2 |
* | Lining up the flash attn version with the non-flash one. (#248) | Laurent Mazare | 2023-07-26 | 1 | -1/+18 |
* | Again set a few extra params in flash-attn. (#245) | Laurent Mazare | 2023-07-26 | 20 | -115/+471 |
* | Proper flash-attn parameters. (#244) | Laurent Mazare | 2023-07-26 | 3 | -8/+122 |
* | Specific cache dir for the flash attn build artifacts. (#242) | Laurent Mazare | 2023-07-26 | 1 | -10/+10 |
* | Add flash attention (#241) | Laurent Mazare | 2023-07-26 | 15 | -0/+2655 |