summaryrefslogtreecommitdiff
path: root/candle-kernels/src/unary.cu
Commit message (Expand)AuthorAgeFilesLines
* Fix sigmoid gradient calculation and move sigmoid into a specialized op (#2114)MilkFather2024-04-291-0/+9
* Add support for "sign" on tensors (#2012)Thomas Santerre2024-04-041-0/+9
* Cuda backend optimization (#1886)Laurent Mazare2024-03-201-2/+2
* Fix the silu cuda kernel. (#1710)Laurent Mazare2024-02-141-1/+1
* feat: add silu activation function (#1706)OlivierDehaene2024-02-141-0/+9
* Add the rounding operators. (#1030)Laurent Mazare2023-10-041-0/+12
* Add the missing kernel. (#955)Laurent Mazare2023-09-241-0/+1
* Add an erf based gelu op (#900)Laurent Mazare2023-09-191-0/+17
* Add tanh. (#675)Laurent Mazare2023-08-301-0/+4
* Add the powf op. (#664)Laurent Mazare2023-08-291-0/+4
* Add the recip op + use it in stable-diffusion. (#331)Laurent Mazare2023-08-061-0/+4
* Add some flash attn test (#253)Laurent Mazare2023-07-261-2/+2
* Add the elu cuda kernel. (#114)Laurent Mazare2023-07-101-0/+38
* Tweak the include order to include math.h first. (#100)Laurent Mazare2023-07-071-1/+1
* Include the math.h file to get access to constants. (#99)Laurent Mazare2023-07-071-0/+2
* Minor tweaks.laurent2023-07-031-0/+3
* Add the bf16 cuda kernels.laurent2023-06-291-0/+14
* Add the relu op.laurent2023-06-281-4/+13
* Refactor the hierarchy.Nicolas Patry2023-06-271-0/+69