index : forks/candle.git (branch: main)
path: root/candle-kernels
| Commit message | Author | Date | Files | Lines |
|---|---|---|---|---|
| Bump the version number to 0.4.1. (#1768) | Laurent Mazare | 2024-02-27 | 1 | -1/+1 |
| Cuda kernel for dequantizing q8k. (#1760) | Laurent Mazare | 2024-02-26 | 1 | -0/+35 |
| Cuda acceleration for quantized model. (#1754) | Laurent Mazare | 2024-02-25 | 2 | -0/+1537 |
| Fix the silu cuda kernel. (#1710) | Laurent Mazare | 2024-02-14 | 1 | -1/+1 |
| feat: add silu activation function (#1706) | OlivierDehaene | 2024-02-14 | 1 | -0/+9 |
| ConvTranspose1d cuda support. (#1697) | Laurent Mazare | 2024-02-12 | 1 | -2/+77 |
| Bump the crate version to 0.4.0. (#1658) | Laurent Mazare | 2024-02-04 | 1 | -1/+1 |
| Moving to a proper build crate `bindgen_cuda`. (#1531) | Nicolas Patry | 2024-01-07 | 2 | -242/+5 |
| Bump the crate version to 0.3.3. (#1490) | Laurent Mazare | 2023-12-28 | 1 | -1/+1 |
| Bump the crate version to 0.3.2. (#1452) | Laurent Mazare | 2023-12-17 | 1 | -1/+1 |
| Update for 0.3.1. (#1324) | Laurent Mazare | 2023-11-11 | 1 | -2/+2 |
| Rework the cuda casting bits. (#1112) | Laurent Mazare | 2023-10-17 | 1 | -31/+54 |
| feat: parse Cuda compute cap from env (#1066) | OlivierDehaene | 2023-10-16 | 2 | -89/+110 |
| fix: fix index_select cuda kernel for src target dim different than ids dim w... | Gonzalo | 2023-10-05 | 1 | -6/+8 |
| Add the rounding operators. (#1030) | Laurent Mazare | 2023-10-04 | 2 | -0/+24 |
| Bump the version to 0.3.0. (#1014) | Laurent Mazare | 2023-10-01 | 1 | -1/+1 |
| fix: add missing gpu fill_* (#996) | Gonzalo | 2023-09-29 | 1 | -0/+9 |
| Optimize the index-select cuda kernel. (#976) | Laurent Mazare | 2023-09-28 | 1 | -14/+8 |
| Add the missing kernel. (#955) | Laurent Mazare | 2023-09-24 | 1 | -0/+1 |
| cuda cast i64 (#925) | Gonzalo | 2023-09-21 | 1 | -0/+10 |
| Add an erf based gelu op (#900) | Laurent Mazare | 2023-09-19 | 2 | -0/+25 |
| Bump the crate versions to v0.2.3. (#886) | Laurent Mazare | 2023-09-18 | 1 | -1/+1 |
| Add `CANDLE_NVCC_CCBIN` support for `candle-kernels`, and eliminate warning. ... | Charles Lew | 2023-09-13 | 1 | -2/+9 |
| Bump the crate version + update the changelog. (#822) | Laurent Mazare | 2023-09-12 | 1 | -1/+1 |
| im2col version of the conv1d kernel. (#815) | Laurent Mazare | 2023-09-11 | 1 | -1/+70 |
| im2col based conv2d (#802) | Laurent Mazare | 2023-09-10 | 1 | -0/+89 |
| Add a dedicated cuda kernel for softmax. (#746) | Laurent Mazare | 2023-09-05 | 1 | -0/+55 |
| Add tanh. (#675) | Laurent Mazare | 2023-08-30 | 1 | -0/+4 |
| Add some documentation. (#673) | Laurent Mazare | 2023-08-30 | 1 | -1/+1 |
| Support dilation in conv-transpose2d. (#671) | Laurent Mazare | 2023-08-30 | 1 | -3/+3 |
| Add the powf op. (#664) | Laurent Mazare | 2023-08-29 | 1 | -0/+4 |
| Fix the dilated convolutions. (#659) | Laurent Mazare | 2023-08-29 | 1 | -2/+2 |
| Dilated convolutions (#657) | Laurent Mazare | 2023-08-29 | 1 | -6/+12 |
| Cuda conv transpose (#645) | Laurent Mazare | 2023-08-28 | 1 | -0/+88 |
| Bump the crate version + update CHANGELOG. (#628) | Laurent Mazare | 2023-08-27 | 1 | -1/+1 |
| Let's keep the dirty code on its own. | Nicolas Patry | 2023-08-25 | 1 | -2/+25 |
| Intermediary float cast is necessary for cuda 11.8 | Nicolas Patry | 2023-08-25 | 1 | -2/+2 |
| `static_cast` ? | Nicolas Patry | 2023-08-25 | 1 | -2/+2 |
| Different casting ? | Nicolas Patry | 2023-08-25 | 1 | -2/+2 |
| Repairing cast bf16/f16 | Nicolas Patry | 2023-08-25 | 1 | -4/+4 |
| Add to the cuda example a reproduction of the issue. (#579) | Laurent Mazare | 2023-08-24 | 1 | -10/+11 |
| Add some group parameter to convolutions. (#566) | Laurent Mazare | 2023-08-23 | 1 | -1/+1 |
| Add support for i64 (#563) | Laurent Mazare | 2023-08-23 | 6 | -1/+65 |
| Add a yolo-v3 example. (#528) | Laurent Mazare | 2023-08-20 | 1 | -0/+12 |
| Bump the crates version to 0.1.2. (#522) | Laurent Mazare | 2023-08-20 | 1 | -1/+1 |
| Rename vec-dot to vec-ops. (#449) | Laurent Mazare | 2023-08-15 | 1 | -1/+1 |
| Add a cuda kernel for upsampling. (#441) | Laurent Mazare | 2023-08-14 | 1 | -0/+62 |
| Add a cuda kernel for avg-pool2d. (#440) | Laurent Mazare | 2023-08-14 | 1 | -3/+157 |
| Add a naive conv2d cuda kernel. (#438) | Laurent Mazare | 2023-08-14 | 1 | -8/+93 |
| Compat windows. | Nicolas Patry | 2023-08-10 | 1 | -0/+9 |