index
:
forks/candle.git
main
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
candle-core
/
tests
Commit message (
Expand
)
Author
Age
Files
Lines
*
UG metal integration. (#2580)
Laurent Mazare
2024-10-27
1
-5
/
+11
*
Support for UG kernels. (#2579)
Laurent Mazare
2024-10-27
1
-0
/
+30
*
Testcases (#2567)
Anubhab Bandyopadhyay
2024-10-17
1
-0
/
+274
*
Efficient implementation of `Tensor::ones()` for `metal` (#2512)
Anubhab Bandyopadhyay
2024-10-01
1
-0
/
+30
*
Bugfix for the metal elu kernel. (#2490)
Laurent Mazare
2024-09-21
1
-0
/
+13
*
Add a minimal test for the metal bf16 matmul. (#2381)
Laurent Mazare
2024-08-01
1
-0
/
+20
*
Fix log_sum_exp to handle large positive/negative inputs (#2367)
Yun-Jhong Wu
2024-08-01
1
-3
/
+21
*
Fix for backprop in ConvTranspose2D with stride of 2 (#2337)
Ivor Wanders
2024-07-17
1
-0
/
+97
*
Add a slice_set op. (#2193)
Laurent Mazare
2024-05-18
1
-0
/
+25
*
Add SliceSafetensors. (#2179)
Laurent Mazare
2024-05-11
1
-0
/
+47
*
Add the cuda dequantize f16 kernels. (#2137)
Laurent Mazare
2024-04-28
1
-1
/
+120
*
Add a sort function. (#2134)
Laurent Mazare
2024-04-28
1
-0
/
+18
*
Add argsort. (#2132)
Laurent Mazare
2024-04-27
1
-0
/
+17
*
Add more QMMV cuda kernels. (#2077)
Laurent Mazare
2024-04-18
1
-7
/
+15
*
Add the mmv kernels for small batch sizes. (#2075)
Laurent Mazare
2024-04-16
1
-1
/
+35
*
Fix for the batch dim in the quantized matmul example. (#2073)
Laurent Mazare
2024-04-15
1
-36
/
+36
*
Handle zero dims in some simple operations. (#2064)
Laurent Mazare
2024-04-15
1
-0
/
+22
*
Add support for "sign" on tensors (#2012)
Thomas Santerre
2024-04-04
1
-0
/
+8
*
Fix the matmul layout for accelerate & mkl. (#2011)
Laurent Mazare
2024-04-04
2
-22
/
+4
*
Optimize the gelu f16 opt. (#2008)
Laurent Mazare
2024-04-04
1
-0
/
+8
*
Improve the handling of matmul with squeezed layouts. (#1998)
Laurent Mazare
2024-04-02
2
-100
/
+119
*
Support i64 in index-select on metal. (#1951)
Laurent Mazare
2024-03-27
1
-37
/
+42
*
More flexible matmul contiguity checks. (#1949)
Laurent Mazare
2024-03-27
1
-0
/
+25
*
Preliminary support for inplace ops. (#1921)
Laurent Mazare
2024-03-23
1
-0
/
+31
*
Backwards for ConvTranspose2D (#1910)
Kirpal Grewal
2024-03-23
1
-7
/
+154
*
Add support for conv_transpose2d on Metal backend (#1903)
Thomas Santerre
2024-03-21
1
-69
/
+55
*
Avoid copying the data on squeeze and unsqueeze. (#1884)
Laurent Mazare
2024-03-20
1
-1
/
+15
*
Add support for conv_transpose1d for metal backend (#1874)
Thomas Santerre
2024-03-19
1
-5
/
+0
*
Add avg_pool2d metal implementation for the metal backend (#1869)
Thomas Santerre
2024-03-18
1
-3
/
+0
*
Add support for max_pool2d for Metal backend (#1863)
Thomas Santerre
2024-03-18
1
-3
/
+0
*
Optimize the cat operation on contiguous tensors (#1855)
Laurent Mazare
2024-03-17
4
-59
/
+121
*
Fast CPU kernel for transposed 1d convolutions. (#1822)
Laurent Mazare
2024-03-08
1
-20
/
+24
*
Metal random-generation bug fixes (#1811)
Niklas Hallqvist
2024-03-08
1
-0
/
+25
*
Add a cuda kernel for dequantizing q8_0. (#1804)
Laurent Mazare
2024-03-05
1
-4
/
+0
*
Handle Q5_0 and Q5_1 quants in cuda.
laurent
2024-02-29
1
-8
/
+0
*
Fix the block size for some cuda kernels. (#1767)
Laurent Mazare
2024-02-27
1
-32
/
+0
*
Support for attention bias in gemma + refactor things a bit. (#1744)
Laurent Mazare
2024-02-22
1
-46
/
+28
*
Add grads for interpolate1d (#1742)
Kirpal Grewal
2024-02-22
1
-0
/
+33
*
Bugfix for conv-transpose1d (#1734)
Laurent Mazare
2024-02-19
1
-0
/
+14
*
Support for groups in conv-transpose1d. (#1731)
Laurent Mazare
2024-02-18
1
-1
/
+1
*
feat: add silu activation function (#1706)
OlivierDehaene
2024-02-14
2
-0
/
+20
*
ConvTranspose1d cuda support. (#1697)
Laurent Mazare
2024-02-12
1
-11
/
+9
*
Add support for loading Fortran contiguous tensors (#1672)
Dilshod Tadjibaev
2024-02-07
3
-0
/
+44
*
Enhance pickle to retrieve state_dict with a given key (#1671)
Dilshod Tadjibaev
2024-02-06
3
-1
/
+11
*
Fix rustfmt. (#1669)
Laurent Mazare
2024-02-06
1
-1
/
+1
*
Fix: pth files don't load on Windows (#1661)
Roma Klapaukh
2024-02-06
3
-0
/
+14
*
Quantized GGUF style (#1523)
Nicolas Patry
2024-01-17
1
-143
/
+430
*
Add the pow operator. (#1583)
Laurent Mazare
2024-01-13
1
-2
/
+14
*
Bugfix for dequantizing q5k layers. (#1569)
Laurent Mazare
2024-01-11
1
-1
/
+1
*
Simpler repro for the neon optimization issue + bugfix (#1544)
Laurent Mazare
2024-01-07
1
-16
/
+41
[next]