index
:
forks/candle.git
main
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
candle-core
Commit message (
Expand
)
Author
Age
Files
Lines
*
Really unique identifier for metal device ids. (#1932)
Laurent Mazare
2024-03-25
2
-9
/
+37
*
Preliminary support for inplace ops. (#1921)
Laurent Mazare
2024-03-23
5
-2
/
+215
*
Backwards for ConvTranspose2D (#1910)
Kirpal Grewal
2024-03-23
2
-10
/
+189
*
Add support for strided index-select on Metal (#1909)
Thomas Santerre
2024-03-22
1
-8
/
+10
*
Add the alloc_uninit function. (#1901)
Laurent Mazare
2024-03-22
9
-16
/
+154
*
Add support for conv_transpose2d on Metal backend (#1903)
Thomas Santerre
2024-03-21
5
-76
/
+177
*
Async tensor copying. (#1900)
Laurent Mazare
2024-03-21
7
-4
/
+59
*
Prepare for the custom-op extension. (#1892)
Laurent Mazare
2024-03-21
5
-247
/
+256
*
Cuda backend optimization (#1886)
Laurent Mazare
2024-03-20
1
-12
/
+47
*
Minor cleanup. (#1885)
Laurent Mazare
2024-03-20
1
-4
/
+0
*
Avoid copying the data on squeeze and unsqueeze. (#1884)
Laurent Mazare
2024-03-20
2
-3
/
+42
*
Add support for conv_transpose1d for metal backend (#1874)
Thomas Santerre
2024-03-19
2
-10
/
+47
*
Add avg_pool2d metal implementation for the metal backend (#1869)
Thomas Santerre
2024-03-18
2
-7
/
+42
*
Add support for max_pool2d for Metal backend (#1863)
Thomas Santerre
2024-03-18
2
-6
/
+41
*
add test for index add and add missing match statements (#1862)
Thomas Santerre
2024-03-17
1
-1
/
+21
*
add support for casting between all datatypes (#1860)
Thomas Santerre
2024-03-17
1
-7
/
+20
*
Optimize the cat operation on contiguous tensors (#1855)
Laurent Mazare
2024-03-17
14
-206
/
+618
*
Add support for index u8/i64 and input f16/bf16 scatter-add on metal (#1849)
Thomas Santerre
2024-03-17
1
-0
/
+8
*
Implement the error trait for DTypeParseError. (#1852)
Laurent Mazare
2024-03-15
1
-2
/
+10
*
Properly handle the batch dimension in cuda quantized matmul. (#1832)
Laurent Mazare
2024-03-10
1
-1
/
+1
*
Fix dequantization. (#1823)
Laurent Mazare
2024-03-08
1
-1
/
+1
*
Fast CPU kernel for transposed 1d convolutions. (#1822)
Laurent Mazare
2024-03-08
2
-21
/
+99
*
Metal random-generation bug fixes (#1811)
Niklas Hallqvist
2024-03-08
2
-1
/
+26
*
Expose more printer options. (#1817)
Laurent Mazare
2024-03-08
1
-5
/
+30
*
Expose a couple layout methods. (#1816)
Laurent Mazare
2024-03-08
1
-3
/
+3
*
Improve metal buffer usage (#1807)
ivarflakstad
2024-03-07
2
-86
/
+137
*
Add a cuda kernel for dequantizing q8_0. (#1804)
Laurent Mazare
2024-03-05
1
-4
/
+0
*
Tweaks to run metavoice on metal (#1792)
Laurent Mazare
2024-03-03
2
-0
/
+6
*
Handle Q5_0 and Q5_1 quants in cuda.
laurent
2024-02-29
2
-24
/
+38
*
Fix the block size for some cuda kernels. (#1767)
Laurent Mazare
2024-02-27
2
-45
/
+15
*
Cuda kernel for dequantizing q8k. (#1760)
Laurent Mazare
2024-02-26
2
-22
/
+20
*
Cuda acceleration for quantized model. (#1754)
Laurent Mazare
2024-02-25
8
-69
/
+458
*
Support for attention bias in gemma + refactor things a bit. (#1744)
Laurent Mazare
2024-02-22
1
-46
/
+28
*
Add grads for interpolate1d (#1742)
Kirpal Grewal
2024-02-22
4
-6
/
+51
*
Add a couple backtraces on cpu errors. (#1738)
Laurent Mazare
2024-02-20
1
-3
/
+3
*
Bugfix for conv-transpose1d (#1734)
Laurent Mazare
2024-02-19
2
-0
/
+15
*
Support for groups in conv-transpose1d. (#1731)
Laurent Mazare
2024-02-18
3
-19
/
+43
*
Fix float unpickling. (#1730)
Laurent Mazare
2024-02-18
1
-2
/
+5
*
Module implementation for options. (#1728)
Laurent Mazare
2024-02-18
1
-0
/
+9
*
feat: add silu activation function (#1706)
OlivierDehaene
2024-02-14
8
-0
/
+169
*
Qmetal tweaks (#1704)
Laurent Mazare
2024-02-13
3
-100
/
+141
*
Fixing quantized llama demo on metal. (#1703)
Nicolas Patry
2024-02-13
3
-0
/
+19
*
Detach the tensors on batch-norm eval. (#1702)
Laurent Mazare
2024-02-13
3
-4
/
+8
*
ConvTranspose1d cuda support. (#1697)
Laurent Mazare
2024-02-12
2
-16
/
+66
*
Support defaultdict in PyTorch checkpoints. (#1696)
Laurent Mazare
2024-02-12
1
-2
/
+4
*
Pickle support: dig within the _rebuild_parameter calls. (#1681)
Laurent Mazare
2024-02-08
1
-0
/
+7
*
Add support for loading Fortran contiguous tensors (#1672)
Dilshod Tadjibaev
2024-02-07
4
-3
/
+61
*
Enhance pickle to retrieve state_dict with a given key (#1671)
Dilshod Tadjibaev
2024-02-06
5
-8
/
+60
*
Fix rustfmt. (#1669)
Laurent Mazare
2024-02-06
1
-1
/
+1
*
Fix clippy lints. (#1667)
Laurent Mazare
2024-02-06
1
-4
/
+5
[next]