forks/candle.git (branch: main), commit log for path: candle-metal-kernels
| Commit message | Author | Age | Files | Lines |
|---|---|---|---|---|
| Sync upstream MLX sdpa vector kernels with mask (#2718) (HEAD, main) | Eric Buehler | 2025-01-16 | 2 | -28/+412 |
| Bump the caret version to 0.8.2. (#2703) | Laurent Mazare | 2025-01-07 | 1 | -1/+1 |
| Bump the crate version to 0.8.1. (#2662) | Laurent Mazare | 2024-12-07 | 1 | -1/+1 |
| add scatter add (#2656) | zachcp | 2024-12-01 | 1 | -0/+1 |
| add u32 - U32 gather (#2653) | zachcp | 2024-11-30 | 1 | -79/+80 |
| Lint fixes introduced with Rust 1.83 (#2646) | Anubhab Bandyopadhyay | 2024-11-28 | 2 | -17/+20 |
| Add some missing index-select metal kernels. (#2613) | Laurent Mazare | 2024-11-12 | 1 | -0/+4 |
| Bump the crate version to 0.8.0. (#2612) | Laurent Mazare | 2024-11-12 | 1 | -1/+1 |
| Add some fast Metal MLX SDPA kernels (#2584) | Eric Buehler | 2024-11-05 | 2 | -1/+1579 |
| UG metal integration. (#2580) | Laurent Mazare | 2024-10-27 | 2 | -8/+4 |
| Tweak some metal tests. (#2528) | Laurent Mazare | 2024-10-02 | 2 | -62/+23 |
| Efficient implementation of `Tensor::ones()` for `metal` (#2512) | Anubhab Bandyopadhyay | 2024-10-01 | 3 | -0/+132 |
| Bump the crate version to 0.7.2. (#2517) | Laurent Mazare | 2024-09-29 | 1 | -1/+1 |
| Move the candle version to 0.7.1. (#2495) | Laurent Mazare | 2024-09-22 | 1 | -1/+1 |
| Bump the crate version. (#2491) | Laurent Mazare | 2024-09-21 | 1 | -1/+1 |
| Bugfix for the metal elu kernel. (#2490) | Laurent Mazare | 2024-09-21 | 1 | -1/+1 |
| Metal commands refactoring (#2489) | Laurent Mazare | 2024-09-21 | 1 | -5/+28 |
| Fix for metal tanh. (#2475) | Laurent Mazare | 2024-09-13 | 1 | -3/+8 |
| Add some metal gemm benchark. (#2471) | Laurent Mazare | 2024-09-11 | 2 | -0/+138 |
| Integrate the MLX gemm kernels (#2468) | Laurent Mazare | 2024-09-11 | 5 | -55/+1874 |
| Bump the version to 0.6.1. (#2438) | Laurent Mazare | 2024-08-22 | 1 | -1/+1 |
| Revert the bf16 gemm metal changes for now. (#2386) | Laurent Mazare | 2024-08-01 | 2 | -19/+21 |
| Metal bgemm min changes (#2364) | ivarflakstad | 2024-08-01 | 3 | -4/+76 |
| Enable the affine kernel for u8/u32. (#2376) | Laurent Mazare | 2024-08-01 | 1 | -0/+2 |
| Use RAII for terminating the encoding. (#2353) | Laurent Mazare | 2024-07-24 | 2 | -61/+69 |
| Use a trait for the encoder provider (so that encoder can ultimately be reuse... | Laurent Mazare | 2024-07-24 | 2 | -120/+143 |
| Bump the crate version. (#2248) | Laurent Mazare | 2024-06-05 | 1 | -1/+1 |
| add where_cond f32 for metal (#2236) | Lionel Touati | 2024-06-02 | 1 | -0/+21 |
| Add a metal kernel for col2im1d. (#2214) | Laurent Mazare | 2024-05-25 | 2 | -1/+97 |
| Add the layernorm specialized op. (#2212) | Laurent Mazare | 2024-05-24 | 2 | -0/+144 |
| Add some missing where-cond kernels for metal. (#2203) | Laurent Mazare | 2024-05-22 | 1 | -14/+17 |
| Separate quantized phi-3 implementation. (#2157) | Laurent Mazare | 2024-05-04 | 1 | -1/+1 |
| Bump the version number to 0.5.1. (#2155) | Laurent Mazare | 2024-05-03 | 1 | -1/+1 |
| Fix sigmoid gradient calculation and move sigmoid into a specialized op (#2114) | MilkFather | 2024-04-29 | 2 | -1/+6 |
| Add argsort. (#2132) | Laurent Mazare | 2024-04-27 | 3 | -0/+138 |
| Metal Unary: Add benchmarks and process kernels in a tile based fashion (#2056) | Thomas Santerre | 2024-04-21 | 2 | -37/+97 |
| Handle multiple dimensions in metal QMM + two fixes. (#2097) | Laurent Mazare | 2024-04-20 | 1 | -7/+8 |
| Add missing bfloat unary strided kernels and fix typo (#2058) | ivarflakstad | 2024-04-14 | 1 | -1/+1 |
| Support gather on bf16 for metal. (#2035) | Laurent Mazare | 2024-04-10 | 1 | -0/+3 |
| Use BufferOffset in metal backend ops. (#2029) | Laurent Mazare | 2024-04-08 | 2 | -128/+78 |
| Rework the buffer offset logic for metal kernels (#2028) | Laurent Mazare | 2024-04-07 | 3 | -247/+262 |
| Optimize copy-2d for metal. (#2024) | Laurent Mazare | 2024-04-07 | 2 | -20/+57 |
| Add the rope THD kernel. (#2014) | Laurent Mazare | 2024-04-05 | 2 | -4/+89 |
| Add support for "sign" on tensors (#2012) | Thomas Santerre | 2024-04-04 | 2 | -1/+3 |
| update dtypes checks for several metal operations (#2010) | Thomas Santerre | 2024-04-04 | 2 | -6/+20 |
| Bumping the version number to 0.5.0. (#2009) | Laurent Mazare | 2024-04-04 | 1 | -1/+1 |
| Minor cleanups in reduce.metal. (#2004) | Laurent Mazare | 2024-04-04 | 1 | -23/+1 |
| refactor to reduce the amount of code wrapped in template syntax (#2002) | Thomas Santerre | 2024-04-04 | 1 | -261/+368 |
| Fix for the RWKV models. (#1955) | Laurent Mazare | 2024-03-28 | 1 | -4/+4 |
| Support i64 in index-select on metal. (#1951) | Laurent Mazare | 2024-03-27 | 1 | -1/+7 |