forks/candle.git
path: root/candle-metal-kernels/src/lib.rs
Graph and commit message; Author; Age; Files; Lines
* Sync upstream MLX sdpa vector kernels with mask (#2718) [HEAD, main]; Eric Buehler; 2025-01-16; 1; -1/+187
* Lint fixes introduced with Rust 1.83 (#2646); Anubhab Bandyopadhyay; 2024-11-28; 1; -10/+10
* Add some fast Metal MLX SDPA kernels (#2584); Eric Buehler; 2024-11-05; 1; -1/+322
* UG metal integration. (#2580); Laurent Mazare; 2024-10-27; 1; -1/+1
* Tweak some metal tests. (#2528); Laurent Mazare; 2024-10-02; 1; -5/+0
* Efficient implementation of `Tensor::ones()` for `metal` (#2512); Anubhab Bandyopadhyay; 2024-10-01; 1; -0/+28
* Integrate the MLX gemm kernels (#2468); Laurent Mazare; 2024-09-11; 1; -16/+195
* Metal bgemm min changes (#2364); ivarflakstad; 2024-08-01; 1; -0/+2
* Use RAII for terminating the encoding. (#2353); Laurent Mazare; 2024-07-24; 1; -49/+41
* Use a trait for the encoder provider (so that encoder can ultimately be reuse...; Laurent Mazare; 2024-07-24; 1; -120/+120
* Add a metal kernel for col2im1d. (#2214); Laurent Mazare; 2024-05-25; 1; -0/+33
* Add the layernorm specialized op. (#2212); Laurent Mazare; 2024-05-24; 1; -0/+63
* Separate quantized phi-3 implementation. (#2157); Laurent Mazare; 2024-05-04; 1; -1/+1
* Fix sigmoid gradient calculation and move sigmoid into a specialized op (#2114); MilkFather; 2024-04-29; 1; -1/+1
* Add argsort. (#2132); Laurent Mazare; 2024-04-27; 1; -0/+40
* Metal Unary: Add benchmarks and process kernels in a tile based fashion (#2056); Thomas Santerre; 2024-04-21; 1; -35/+82
* Handle multiple dimensions in metal QMM + two fixes. (#2097); Laurent Mazare; 2024-04-20; 1; -7/+8
* Use BufferOffset in metal backend ops. (#2029); Laurent Mazare; 2024-04-08; 1; -104/+51
* Rework the buffer offset logic for metal kernels (#2028); Laurent Mazare; 2024-04-07; 1; -223/+66
* Optimize copy-2d for metal. (#2024); Laurent Mazare; 2024-04-07; 1; -8/+49
* Add the rope THD kernel. (#2014); Laurent Mazare; 2024-04-05; 1; -0/+45
* Add support for "sign" on tensors (#2012); Thomas Santerre; 2024-04-04; 1; -1/+1
* Fix for the RWKV models. (#1955); Laurent Mazare; 2024-03-28; 1; -4/+4
* More flexible matmul contiguity checks. (#1949); Laurent Mazare; 2024-03-27; 1; -4/+8
* Contiguous variant of the rope kernel. (#1929); Laurent Mazare; 2024-03-25; 1; -0/+43
* Fast kernels for rotary embeddings. (#1928); Laurent Mazare; 2024-03-24; 1; -0/+41
* Add support for strided index-select on Metal (#1909); Thomas Santerre; 2024-03-22; 1; -2/+10
* Add support for conv_transpose2d on Metal backend (#1903); Thomas Santerre; 2024-03-21; 1; -0/+58
* RmsNorm kernel for metal. (#1895); Laurent Mazare; 2024-03-21; 1; -0/+58
* Add support for conv_transpose1d for metal backend (#1874); Thomas Santerre; 2024-03-19; 1; -0/+53
* Add avg_pool2d metal implementation for the metal backend (#1869); Thomas Santerre; 2024-03-18; 1; -1/+1
* Add support for max_pool2d for Metal backend (#1863); Thomas Santerre; 2024-03-18; 1; -0/+33
* Optimize the cat operation on contiguous tensors (#1855); Laurent Mazare; 2024-03-17; 1; -0/+50
* Metal random-generation bug fixes (#1811); Niklas Hallqvist; 2024-03-08; 1; -4/+8
* feat: add silu activation function (#1706); OlivierDehaene; 2024-02-14; 1; -1/+1
* Merge pull request #1606 from FL33TW00D/feature/larger-batches; Christopher Fleetwood; 2024-01-29; 1; -7/+6
|\
| * chore: final; FL33TW00D; 2024-01-22; 1; -15/+10
| * chore: actual fix; FL33TW00D; 2024-01-19; 1; -2/+3
| * chore: switch to buffer; FL33TW00D; 2024-01-19; 1; -10/+14
| * fix: larger batches; FL33TW00D; 2024-01-18; 1; -7/+6
* | Revert public EncoderParam; Ivar Flakstad; 2024-01-17; 1; -1/+1
* | Merge branch 'main' into ivarflakstad/metal-prng; Ivar Flakstad; 2024-01-17; 1; -59/+180
|\|
| * Quantized GGUF style (#1523); Nicolas Patry; 2024-01-17; 1; -52/+176
* | Seed should be updated by random kernel result.; Ivar Flakstad; 2024-01-15; 1; -4/+8
* | Merge branch 'main' into ivarflakstad/metal-prng; Ivar Flakstad; 2024-01-12; 1; -2/+2
|\|
| * Add relu kernel for metal (#1488); Juarez Bochi; 2024-01-10; 1; -2/+2
* | Merge branch 'main' into ivarflakstad/metal-prng; Ivar Flakstad; 2024-01-07; 1; -1/+12
|\|
| * Metal: support unary abs (#1503); Gonzalo; 2023-12-30; 1; -1/+4
| * Metal: more u8/u32 (#1502); Gonzalo; 2023-12-29; 1; -0/+4
| * Metal: i64 basic support (#1495); Gonzalo; 2023-12-29; 1; -0/+4