index
:
forks/candle.git
main
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
candle-metal-kernels
Commit message (
Expand
)
Author
Age
Files
Lines
...
*
More flexible matmul contiguity checks. (#1949)
Laurent Mazare
2024-03-27
1
-4
/
+8
*
Extend supported dtypes for metal (im2col & upsample_2d) (#1938)
Thomas Santerre
2024-03-26
1
-0
/
+8
*
Contiguous variant of the rope kernel. (#1929)
Laurent Mazare
2024-03-25
2
-5
/
+73
*
Fast kernels for rotary embeddings. (#1928)
Laurent Mazare
2024-03-24
2
-0
/
+64
*
Add support for strided index-select on Metal (#1909)
Thomas Santerre
2024-03-22
3
-15
/
+119
*
Add support for conv_transpose2d on Metal backend (#1903)
Thomas Santerre
2024-03-21
2
-0
/
+144
*
RmsNorm kernel for metal. (#1895)
Laurent Mazare
2024-03-21
2
-0
/
+114
*
Add support for conv_transpose1d for metal backend (#1874)
Thomas Santerre
2024-03-19
3
-0
/
+347
*
Add avg_pool2d metal implementation for the metal backend (#1869)
Thomas Santerre
2024-03-18
3
-13
/
+194
*
Add support for max_pool2d for Metal backend (#1863)
Thomas Santerre
2024-03-18
3
-1
/
+353
*
add test for index add and add missing match statements (#1862)
Thomas Santerre
2024-03-17
2
-15
/
+139
*
add support for casting between all datatypes (#1860)
Thomas Santerre
2024-03-17
2
-96
/
+211
*
Optimize the cat operation on contiguous tensors (#1855)
Laurent Mazare
2024-03-17
3
-1
/
+78
*
Add support for index u8/i64 and input f16/bf16 scatter-add on metal (#1849)
Thomas Santerre
2024-03-17
2
-2
/
+115
*
Bump the crate versions to 0.4.2. (#1821)
Laurent Mazare
2024-03-08
1
-1
/
+1
*
Metal random-generation bug fixes (#1811)
Niklas Hallqvist
2024-03-08
2
-12
/
+24
*
Bump the version number to 0.4.1. (#1768)
Laurent Mazare
2024-02-27
1
-1
/
+1
*
feat: add silu activation function (#1706)
OlivierDehaene
2024-02-14
3
-1
/
+25
*
Bump the crate version to 0.4.0. (#1658)
Laurent Mazare
2024-02-04
1
-1
/
+1
*
Merge pull request #1606 from FL33TW00D/feature/larger-batches
Christopher Fleetwood
2024-01-29
2
-7
/
+6
|
\
|
*
chore: final
FL33TW00D
2024-01-22
2
-15
/
+10
|
*
chore: actual fix
FL33TW00D
2024-01-19
2
-2
/
+3
|
*
chore: switch to buffer
FL33TW00D
2024-01-19
2
-10
/
+14
|
*
fix: larger batches
FL33TW00D
2024-01-18
2
-7
/
+6
*
|
Merge pull request #1533 from huggingface/ivarflakstad/metal-prng
ivarflakstad
2024-01-22
3
-4
/
+402
|
\
\
|
|
/
|
/
|
|
*
Revert public EncoderParam
Ivar Flakstad
2024-01-17
1
-1
/
+1
|
*
Merge branch 'main' into ivarflakstad/metal-prng
Ivar Flakstad
2024-01-17
4
-84
/
+5300
|
|
\
|
*
|
Update metal random kernel and set_seed method
Ivar Flakstad
2024-01-17
1
-8
/
+10
|
*
|
Seed should be updated by random kernel result.
Ivar Flakstad
2024-01-15
3
-20
/
+48
|
*
|
Merge branch 'main' into ivarflakstad/metal-prng
Ivar Flakstad
2024-01-14
2
-30
/
+50
|
|
\
\
|
*
|
|
fmt
Ivar Flakstad
2024-01-12
1
-9
/
+29
|
*
|
|
Merge branch 'main' into ivarflakstad/metal-prng
Ivar Flakstad
2024-01-12
9
-24
/
+206
|
|
\
\
\
|
*
\
\
\
Merge branch 'main' into ivarflakstad/metal-prng
Ivar Flakstad
2024-01-07
6
-8
/
+77
|
|
\
\
\
\
|
*
|
|
|
|
Gaussian normal distribution of PRNG via Box-Muller transform
Ivar Flakstad
2024-01-07
3
-86
/
+178
|
*
|
|
|
|
Implement hybrid Tausworthe + LCG psuedo random number generator in metal
Ivar Flakstad
2024-01-05
3
-4
/
+264
*
|
|
|
|
|
Merge pull request #1602 from mimiquate/fix-metal-kernel-type
ivarflakstad
2024-01-18
1
-1
/
+1
|
\
\
\
\
\
\
|
|
_
|
_
|
_
|
_
|
/
|
/
|
|
|
|
|
|
*
|
|
|
|
Fixes metal kernel u8 type
Gonzalo
2024-01-17
1
-1
/
+1
|
|
|
_
|
_
|
/
|
|
/
|
|
|
*
/
|
|
|
Quantized GGUF style (#1523)
Nicolas Patry
2024-01-17
4
-75
/
+5295
|
/
/
/
/
*
|
|
|
Metal: Activate bfloat affine and add benchmark (#1543)
ivarflakstad
2024-01-12
1
-7
/
+7
*
|
|
|
Metal: f16 and bf16 where_cond + benchmark (#1545)
ivarflakstad
2024-01-12
1
-23
/
+43
|
|
_
|
/
|
/
|
|
*
|
|
remove metal version check
Baye Dieng
2024-01-11
1
-2
/
+0
*
|
|
close ifdef
Baye Dieng
2024-01-11
1
-1
/
+1
*
|
|
feat(bf16): add cast support + tests for cast + bin ops (#1524)
Kyle McCarthy
2024-01-11
4
-15
/
+191
*
|
|
Use __HAVE_BFLOAT__ to check for bfloat support instead of metal version chec...
ivarflakstad
2024-01-10
6
-6
/
+6
*
|
|
Add relu kernel for metal (#1488)
Juarez Bochi
2024-01-10
2
-2
/
+10
|
|
/
|
/
|
*
|
Adding bfloat16 support for the cast kernels. (#1520)
Nicolas Patry
2024-01-04
1
-0
/
+2
*
|
Metal: support unary abs (#1503)
Gonzalo
2023-12-30
2
-1
/
+5
*
|
Metal: more u8/u32 (#1502)
Gonzalo
2023-12-29
4
-4
/
+17
*
|
Metal: i64 basic support (#1495)
Gonzalo
2023-12-29
6
-1
/
+48
*
|
fix bad pattern matching and function name
Baye Dieng
2023-12-29
2
-4
/
+4
[prev]
[next]