index
:
forks/candle.git
main
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
candle-metal-kernels
Commit message (
Expand
)
Author
Age
Files
Lines
*
Add missing bfloat unary strided kernels and fix typo (#2058)
ivarflakstad
2024-04-14
1
-1
/
+1
*
Support gather on bf16 for metal. (#2035)
Laurent Mazare
2024-04-10
1
-0
/
+3
*
Use BufferOffset in metal backend ops. (#2029)
Laurent Mazare
2024-04-08
2
-128
/
+78
*
Rework the buffer offset logic for metal kernels (#2028)
Laurent Mazare
2024-04-07
3
-247
/
+262
*
Optimize copy-2d for metal. (#2024)
Laurent Mazare
2024-04-07
2
-20
/
+57
*
Add the rope THD kernel. (#2014)
Laurent Mazare
2024-04-05
2
-4
/
+89
*
Add support for "sign" on tensors (#2012)
Thomas Santerre
2024-04-04
2
-1
/
+3
*
update dtypes checks for several metal operations (#2010)
Thomas Santerre
2024-04-04
2
-6
/
+20
*
Bumping the version number to 0.5.0. (#2009)
Laurent Mazare
2024-04-04
1
-1
/
+1
*
Minor cleanups in reduce.metal. (#2004)
Laurent Mazare
2024-04-04
1
-23
/
+1
*
refactor to reduce the amount of code wrapped in template syntax (#2002)
Thomas Santerre
2024-04-04
1
-261
/
+368
*
Fix for the RWKV models. (#1955)
Laurent Mazare
2024-03-28
1
-4
/
+4
*
Support i64 in index-select on metal. (#1951)
Laurent Mazare
2024-03-27
1
-1
/
+7
*
More flexible matmul contiguity checks. (#1949)
Laurent Mazare
2024-03-27
1
-4
/
+8
*
Extend supported dtypes for metal (im2col & upsample_2d) (#1938)
Thomas Santerre
2024-03-26
1
-0
/
+8
*
Contiguous variant of the rope kernel. (#1929)
Laurent Mazare
2024-03-25
2
-5
/
+73
*
Fast kernels for rotary embeddings. (#1928)
Laurent Mazare
2024-03-24
2
-0
/
+64
*
Add support for strided index-select on Metal (#1909)
Thomas Santerre
2024-03-22
3
-15
/
+119
*
Add support for conv_transpose2d on Metal backend (#1903)
Thomas Santerre
2024-03-21
2
-0
/
+144
*
RmsNorm kernel for metal. (#1895)
Laurent Mazare
2024-03-21
2
-0
/
+114
*
Add support for conv_transpose1d for metal backend (#1874)
Thomas Santerre
2024-03-19
3
-0
/
+347
*
Add avg_pool2d metal implementation for the metal backend (#1869)
Thomas Santerre
2024-03-18
3
-13
/
+194
*
Add support for max_pool2d for Metal backend (#1863)
Thomas Santerre
2024-03-18
3
-1
/
+353
*
add test for index add and add missing match statements (#1862)
Thomas Santerre
2024-03-17
2
-15
/
+139
*
add support for casting between all datatypes (#1860)
Thomas Santerre
2024-03-17
2
-96
/
+211
*
Optimize the cat operation on contiguous tensors (#1855)
Laurent Mazare
2024-03-17
3
-1
/
+78
*
Add support for index u8/i64 and input f16/bf16 scatter-add on metal (#1849)
Thomas Santerre
2024-03-17
2
-2
/
+115
*
Bump the crate versions to 0.4.2. (#1821)
Laurent Mazare
2024-03-08
1
-1
/
+1
*
Metal random-generation bug fixes (#1811)
Niklas Hallqvist
2024-03-08
2
-12
/
+24
*
Bump the version number to 0.4.1. (#1768)
Laurent Mazare
2024-02-27
1
-1
/
+1
*
feat: add silu activation function (#1706)
OlivierDehaene
2024-02-14
3
-1
/
+25
*
Bump the crate version to 0.4.0. (#1658)
Laurent Mazare
2024-02-04
1
-1
/
+1
*
Merge pull request #1606 from FL33TW00D/feature/larger-batches
Christopher Fleetwood
2024-01-29
2
-7
/
+6
|
\
|
*
chore: final
FL33TW00D
2024-01-22
2
-15
/
+10
|
*
chore: actual fix
FL33TW00D
2024-01-19
2
-2
/
+3
|
*
chore: switch to buffer
FL33TW00D
2024-01-19
2
-10
/
+14
|
*
fix: larger batches
FL33TW00D
2024-01-18
2
-7
/
+6
*
|
Merge pull request #1533 from huggingface/ivarflakstad/metal-prng
ivarflakstad
2024-01-22
3
-4
/
+402
|
\
\
|
|
/
|
/
|
|
*
Revert public EncoderParam
Ivar Flakstad
2024-01-17
1
-1
/
+1
|
*
Merge branch 'main' into ivarflakstad/metal-prng
Ivar Flakstad
2024-01-17
4
-84
/
+5300
|
|
\
|
*
|
Update metal random kernel and set_seed method
Ivar Flakstad
2024-01-17
1
-8
/
+10
|
*
|
Seed should be updated by random kernel result.
Ivar Flakstad
2024-01-15
3
-20
/
+48
|
*
|
Merge branch 'main' into ivarflakstad/metal-prng
Ivar Flakstad
2024-01-14
2
-30
/
+50
|
|
\
\
|
*
|
|
fmt
Ivar Flakstad
2024-01-12
1
-9
/
+29
|
*
|
|
Merge branch 'main' into ivarflakstad/metal-prng
Ivar Flakstad
2024-01-12
9
-24
/
+206
|
|
\
\
\
|
*
\
\
\
Merge branch 'main' into ivarflakstad/metal-prng
Ivar Flakstad
2024-01-07
6
-8
/
+77
|
|
\
\
\
\
|
*
|
|
|
|
Gaussian normal distribution of PRNG via Box-Muller transform
Ivar Flakstad
2024-01-07
3
-86
/
+178
|
*
|
|
|
|
Implement hybrid Tausworthe + LCG psuedo random number generator in metal
Ivar Flakstad
2024-01-05
3
-4
/
+264
*
|
|
|
|
|
Merge pull request #1602 from mimiquate/fix-metal-kernel-type
ivarflakstad
2024-01-18
1
-1
/
+1
|
\
\
\
\
\
\
|
|
_
|
_
|
_
|
_
|
/
|
/
|
|
|
|
|
|
*
|
|
|
|
Fixes metal kernel u8 type
Gonzalo
2024-01-17
1
-1
/
+1
|
|
|
_
|
_
|
/
|
|
/
|
|
|
[next]