summaryrefslogtreecommitdiff
path: root/candle-metal-kernels
Commit message (Expand)AuthorAgeFilesLines
* Add missing bfloat unary strided kernels and fix typo (#2058)ivarflakstad2024-04-141-1/+1
* Support gather on bf16 for metal. (#2035)Laurent Mazare2024-04-101-0/+3
* Use BufferOffset in metal backend ops. (#2029)Laurent Mazare2024-04-082-128/+78
* Rework the buffer offset logic for metal kernels (#2028)Laurent Mazare2024-04-073-247/+262
* Optimize copy-2d for metal. (#2024)Laurent Mazare2024-04-072-20/+57
* Add the rope THD kernel. (#2014)Laurent Mazare2024-04-052-4/+89
* Add support for "sign" on tensors (#2012)Thomas Santerre2024-04-042-1/+3
* update dtypes checks for several metal operations (#2010)Thomas Santerre2024-04-042-6/+20
* Bumping the version number to 0.5.0. (#2009)Laurent Mazare2024-04-041-1/+1
* Minor cleanups in reduce.metal. (#2004)Laurent Mazare2024-04-041-23/+1
* refactor to reduce the amount of code wrapped in template syntax (#2002)Thomas Santerre2024-04-041-261/+368
* Fix for the RWKV models. (#1955)Laurent Mazare2024-03-281-4/+4
* Support i64 in index-select on metal. (#1951)Laurent Mazare2024-03-271-1/+7
* More flexible matmul contiguity checks. (#1949)Laurent Mazare2024-03-271-4/+8
* Extend supported dtypes for metal (im2col & upsample_2d) (#1938)Thomas Santerre2024-03-261-0/+8
* Contiguous variant of the rope kernel. (#1929)Laurent Mazare2024-03-252-5/+73
* Fast kernels for rotary embeddings. (#1928)Laurent Mazare2024-03-242-0/+64
* Add support for strided index-select on Metal (#1909)Thomas Santerre2024-03-223-15/+119
* Add support for conv_transpose2d on Metal backend (#1903)Thomas Santerre2024-03-212-0/+144
* RmsNorm kernel for metal. (#1895)Laurent Mazare2024-03-212-0/+114
* Add support for conv_transpose1d for metal backend (#1874)Thomas Santerre2024-03-193-0/+347
* Add avg_pool2d metal implementation for the metal backend (#1869)Thomas Santerre2024-03-183-13/+194
* Add support for max_pool2d for Metal backend (#1863)Thomas Santerre2024-03-183-1/+353
* add test for index add and add missing match statements (#1862)Thomas Santerre2024-03-172-15/+139
* add support for casting between all datatypes (#1860)Thomas Santerre2024-03-172-96/+211
* Optimize the cat operation on contiguous tensors (#1855)Laurent Mazare2024-03-173-1/+78
* Add support for index u8/i64 and input f16/bf16 scatter-add on metal (#1849)Thomas Santerre2024-03-172-2/+115
* Bump the crate versions to 0.4.2. (#1821)Laurent Mazare2024-03-081-1/+1
* Metal random-generation bug fixes (#1811)Niklas Hallqvist2024-03-082-12/+24
* Bump the version number to 0.4.1. (#1768)Laurent Mazare2024-02-271-1/+1
* feat: add silu activation function (#1706)OlivierDehaene2024-02-143-1/+25
* Bump the crate version to 0.4.0. (#1658)Laurent Mazare2024-02-041-1/+1
* Merge pull request #1606 from FL33TW00D/feature/larger-batchesChristopher Fleetwood2024-01-292-7/+6
|\
| * chore: finalFL33TW00D2024-01-222-15/+10
| * chore: actual fixFL33TW00D2024-01-192-2/+3
| * chore: switch to bufferFL33TW00D2024-01-192-10/+14
| * fix: larger batchesFL33TW00D2024-01-182-7/+6
* | Merge pull request #1533 from huggingface/ivarflakstad/metal-prngivarflakstad2024-01-223-4/+402
|\ \ | |/ |/|
| * Revert public EncoderParamIvar Flakstad2024-01-171-1/+1
| * Merge branch 'main' into ivarflakstad/metal-prngIvar Flakstad2024-01-174-84/+5300
| |\
| * | Update metal random kernel and set_seed methodIvar Flakstad2024-01-171-8/+10
| * | Seed should be updated by random kernel result.Ivar Flakstad2024-01-153-20/+48
| * | Merge branch 'main' into ivarflakstad/metal-prngIvar Flakstad2024-01-142-30/+50
| |\ \
| * | | fmtIvar Flakstad2024-01-121-9/+29
| * | | Merge branch 'main' into ivarflakstad/metal-prngIvar Flakstad2024-01-129-24/+206
| |\ \ \
| * \ \ \ Merge branch 'main' into ivarflakstad/metal-prngIvar Flakstad2024-01-076-8/+77
| |\ \ \ \
| * | | | | Gaussian normal distribution of PRNG via Box-Muller transformIvar Flakstad2024-01-073-86/+178
| * | | | | Implement hybrid Tausworthe + LCG psuedo random number generator in metalIvar Flakstad2024-01-053-4/+264
* | | | | | Merge pull request #1602 from mimiquate/fix-metal-kernel-typeivarflakstad2024-01-181-1/+1
|\ \ \ \ \ \ | |_|_|_|_|/ |/| | | | |
| * | | | | Fixes metal kernel u8 typeGonzalo2024-01-171-1/+1
| | |_|_|/ | |/| | |