path: root/candle-core/src/metal_backend
Commit message | Author | Date | Files | Lines
* add scatter add (#2656) | zachcp | 2024-12-01 | 1 | -0/+1
* add u32 - U32 gather (#2653) | zachcp | 2024-11-30 | 1 | -0/+1
* 20241118 docs (#2629) | zachcp | 2024-11-19 | 1 | -0/+2
* Add some missing index-select metal kernels. (#2613) | Laurent Mazare | 2024-11-12 | 1 | -1/+10
* UG metal integration. (#2580) | Laurent Mazare | 2024-10-27 | 1 | -0/+22
* Switch to using the MLX matmul by default. (#2547) | Laurent Mazare | 2024-10-06 | 1 | -3/+3
* Efficient implementation of `Tensor::ones()` for `metal` (#2512) | Anubhab Bandyopadhyay | 2024-10-01 | 1 | -4/+32
* Metal commands refactoring (#2489) | Laurent Mazare | 2024-09-21 | 2 | -99/+113
* Add a couple cast metal kernels. (#2479) | Laurent Mazare | 2024-09-15 | 1 | -8/+31
* Missing metal kernels. (#2474) | Laurent Mazare | 2024-09-12 | 1 | -0/+2
* Hook the MLX matmul kernels in candle-core. (#2473) | Laurent Mazare | 2024-09-12 | 2 | -0/+38
* Use the new MLX kernels to handle the BF16 matmul. (#2470) | Laurent Mazare | 2024-09-11 | 1 | -24/+44
* Enable BF16 on metal. (#2380) | Laurent Mazare | 2024-08-01 | 1 | -0/+1
* Enable the affine kernel for u8/u32. (#2376) | Laurent Mazare | 2024-08-01 | 1 | -0/+2
* Add a basic metal example with capture (#2324) | Laurent Mazare | 2024-07-09 | 1 | -1/+7
* Fix a bug in the metal implemtation of col2im1d. (#2284) | Laurent Mazare | 2024-06-22 | 1 | -1/+6
* add where_cond f32 for metal (#2236) | Lionel Touati | 2024-06-02 | 1 | -0/+1
* Add a metal kernel for col2im1d. (#2214) | Laurent Mazare | 2024-05-25 | 1 | -34/+92
* Use write rather than try-write on the metal rw-locks. (#2162) | Laurent Mazare | 2024-05-05 | 2 | -7/+13
* Separate quantized phi-3 implementation. (#2157) | Laurent Mazare | 2024-05-04 | 1 | -3/+0
* Add argsort. (#2132) | Laurent Mazare | 2024-04-27 | 1 | -1/+1
* Add StorageRef. (#2113) | Laurent Mazare | 2024-04-23 | 1 | -1/+14
* Metal Unary: Add benchmarks and process kernels in a tile based fashion (#2056) | Thomas Santerre | 2024-04-21 | 1 | -147/+232
* Fix for the batch dim in the quantized matmul example. (#2073) | Laurent Mazare | 2024-04-15 | 1 | -1/+1
* Add missing bfloat unary strided kernels and fix typo (#2058) | ivarflakstad | 2024-04-14 | 1 | -0/+20
* Add a synchronize method to devices. (#2055) | Laurent Mazare | 2024-04-14 | 1 | -0/+4
* Support gather on bf16 for metal. (#2035) | Laurent Mazare | 2024-04-10 | 1 | -0/+1
* Use BufferOffset in metal backend ops. (#2029) | Laurent Mazare | 2024-04-08 | 1 | -50/+39
* Rework the buffer offset logic for metal kernels (#2028) | Laurent Mazare | 2024-04-07 | 1 | -39/+43
* Add support for "sign" on tensors (#2012) | Thomas Santerre | 2024-04-04 | 1 | -0/+4
* update dtypes checks for several metal operations (#2010) | Thomas Santerre | 2024-04-04 | 1 | -27/+45
* Backend refactoring. (#1966) | Laurent Mazare | 2024-03-29 | 2 | -0/+2071