index
:
forks/candle.git
main
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
candle-core
/
src
/
metal_backend
Commit message (
Expand
)
Author
Age
Files
Lines
*
add scatter add (#2656)
zachcp
2024-12-01
1
-0
/
+1
*
add u32 - U32 gather (#2653)
zachcp
2024-11-30
1
-0
/
+1
*
20241118 docs (#2629)
zachcp
2024-11-19
1
-0
/
+2
*
Add some missing index-select metal kernels. (#2613)
Laurent Mazare
2024-11-12
1
-1
/
+10
*
UG metal integration. (#2580)
Laurent Mazare
2024-10-27
1
-0
/
+22
*
Switch to using the MLX matmul by default. (#2547)
Laurent Mazare
2024-10-06
1
-3
/
+3
*
Efficient implementation of `Tensor::ones()` for `metal` (#2512)
Anubhab Bandyopadhyay
2024-10-01
1
-4
/
+32
*
Metal commands refactoring (#2489)
Laurent Mazare
2024-09-21
2
-99
/
+113
*
Add a couple cast metal kernels. (#2479)
Laurent Mazare
2024-09-15
1
-8
/
+31
*
Missing metal kernels. (#2474)
Laurent Mazare
2024-09-12
1
-0
/
+2
*
Hook the MLX matmul kernels in candle-core. (#2473)
Laurent Mazare
2024-09-12
2
-0
/
+38
*
Use the new MLX kernels to handle the BF16 matmul. (#2470)
Laurent Mazare
2024-09-11
1
-24
/
+44
*
Enable BF16 on metal. (#2380)
Laurent Mazare
2024-08-01
1
-0
/
+1
*
Enable the affine kernel for u8/u32. (#2376)
Laurent Mazare
2024-08-01
1
-0
/
+2
*
Add a basic metal example with capture (#2324)
Laurent Mazare
2024-07-09
1
-1
/
+7
*
Fix a bug in the metal implemtation of col2im1d. (#2284)
Laurent Mazare
2024-06-22
1
-1
/
+6
*
add where_cond f32 for metal (#2236)
Lionel Touati
2024-06-02
1
-0
/
+1
*
Add a metal kernel for col2im1d. (#2214)
Laurent Mazare
2024-05-25
1
-34
/
+92
*
Use write rather than try-write on the metal rw-locks. (#2162)
Laurent Mazare
2024-05-05
2
-7
/
+13
*
Separate quantized phi-3 implementation. (#2157)
Laurent Mazare
2024-05-04
1
-3
/
+0
*
Add argsort. (#2132)
Laurent Mazare
2024-04-27
1
-1
/
+1
*
Add StorageRef. (#2113)
Laurent Mazare
2024-04-23
1
-1
/
+14
*
Metal Unary: Add benchmarks and process kernels in a tile based fashion (#2056)
Thomas Santerre
2024-04-21
1
-147
/
+232
*
Fix for the batch dim in the quantized matmul example. (#2073)
Laurent Mazare
2024-04-15
1
-1
/
+1
*
Add missing bfloat unary strided kernels and fix typo (#2058)
ivarflakstad
2024-04-14
1
-0
/
+20
*
Add a synchronize method to devices. (#2055)
Laurent Mazare
2024-04-14
1
-0
/
+4
*
Support gather on bf16 for metal. (#2035)
Laurent Mazare
2024-04-10
1
-0
/
+1
*
Use BufferOffset in metal backend ops. (#2029)
Laurent Mazare
2024-04-08
1
-50
/
+39
*
Rework the buffer offset logic for metal kernels (#2028)
Laurent Mazare
2024-04-07
1
-39
/
+43
*
Add support for "sign" on tensors (#2012)
Thomas Santerre
2024-04-04
1
-0
/
+4
*
update dtypes checks for several metal operations (#2010)
Thomas Santerre
2024-04-04
1
-27
/
+45
*
Backend refactoring. (#1966)
Laurent Mazare
2024-03-29
2
-0
/
+2071