summaryrefslogtreecommitdiff
path: root/candle-metal-kernels/src/reduce.metal
Commit message (Expand)AuthorAgeFilesLines
* Add the layernorm specialized op. (#2212)Laurent Mazare2024-05-241-0/+81
* Add the rope THD kernel. (#2014)Laurent Mazare2024-04-051-4/+44
* update dtypes checks for several metal operations (#2010)Thomas Santerre2024-04-041-0/+4
* Minor cleanups in reduce.metal. (#2004)Laurent Mazare2024-04-041-23/+1
* refactor to reduce the amount of code wrapped in template syntax (#2002)Thomas Santerre2024-04-041-261/+368
* Contiguous variant of the rope kernel. (#1929)Laurent Mazare2024-03-251-5/+30
* Fast kernels for rotary embeddings. (#1928)Laurent Mazare2024-03-241-0/+23
* RmsNorm kernel for metal. (#1895)Laurent Mazare2024-03-211-0/+56
* Use __HAVE_BFLOAT__ to check for bfloat support instead of metal version chec...ivarflakstad2024-01-101-1/+1
* Metal: more u8/u32 (#1502)Gonzalo2023-12-291-0/+5
* Metal: i64 basic support (#1495)Gonzalo2023-12-291-0/+9
* Finish reduce kernels.Nicolas Patry2023-12-171-10/+153
* Renamed all kernel names.Nicolas Patry2023-12-151-6/+6
* Fixing softmax.Nicolas Patry2023-12-151-4/+7
* Fix softmax for long sequences (missing barrier).Nicolas Patry2023-12-141-6/+9
* Lots of updates including some stack of command buffers.nicolas2023-12-121-1/+1
* Starting to fix some tests.Nicolas Patry2023-11-301-76/+80
* Adding indexing.Nicolas Patry2023-11-201-39/+54
* Adding the actual backendNicolas Patry2023-11-201-0/+124