Commit message (Expand) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Metal: Activate bfloat affine and add benchmark (#1543) | ivarflakstad | 2024-01-12 | 1 | -7/+7 |
* | Metal: f16 and bf16 where_cond + benchmark (#1545) | ivarflakstad | 2024-01-12 | 1 | -23/+43 |
* | remove metal version check | Baye Dieng | 2024-01-11 | 1 | -2/+0 |
* | close ifdef | Baye Dieng | 2024-01-11 | 1 | -1/+1 |
* | feat(bf16): add cast support + tests for cast + bin ops (#1524) | Kyle McCarthy | 2024-01-11 | 4 | -15/+191 |
* | Use __HAVE_BFLOAT__ to check for bfloat support instead of metal version chec... | ivarflakstad | 2024-01-10 | 6 | -6/+6 |
* | Add relu kernel for metal (#1488) | Juarez Bochi | 2024-01-10 | 2 | -2/+10 |
* | Adding bfloat16 support for the cast kernels. (#1520) | Nicolas Patry | 2024-01-04 | 1 | -0/+2 |
* | Metal: support unary abs (#1503) | Gonzalo | 2023-12-30 | 2 | -1/+5 |
* | Metal: more u8/u32 (#1502) | Gonzalo | 2023-12-29 | 4 | -4/+17 |
* | Metal: i64 basic support (#1495) | Gonzalo | 2023-12-29 | 6 | -1/+48 |
* | fix bad pattern matching and function name | Baye Dieng | 2023-12-29 | 2 | -4/+4 |
* | add urecip op to metal backend | Baye Dieng | 2023-12-28 | 2 | -3/+6 |
* | Bump the crate version to 0.3.3. (#1490) | Laurent Mazare | 2023-12-28 | 1 | -1/+1 |
* | Adding upsample_nearest_2d. | Nicolas Patry | 2023-12-25 | 2 | -0/+104 |
* | Fixing matmul for convolutions. | Nicolas Patry | 2023-12-25 | 1 | -2/+2 |
* | Adding the convolutions (1d + 2d) to candle on metal. | Nicolas Patry | 2023-12-21 | 4 | -74/+260 |
* | Merge pull request #1318 from huggingface/metal4 | Nicolas Patry | 2023-12-20 | 15 | -390/+1810 |
|\ | |||||
| * | Optimizing decode matmul (Phi at 28tok/s on M3). | Nicolas Patry | 2023-12-20 | 1 | -5/+15 |
| * | Clippy pass. | Nicolas Patry | 2023-12-18 | 1 | -1/+0 |
| * | Missing cast. | Nicolas Patry | 2023-12-18 | 1 | -0/+1 |
| * | Index add. | Nicolas Patry | 2023-12-18 | 2 | -56/+109 |
| * | Scatter add. | Nicolas Patry | 2023-12-18 | 2 | -7/+97 |
| * | Adding gather op. | Nicolas Patry | 2023-12-17 | 2 | -15/+125 |
| * | Adding CMP | Nicolas Patry | 2023-12-17 | 2 | -13/+24 |
| * | Finish reduce kernels. | Nicolas Patry | 2023-12-17 | 4 | -15/+227 |
| * | Addressing a lot of comments. | Nicolas Patry | 2023-12-15 | 2 | -11/+16 |
| * | Remove test file. | Nicolas Patry | 2023-12-15 | 1 | -209/+0 |
| * | Renamed all kernel names. | Nicolas Patry | 2023-12-15 | 5 | -36/+36 |
| * | Better error message on older macos | Nicolas Patry | 2023-12-15 | 1 | -3/+5 |
| * | Adding a bunch of docs ! | Nicolas Patry | 2023-12-15 | 1 | -0/+17 |
| * | More cleanup. | Nicolas Patry | 2023-12-15 | 1 | -15/+16 |
| * | Fixing softmax. | Nicolas Patry | 2023-12-15 | 1 | -4/+7 |
| * | Fix softmax for long sequences (missing barrier). | Nicolas Patry | 2023-12-14 | 2 | -17/+49 |
| * | Fix use resource. | Nicolas Patry | 2023-12-14 | 1 | -0/+40 |
| * | Working with merging encoders and using fences. | Nicolas Patry | 2023-12-14 | 2 | -2/+247 |
| * | Fixing tests + matmul from MFA | Nicolas Patry | 2023-12-13 | 2 | -12/+108 |
| * | Removed MPSMatrix entirely (buggy). | Nicolas Patry | 2023-12-13 | 2 | -20/+286 |
| * | Lots of updates including some stack of command buffers. | nicolas | 2023-12-12 | 4 | -10/+199 |
| * | Fix gelu for large x | Juarez Bochi | 2023-12-06 | 2 | -5/+29 |
| * | Put back affine strided tests | Nicolas Patry | 2023-11-30 | 1 | -12/+15 |
| * | Starting to fix some tests. | Nicolas Patry | 2023-11-30 | 12 | -250/+470 |
* | | Bump the crate version to 0.3.2. (#1452) | Laurent Mazare | 2023-12-17 | 1 | -1/+1 |
|/ | |||||
* | Moving tests around. | Nicolas Patry | 2023-11-20 | 2 | -623/+617 |
* | Fixing cos_f16 test. | Nicolas Patry | 2023-11-20 | 1 | -2/+2 |
* | Fix comments. | Nicolas Patry | 2023-11-20 | 2 | -33/+13 |
* | Update candle-metal-kernels/Cargo.toml | Nicolas Patry | 2023-11-20 | 1 | -1/+1 |
* | Cleanup fixed a few ops removed debugging scaffolding. | Nicolas Patry | 2023-11-20 | 2 | -1/+3 |
* | Debugging rope. | Nicolas Patry | 2023-11-20 | 1 | -2/+1 |
* | Fixed matmul (display still broken without casting back to CPU first? ) | Nicolas Patry | 2023-11-20 | 1 | -1/+2 |