| Commit message (Expand) | Author | Age | Files | Lines |
* | UG metal integration. (#2580) | Laurent Mazare | 2024-10-27 | 1 | -1/+2 |
* | Support for UG kernels. (#2579) | Laurent Mazare | 2024-10-27 | 1 | -1/+3 |
* | Add a basic metal example with capture (#2324) | Laurent Mazare | 2024-07-09 | 1 | -0/+4 |
* | feat(bf16): add cast support + tests for cast + bin ops (#1524) | Kyle McCarthy | 2024-01-11 | 1 | -1/+0 |
* | Seperate benchmarks by enabled features (#1538) | ivarflakstad | 2024-01-11 | 1 | -1/+1 |
* | Simplifying our internal cargo dependencies. (#1529) | Nicolas Patry | 2024-01-07 | 1 | -2/+2 |
* | Bump the crate version to 0.3.3. (#1490) | Laurent Mazare | 2023-12-28 | 1 | -2/+2 |
* | Merge pull request #1318 from huggingface/metal4 | Nicolas Patry | 2023-12-20 | 1 | -0/+7 |
|\ |
|
| * | Optimizing decode matmul (Phi at 28tok/s on M3). | Nicolas Patry | 2023-12-20 | 1 | -0/+7 |
* | | Bump the crate version to 0.3.2. (#1452) | Laurent Mazare | 2023-12-17 | 1 | -2/+2 |
|/ |
|
* | Fix comments. | Nicolas Patry | 2023-11-20 | 1 | -1/+1 |
* | Adding the actual backend | Nicolas Patry | 2023-11-20 | 1 | -1/+2 |
* | Update for 0.3.1. (#1324) | Laurent Mazare | 2023-11-11 | 1 | -1/+1 |
* | Metal part 1 - Scaffolding for metal. (#1308) | Nicolas Patry | 2023-11-10 | 1 | -0/+2 |
* | Bump the version to 0.3.0. (#1014) | Laurent Mazare | 2023-10-01 | 1 | -1/+1 |
* | Use yoke to provide a self-referential container for mmaped safetenso… (#939) | Laurent Mazare | 2023-09-23 | 1 | -0/+1 |
* | Bump the crate versions to v0.2.3. (#886) | Laurent Mazare | 2023-09-18 | 1 | -1/+1 |
* | Bump the crate version + update the changelog. (#822) | Laurent Mazare | 2023-09-12 | 1 | -1/+1 |
* | Add some documentation. (#673) | Laurent Mazare | 2023-08-30 | 1 | -1/+1 |
* | Bump the crate version + update CHANGELOG. (#628) | Laurent Mazare | 2023-08-27 | 1 | -1/+1 |
* | Add some group parameter to convolutions. (#566) | Laurent Mazare | 2023-08-23 | 1 | -1/+1 |
* | Bump the crates version to 0.1.2. (#522) | Laurent Mazare | 2023-08-20 | 1 | -1/+1 |
* | Rename vec-dot to vec-ops. (#449) | Laurent Mazare | 2023-08-15 | 1 | -1/+1 |
* | Simd support (#448) | Laurent Mazare | 2023-08-15 | 1 | -1/+0 |
* | Cudnn support (#445) | Laurent Mazare | 2023-08-14 | 1 | -1/+2 |
* | Parallelise the CPU kernels for the conv ops. (#401) | Laurent Mazare | 2023-08-11 | 1 | -0/+1 |
* | Small example for benchmarking some cpu ops (#394) | Laurent Mazare | 2023-08-10 | 1 | -0/+1 |
* | Conv1d optimize (#392) | Laurent Mazare | 2023-08-10 | 1 | -0/+1 |
* | Fix randn cpu (#382) | Lei | 2023-08-10 | 1 | -0/+1 |
* | Support the Accelerate BLAS on macOS. (#325) | Laurent Mazare | 2023-08-05 | 1 | -0/+2 |
* | Update the repo location. (#305) | Laurent Mazare | 2023-08-02 | 1 | -8/+7 |
* | Add version numbers for all the candle crates (#303) | Laurent Mazare | 2023-08-02 | 1 | -1/+1 |
* | Rename the candle crate to candle-core (#301) | Laurent Mazare | 2023-08-02 | 1 | -1/+1 |
* | Centralize the dependency versions and inherit them. (#177) | Laurent Mazare | 2023-07-16 | 1 | -18/+14 |
* | Removing cuda default. | Nicolas Patry | 2023-07-14 | 1 | -1/+1 |
* | Random initializers. (#128) | Laurent Mazare | 2023-07-10 | 1 | -0/+1 |
* | Remove the dependency to blas and use mkl directly. (#125) | Laurent Mazare | 2023-07-10 | 1 | -2/+2 |
* | Sketch the candle-nn crate. (#115) | Laurent Mazare | 2023-07-10 | 1 | -1/+2 |
* | Use cublas bf16. (#101) | Laurent Mazare | 2023-07-07 | 1 | -1/+2 |
* | Add mkl support for matrix multiply. (#86) | Laurent Mazare | 2023-07-06 | 1 | -0/+3 |
* | Move llama in a cargo-examples directory. | laurent | 2023-07-03 | 1 | -5/+0 |
* | Use the patched gemm for the time being. | laurent | 2023-07-03 | 1 | -1/+3 |
* | Move more safetensors bits to the shared module. | laurent | 2023-07-03 | 1 | -7/+7 |
* | Add backtraces. | laurent | 2023-06-29 | 1 | -1/+1 |
* | Tmp. | Ubuntu | 2023-06-28 | 1 | -0/+3 |
* | Use num-cpus to enable parallelism. | laurent | 2023-06-27 | 1 | -0/+1 |
* | Refactor the hierarchy. | Nicolas Patry | 2023-06-27 | 1 | -0/+32 |