path: root/candle-core
Commit message  Author  Age  Files  Lines
* Fix lints for clippy 1.75. (#1494)  Laurent Mazare  2023-12-28  1  -17/+17
* Bump the crate version to 0.3.3. (#1490)  Laurent Mazare  2023-12-28  1  -2/+2
* Adding upsample_nearest_2d.  Nicolas Patry  2023-12-25  1  -2/+33
* Merge pull request #1461 from huggingface/metal-conv  Nicolas Patry  2023-12-25  1  -10/+137
|\
| * Fixing matmul for convolutions.  Nicolas Patry  2023-12-25  1  -1/+2
| * Adding the convolutions (1d + 2d) to candle on metal.  Nicolas Patry  2023-12-21  1  -10/+136
* | Fix the quantized mistral example. (#1478)  Laurent Mazare  2023-12-25  1  -1/+1
* | Validate the kernel size in pooling ops. (#1473)  Laurent Mazare  2023-12-23  1  -12/+16
* | Sketch the minimal mamba example. (#1465)  Laurent Mazare  2023-12-22  1  -1/+0
|/
* Merge pull request #1318 from huggingface/metal4  Nicolas Patry  2023-12-20  5  -385/+917
|\
| * Optimizing decode matmul (Phi at 28tok/s on M3).  Nicolas Patry  2023-12-20  2  -0/+50
| * Clippy pass.  Nicolas Patry  2023-12-18  1  -10/+8
| * Remove print.  Nicolas Patry  2023-12-18  1  -1/+0
| * Missing cast.  Nicolas Patry  2023-12-18  1  -0/+2
| * Index add.  Nicolas Patry  2023-12-18  1  -7/+42
| * Scatter add.  Nicolas Patry  2023-12-18  1  -10/+50
| * Adding gather op.  Nicolas Patry  2023-12-17  1  -2/+32
| * Adding CMP  Nicolas Patry  2023-12-17  1  -72/+116
| * Implement randn (CPU-> device)  Nicolas Patry  2023-12-17  1  -4/+3
| * Finish reduce kernels.  Nicolas Patry  2023-12-17  2  -24/+31
| * Addressing a lot of comments.  Nicolas Patry  2023-12-15  1  -8/+15
| * Remove `unwrap()`.  Nicolas Patry  2023-12-15  1  -46/+75
| * Renamed all kernel names.  Nicolas Patry  2023-12-15  1  -17/+17
| * More cleanup.  Nicolas Patry  2023-12-15  2  -6/+1
| * Adding a bunch of docs !  Nicolas Patry  2023-12-15  1  -53/+105
| * cleanup.  Nicolas Patry  2023-12-15  1  -27/+4
| * Fixing softmax.  Nicolas Patry  2023-12-15  1  -4/+6
| * Fix softmax for long sequences (missing barrier).  Nicolas Patry  2023-12-14  1  -1/+1
| * Working with merging encoders and using fences.  Nicolas Patry  2023-12-14  2  -90/+32
| * Fixing tests + matmul from MFA  Nicolas Patry  2023-12-13  1  -9/+19
| * Removed MPSMatrix entirely (buggy).  Nicolas Patry  2023-12-13  1  -174/+25
| * Better version ?  Nicolas Patry  2023-12-13  1  -21/+47
| * Lots of updates including some stack of command buffers.  nicolas  2023-12-12  2  -101/+290
| * Starting to fix some tests.  Nicolas Patry  2023-11-30  1  -227/+475
* | Add a simple full method. (#1455)  Laurent Mazare  2023-12-17  2  -0/+19
* | Bump the crate version to 0.3.2. (#1452)  Laurent Mazare  2023-12-17  1  -2/+2
* | Fix a couple typos (#1451)  Laurent Mazare  2023-12-17  4  -6/+6
* | Fix the logsumexp test. (#1426)  Laurent Mazare  2023-12-12  1  -11/+9
* | Add logsumexp function (#1424)  Wenqing Zong  2023-12-12  2  -1/+33
* | Upsample grad (#1420)  KGrewal1  2023-12-10  4  -6/+192
|/
* Merge pull request #1323 from huggingface/metal3  Nicolas Patry  2023-11-27  7  -54/+127
|\
| * Allowing feature metal to compile.  Nicolas Patry  2023-11-20  1  -6/+5
| * Missing new test.  Nicolas Patry  2023-11-20  1  -0/+1
| * Adding the test scaffolding.  Nicolas Patry  2023-11-20  6  -48/+121
* | Add a basic implementation for slice-assign. (#1377)  Laurent Mazare  2023-11-26  2  -0/+87
* | Small tweak: remove the macro usage for the range indexing trait. (#1376)  Laurent Mazare  2023-11-26  1  -28/+22
* | Implement the module trait directly for QMatMul. (#1372)  Laurent Mazare  2023-11-25  4  -15/+8
|/
* Fmt.  Nicolas Patry  2023-11-20  1  -4/+11
* Fix comments.  Nicolas Patry  2023-11-20  2  -70/+52
* Cleanup fixed a few ops removed debugging scaffolding.  Nicolas Patry  2023-11-20  2  -44/+24