Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Bump the crate version to 0.3.3. (#1490) | Laurent Mazare | 2023-12-28 | 21 | -55/+55 |
| | |||||
* | Add some mention to SOLAR-10.7B in the readme. (#1487) | Laurent Mazare | 2023-12-27 | 1 | -2/+3 |
| | |||||
* | Rework the llama example config, add the solar model. (#1485) | Laurent Mazare | 2023-12-26 | 1 | -72/+36 |
| | |||||
* | Use the new hub helper function. (#1484) | Laurent Mazare | 2023-12-26 | 2 | -16/+2 |
| | |||||
* | Helper function to load sharded safetensors files (#1481) | Laurent Mazare | 2023-12-25 | 7 | -67/+40 |
| | | | | | | | * Fix the quantized mistral example. * Add a helper function to load sharded safetensors weights. * Use the sharded loader. | ||||
* | Merge pull request #1479 from huggingface/upsample_metal | Nicolas Patry | 2023-12-25 | 3 | -2/+137 |
|\ | | | | | Adding upsample_nearest_2d. | ||||
| * | Adding upsample_nearest_2d. | Nicolas Patry | 2023-12-25 | 3 | -2/+137 |
|/ | |||||
* | Merge pull request #1461 from huggingface/metal-conv | Nicolas Patry | 2023-12-25 | 5 | -86/+399 |
|\ | | | | | Adding the convolutions (1d + 2d) to candle on metal. | ||||
| * | Fixing matmul for convolutions. | Nicolas Patry | 2023-12-25 | 2 | -3/+4 |
| | | |||||
| * | Adding the convolutions (1d + 2d) to candle on metal. | Nicolas Patry | 2023-12-21 | 5 | -84/+396 |
| | | |||||
* | | Fix the quantized mistral example. (#1478) | Laurent Mazare | 2023-12-25 | 2 | -4/+14 |
| | | |||||
* | | Support mistral instruct v0.2. (#1475) | Laurent Mazare | 2023-12-23 | 2 | -7/+18 |
| | | | | | | | | | | * Support mistral instruct v0.2. * Use the safetensors model now that they are available. | ||||
* | | MMLU evaluation for Phi. (#1474) | Laurent Mazare | 2023-12-23 | 2 | -13/+105 |
| | | | | | | | | | | * MMLU evaluation for Phi. * Improve the evaluation. | ||||
* | | Validate the kernel size in pooling ops. (#1473) | Laurent Mazare | 2023-12-23 | 1 | -12/+16 |
| | | | | | | | | | | * Validate the kernel size in pooling ops. * Revert the changes to basics. | ||||
* | | Fix for mamba 2.8b. (#1472) | Laurent Mazare | 2023-12-23 | 1 | -1/+1 |
| | | |||||
* | | Support different mamba models. (#1471) | Laurent Mazare | 2023-12-23 | 2 | -7/+55 |
| | | |||||
* | | Sketch the minimal mamba example. (#1465) | Laurent Mazare | 2023-12-22 | 4 | -1/+458 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * Sketch the minimal mamba example. * Fix rustfmt. * Forward pass for mamba. * Finish the forward pass. * Inference fixes. * Bugfixes. * More fixes. * Add a readme. | ||||
* | | feat: add clear_kv_cache to mistral and qmistral models (#1464) | drbh | 2023-12-21 | 2 | -0/+28 |
| | | |||||
* | | make fn name generic (#1459) | Daniel Clough | 2023-12-21 | 1 | -1/+2 |
| | | | | | | Co-authored-by: Ubuntu <danielclough@users.noreply.github.com> | ||||
* | | add fn config_chat_ml (#1458) | Daniel Clough | 2023-12-20 | 1 | -0/+19 |
|/ | | | | | | | | | | * add fn config_chat_ml * Add a link to the original config. --------- Co-authored-by: Ubuntu <danielclough@users.noreply.github.com> Co-authored-by: laurent <laurent.mazare@gmail.com> | ||||
* | Merge pull request #1318 from huggingface/metal4 | Nicolas Patry | 2023-12-20 | 25 | -776/+2775 |
|\ | | | | | Starting to fix some tests. | ||||
| * | Optimizing decode matmul (Phi at 28tok/s on M3). | Nicolas Patry | 2023-12-20 | 4 | -5/+66 |
| | | | | | | | | Adding some benchmark in order to help checking out matmul performance. | ||||
| * | Clippy pass. | Nicolas Patry | 2023-12-18 | 3 | -14/+11 |
| | | |||||
| * | Remove print. | Nicolas Patry | 2023-12-18 | 1 | -1/+0 |
| | | |||||
| * | Missing cast. | Nicolas Patry | 2023-12-18 | 2 | -0/+3 |
| | | |||||
| * | Index add. | Nicolas Patry | 2023-12-18 | 3 | -63/+151 |
| | | |||||
| * | Scatter add. | Nicolas Patry | 2023-12-18 | 3 | -17/+147 |
| | | |||||
| * | Adding gather op. | Nicolas Patry | 2023-12-17 | 3 | -17/+157 |
| | | |||||
| * | Adding CMP | Nicolas Patry | 2023-12-17 | 3 | -85/+140 |
| | | |||||
| * | Implement randn (CPU-> device) | Nicolas Patry | 2023-12-17 | 1 | -4/+3 |
| | | |||||
| * | Finish reduce kernels. | Nicolas Patry | 2023-12-17 | 6 | -39/+258 |
| | | |||||
| * | Addressing a lot of comments. | Nicolas Patry | 2023-12-15 | 4 | -20/+33 |
| | | |||||
| * | Remove `unwrap()`. | Nicolas Patry | 2023-12-15 | 2 | -48/+77 |
| | | |||||
| * | Remove test file. | Nicolas Patry | 2023-12-15 | 1 | -209/+0 |
| | | |||||
| * | Renamed all kernel names. | Nicolas Patry | 2023-12-15 | 7 | -56/+56 |
| | | |||||
| * | Better error message on older macos | Nicolas Patry | 2023-12-15 | 1 | -3/+5 |
| | | |||||
| * | More cleanup. | Nicolas Patry | 2023-12-15 | 2 | -6/+1 |
| | | |||||
| * | Revert candle-transformers. | Nicolas Patry | 2023-12-15 | 1 | -38/+4 |
| | | |||||
| * | Adding a bunch of docs ! | Nicolas Patry | 2023-12-15 | 2 | -53/+122 |
| | | | | | | | | Co-authored-by: Ivar Flakstad <69173633+ivarflakstad@users.noreply.github.com> | ||||
| * | More cleanup. | Nicolas Patry | 2023-12-15 | 1 | -15/+16 |
| | | |||||
| * | cleanup. | Nicolas Patry | 2023-12-15 | 1 | -27/+4 |
| | | |||||
| * | Fixing softmax. | Nicolas Patry | 2023-12-15 | 4 | -13/+14 |
| | | |||||
| * | Fix softmax for long sequences (missing barrier). | Nicolas Patry | 2023-12-14 | 3 | -18/+50 |
| | | |||||
| * | Fix use resource. | Nicolas Patry | 2023-12-14 | 1 | -0/+40 |
| | | |||||
| * | Working with merging encoders and using fences. | Nicolas Patry | 2023-12-14 | 5 | -94/+279 |
| | | |||||
| * | Fixing tests + matmul from MFA | Nicolas Patry | 2023-12-13 | 5 | -23/+128 |
| | | |||||
| * | Removed MPSMatrix entirely (buggy). | Nicolas Patry | 2023-12-13 | 3 | -194/+311 |
| | | |||||
| * | Better version ? | Nicolas Patry | 2023-12-13 | 2 | -25/+52 |
| | | |||||
| * | Lots of updates including some stack of command buffers. | nicolas | 2023-12-12 | 10 | -117/+537 |
| | | |||||
| * | Merge pull request #1408 from jbochi/metal_gelu2 | Nicolas Patry | 2023-12-09 | 2 | -5/+29 |
| |\ | | | | | | | Fix NaN errors for Gelu in Metal |