summaryrefslogtreecommitdiff
path: root/candle-core/src/layout.rs
Commit message (Collapse)AuthorAgeFilesLines
* 20241118 docs (#2629)zachcp2024-11-191-0/+1
| | | | | | | | | | | | | | | | | * module docs * varbuilder gguf docs * add a link to gguf files * small additonal mod doc titles * safetensor docs * more core docs * more module docs in canlde_core * 2 more link fixes
* Add some fast Metal MLX SDPA kernels (#2584)Eric Buehler2024-11-051-0/+6
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * Add some fast Metal MLX SDPA kernels (#32) * Sketch the sdpa kernel * Add full sdpa kernel, * Add test * Add vectorized kernel for decoding * Update tests * Add some docs * Fix sdpa_vector names * Add softcapping for vectorized sdpa * Add softcapping for full sdpa * Add support for head dim 32, 96, 256 * Add support for head dim 32, 96, 256 * Update docs * Add update notice * Clippy and format * Conditional compilation for bf16 * Use it in quantized llama * Some review comments * Use set_params! * Remove unused * Remove feature * Fix metal sdpa for v stride * Remove comma * Add the dim method to layout and shape. --------- Co-authored-by: Laurent <laurent.mazare@gmail.com>
* Expose a couple layout methods. (#1816)Laurent Mazare2024-03-081-3/+3
|
* Retrieve more information from PyTorch checkpoints. (#515)Laurent Mazare2023-08-191-0/+8
| | | | | * Retrieve more information from PyTorch checkpoints. * Add enough support to load dino-v2 backbone weights.
* Add the permute op (similar to pytorch). (#504)Laurent Mazare2023-08-181-0/+25
| | | | | * Add the permute op (similar to pytorch). * Add the backprop for dimension permutation.
* Broadcasting performance optimization (cpu) (#182)Laurent Mazare2023-07-171-0/+56
| | | | | * Avoid recomputing the index from scratch each time. * More performance optimisations.
* Iteration over strided blocks (#175)Laurent Mazare2023-07-151-1/+32
| | | | | | | * Introduce the strided blocks. * Use the strided blocks to fasten the copy. * Add more testing.
* Add backtrace information to errors where relevant. (#166)Laurent Mazare2023-07-141-8/+13
| | | | | | | * Add backtrace information to errors where relevant. * More backtrace information. * Add to the FAQ.
* Cleanup the main crate error and add a couple dedicated ones (#142)Laurent Mazare2023-07-121-7/+13
| | | | | | | | | * Cosmetic cleanups to the error enum. * More error cleanup. * Proper error handling rather than panicing. * Add some conv1d dedicated error.
* Get the cpu backend to compile.laurent2023-06-281-2/+2
|
* Propagate the changes on the cpu backend.laurent2023-06-281-0/+11
|
* Propagate the layout refactoring.laurent2023-06-281-3/+64
|
* Simplify the narrow implementation.laurent2023-06-281-1/+22
|
* Start refactoring the stride.laurent2023-06-281-0/+47