path: root/Cargo.toml
Commit message (Collapse)    Author    Age    Files    Lines
...
* Debugging rope.    Nicolas Patry    2023-11-20    1    -2/+1
|
* Fixed matmul (display still broken without casting back to CPU first? )    Nicolas Patry    2023-11-20    1    -1/+2
|
* Adding the actual backend    Nicolas Patry    2023-11-20    1    -0/+1
|
* Update for 0.3.1. (#1324)    Laurent Mazare    2023-11-11    1    -1/+1
|
* Metal part 1 - Scaffolding for metal. (#1308)    Nicolas Patry    2023-11-10    1    -0/+1
|     * Metal part 1 - Scaffolding for metal.
|     * Remove tracing.
* Add support to UL2 model family (#1300)    Juarez Bochi    2023-11-09    1    -0/+1
|     * Add support to UL2 model family
|     * Update docs with UL2
|     * Create ActivationWithOptionalGating to avoid polluting activations
|     * Also refactor quantized t5
|     * Remove useless conversion
|     * Revert Activation::NewGelu name change
|     * Remove useless return
|     * Apply rustfmt and clippy recommendations
|     * Reuse t5::ActivationWithOptionalGating in quantized version
|     * (cosmetic change) use a match rather than ifs + avoid early returns.
|     ---------
|     Co-authored-by: Laurent <laurent.mazare@gmail.com>
* Put the onnx example behind a feature flag. (#1276)    Laurent Mazare    2023-11-06    1    -2/+5
|     * Put the onnx example behind a feature flag.
|     * Exclude the onnx bits from the workspace.
|     * README tweaks.
* Add some preliminary ONNX support (#1260)    Laurent Mazare    2023-11-04    1    -0/+1
|     * Add the onnx protos.
|     * Move the reading bits.
|     * Install protoc on the CI.
|     * Install protoc on the cuda CI too.
|     * Use clap for the onnx tool.
|     * Tweak the CI protoc install.
|     * Add some simple evalution function.
|     * Add some binary operator support.
* [Wasm] BLIP Example (#1183)    Radamés Ajna    2023-10-26    1    -7/+1
|     * blip wasm start
|     * fix dependency issue, move token stream here
|     * vanilla js worker
|     * roll back vscode
|     * spell
* Another gemm update. (#1088)    Laurent Mazare    2023-10-14    1    -1/+1
|
* Update to gemm 0.16.1 (#1083)    Laurent Mazare    2023-10-13    1    -1/+1
|
* Update to gemm-0.16. (#1082)    Laurent Mazare    2023-10-12    1    -2/+1
|     * Update to gemm-0.16.
|     * Enable wasm-simd128.
* Bump the version to 0.3.0. (#1014)    Laurent Mazare    2023-10-01    1    -1/+1
|     * Bump the version to 0.3.0.
|     * Changelog update.
* Simd128 vec-dot for q4_0. (#974)    Laurent Mazare    2023-09-27    1    -1/+2
|     * Simd128 vec-dot for q4_0.
|     * Bugfix.
|     * Add wasm tests.
|     * Bugfix for the q40 vecdot.
|     * More quantization tests.
* Phi 1.5 wasm module (#966)    Radamés Ajna    2023-09-27    1    -0/+1
|     * add phi wasm module
|     * replace input with textarea
|     * trim input prompt
|     * stop on <|endoftext|>
|     * formatting
|     * clean up
|     * add blurb, and syntax highlighting
|     * add phi-v1.5 wasm
|     * add note
|     * hide Options on details
|     * add first token to generated text
|     * whitespaces for new line
|     * fix: abort -> aborted
* Use yoke to provide a self-referential container for mmaped safetenso… (#939)    Laurent Mazare    2023-09-23    1    -2/+3
|     * Use yoke to provide a self-referential container for mmaped safetensor files.
|     * Add the new self-owned type for safetensor files without removing the previous version.
|     * Add routing.
|     * Add an initializer for the case of multiple files.
* T5 Wasm (#918)    Radamés Ajna    2023-09-22    1    -0/+1
|     * init t5 wasm model
|     * split workers for each model
|     * clean up
|     * add some ui
|     * readme
|     * index
|     * typo
|     * remove cache param, clear_kv_cache
|     * add max_length as param
|     * add model tasks option to ui
|     * add method to load quantized gguf from buffer
|     * Add quantized wasm module
|     * add quantized models to UI, dynamic import wasms
|     * link to quantized
|     * fix copy
|     * fix ModelEncoder
|     * fix README.md
* BERT Wasm (#902)    Radamés Ajna    2023-09-19    1    -4/+2
|     * implement wasm module
|     * add example to workspace
|     * add UI explore semantic similiarity
|     * change status messages
|     * formatting
|     * minor changes
* Bump the crate versions to v0.2.3. (#886)    Laurent Mazare    2023-09-18    1    -1/+1
|     * Bump the crate version.
|     * Also update the python bindings.
* Update the candle-gemm version. (#885)    Laurent Mazare    2023-09-18    1    -1/+1
|
* Bump the crate version + update the changelog. (#822)    Laurent Mazare    2023-09-12    1    -1/+1
|
* Add a wasm module for the segment anything example. (#797)    Laurent Mazare    2023-09-10    1    -0/+1
|
* Add some documentation. (#673)    Laurent Mazare    2023-08-30    1    -1/+1
|     * Add some documentation.
|     * Bump the crate version.
* Upgrading hf-hub.    Nicolas Patry    2023-08-29    1    -1/+1
|
* Cleanup:    Nicolas Patry    2023-08-28    1    -0/+2
|     - Moved around book from `examples` to `candle-book` proper (overlapping the book and the lib structures)
* Bump the crate version + update CHANGELOG. (#628)    Laurent Mazare    2023-08-27    1    -1/+1
|
* Add some configurable legend for yolo detection. (#603)    Laurent Mazare    2023-08-25    1    -0/+1
|     * Add some configurable legend for yolo detection.
|     * Clippyness.
* Move the yolo model bits in a separate file. (#602)    Laurent Mazare    2023-08-25    1    -0/+1
|     * Move the yolo model bits in a separate file.
|     * Improve the drawing.
|     * Bugfix.
* Use resolver 2 explicitely. (#597)    Laurent Mazare    2023-08-25    1    -0/+1
|
* Add some group parameter to convolutions. (#566)    Laurent Mazare    2023-08-23    1    -1/+1
|     * Add some group parameter to convolutions.
|     * Avoid some unnecessary groups checks.
|     * Move the tensor convolution bits.
|     * Properh handling of groups.
|     * Bump the crate version.
|     * And add a changelog.
* Sketch the yolo wasm example. (#546)    Laurent Mazare    2023-08-22    1    -0/+1
|     * Sketch the yolo wasm example.
|     * Web ui.
|     * Get the web ui to work.
|     * UI tweaks.
|     * More UI tweaks.
|     * Use the natural width/height.
|     * Add a link to the hf space in the readme.
* Bump the crates version to 0.1.2. (#522)    Laurent Mazare    2023-08-20    1    -1/+1
|
* Rename vec-dot to vec-ops. (#449)    Laurent Mazare    2023-08-15    1    -1/+1
|     * Rename vec-dot to vec-ops.
|     * Also bump the crate version.
|     * Add a currently empty readme.
* Simd support (#448)    Laurent Mazare    2023-08-15    1    -2/+1
|     * Import the simd intrinsics in candle-core.
|     * simd version of reduce-sum.
|     * Bugfix.
|     * Fix some clippy lints.
* Add a cuda kernel for upsampling. (#441)    Laurent Mazare    2023-08-14    1    -1/+1
|     * Add a cuda kernel for upsampling.
|     * Update for the latest tokenizers version.
* Parallelise the CPU kernels for the conv ops. (#401)    Laurent Mazare    2023-08-11    1    -0/+1
|     * Parallelise the conv2d op.
|     * Tighter control on threading.
|     * Also parallelise conv1d.
|     * Add some safety comment.
* Working now.    Nicolas Patry    2023-08-10    1    -1/+1
|
* [DO NOT MERGE] temporary PR so users can try out on older GPUs.    Nicolas Patry    2023-08-10    1    -1/+1
|
* Conv1d optimize (#392)    Laurent Mazare    2023-08-10    1    -0/+1
|     * Reorder the conv1d loops in the cpu backend.
|     * Optimize the 1d convolution.
|     * Conv1D optimize.
|     * Fix some clippy lints.
* Fix randn cpu (#382)    Lei    2023-08-10    1    -0/+1
|     * Change distributions
|       Standard generates in [0, 1), Normal is correct.
|     * Add test
|       Not sure if this is the best place to put the test
|     * Remove unnecessary use
* Update gemm to 0.15.6. (#378)    Laurent Mazare    2023-08-09    1    -1/+1
|
* Write the generated images using the image crate. (#363)    Laurent Mazare    2023-08-09    1    -0/+1
|     * Use the image crate to write the generated images.
|     * Make the dependency optional.
* Add the license files. (#335)    Laurent Mazare    2023-08-07    1    -1/+1
|
* Support the Accelerate BLAS on macOS. (#325)    Laurent Mazare    2023-08-05    1    -0/+1
|     * Add the accelerate feature.
|     * Ffi tweaks.
* Add the candle-datasets crate (#322)    Laurent Mazare    2023-08-05    1    -0/+1
|     * Move the vision datasets to a separate crate.
|     * Move the batcher bits.
|     * Update the readme.
|     * Move the tiny-stories bits.
|     ---------
|     Co-authored-by: Jane Doe <jane.doe@example.org>
* Update the repo location. (#305)    Laurent Mazare    2023-08-02    1    -0/+9
|
* Add version numbers for all the candle crates (#303)    Laurent Mazare    2023-08-02    1    -4/+2
|     * Switch to candle-gemm for the time being.
|     * Add the missing versions.
* Update cudarc now that it includes the cublas-f16 and nccl changes. (#300)    Laurent Mazare    2023-08-02    1    -3/+1
|
* Fixing nccl feature.    Nicolas Patry    2023-07-28    1    -1/+1
|
* Merge pull request #216 from LaurentMazare/llama_multiprocess2    Nicolas Patry    2023-07-28    1    -2/+4
|\
| |     TP sharding v2