summaryrefslogtreecommitdiff
path: root/candle-wasm-examples/whisper
Commit message (Collapse)AuthorAgeFilesLines
* Clippy fixes. (#2415)Laurent Mazare2024-08-142-8/+6
| | | | | * Clippy fixes. * Bump the web_sys required version.
* Remove the deprecated wav crate in favor of hound. (#2202)Laurent Mazare2024-05-212-9/+12
|
* Bump the version number to 0.5.1. (#2155)Laurent Mazare2024-05-031-21/+0
| | | | | | | * Bump the version number to 0.5.1. * Fix clippy lints for 1.78. * More clippy fixes.
* Quantized GGUF style (#1523)Nicolas Patry2024-01-171-0/+1
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * Metal quantized modifications proposal. - Add a device param, wherever needed. - Create new QMetal storage thing that implements QuantizedType. - Update everywhere needed. Fix Python. Fixing examples. Fix: fmt + clippy + stub. Moving everything around. Only missing the actual implems. Fixing everything + adding dequantized kernels. More work. Fixing matmul. Fmt + Clippy Some clippy fixes. Working state. Q2K Metal -> Bugged (also present in GGML). Q4K CPU -> Bugged (present previously, new test catch it). Q5K CPU -> Bugged (present previously). Q8_1 Both -> Never really implemented it seems Q8K metal -> Never implemented in metal Fixing Q2K bug (present in ggml). * Cleanup. * Fix the rebase. * Removing the fences speeds everything up and *is* correct this time... * Cleanup the fence. * After rebase. * Bad code removal. * Rebase after phi2 merge + fix replit default to CPU. * Making the CI happy. * More happy tests. --------- Co-authored-by: Nicolas Patry <nicolas@Nicolass-MacBook-Pro.local>
* Update gloo requirement from 0.8 to 0.11 (#1558)dependabot[bot]2024-01-101-1/+1
| | | | | | | | | | | | | | | Updates the requirements on [gloo](https://github.com/rustwasm/gloo) to permit the latest version. - [Release notes](https://github.com/rustwasm/gloo/releases) - [Changelog](https://github.com/rustwasm/gloo/blob/master/CHANGELOG.md) - [Commits](https://github.com/rustwasm/gloo/commits) --- updated-dependencies: - dependency-name: gloo dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Simplifying our internal cargo dependencies. (#1529)Nicolas Patry2024-01-071-3/+3
|
* Fix lints for clippy 1.75. (#1494)Laurent Mazare2023-12-281-9/+7
|
* Bump the crate version to 0.3.3. (#1490)Laurent Mazare2023-12-281-3/+3
|
* Bump the crate version to 0.3.2. (#1452)Laurent Mazare2023-12-171-3/+3
|
* Fix a couple typos (#1451)Laurent Mazare2023-12-172-2/+2
| | | | | * Mixtral quantized instruct. * Fix a couple typos.
* Use the whisper-v3 tokenizer now that it has been added. (#1337)Laurent Mazare2023-11-161-1/+7
| | | | | * Use the whisper-v3 tokenizer now that it has been added. * Use the appropriate nospeech token.
* fix: address clippy 0.1.74 issues (#1336)drbh2023-11-161-2/+1
| | | | - clippy::needless-borrows-for-generic-args - clippy::reserve-after-initialization
* Update for 0.3.1. (#1324)Laurent Mazare2023-11-111-3/+3
|
* Preliminary support for whisper v3. (#1294)Laurent Mazare2023-11-082-3/+5
| | | | | * Preliminary support for whisper v3. * Add the missing files.
* add distil-whisper link (#1261)Radamés Ajna2023-11-031-35/+49
|
* Remove some unusued bits. (#1067)Laurent Mazare2023-10-091-1/+0
|
* Whisper quantized wasm (#1028)Radamés Ajna2023-10-0412-596/+539
| | | | | | | | | | | | | * [Whisper] Update to use quantized model * [whisper] add language detection * [whisper] change assets location * [whisper] adapt js example with quantized models * [whisper] better task parsing * [whisper] minor fixes
* Bump the version to 0.3.0. (#1014)Laurent Mazare2023-10-011-2/+2
| | | | | * Bump the version to 0.3.0. * Changelog update.
* Pass directly the buffer ownership. (#949)Laurent Mazare2023-09-241-2/+1
|
* Bump the crate versions to v0.2.3. (#886)Laurent Mazare2023-09-181-2/+2
| | | | | * Bump the crate version. * Also update the python bindings.
* minor UI fixes (#856)Radamés Ajna2023-09-151-3/+6
| | | | | | | * fixes * remove listener * remove event listener
* Bump the crate version + update the changelog. (#822)Laurent Mazare2023-09-121-2/+2
|
* force model cache (#751)Radamés Ajna2023-09-061-9/+10
|
* Minor WASM UI improvements (#748)Radamés Ajna2023-09-051-6/+11
| | | | | | | * add stats * random seed btn * minor ui improvoments
* Remove unnecessary file. (#710)Laurent Mazare2023-09-011-0/+0
|
* Add a repeat penalty to the llama2.c wasm example. (#709)Laurent Mazare2023-09-011-0/+0
|
* Add the kv-cache to the whisper wasm version. (#689)Laurent Mazare2023-08-313-40/+95
| | | | | * Add the kv-cache to the whisper wasm version. * Improve the handling of special tokens.
* Improve Whisper WASM UI example (#669)Radamés Ajna2023-08-307-3/+487
| | | | | | | | | | | * wip add module and js worker example * params * clean up, send error * final UI with whisper webworker * add simple instructions
* Add some documentation. (#673)Laurent Mazare2023-08-301-2/+2
| | | | | * Add some documentation. * Bump the crate version.
* Dilated convolutions (#657)Laurent Mazare2023-08-291-0/+2
| | | | | | | | | | | | | | | | | | | * Add the dilation parameter. * Restore the basic optimizer example. * Dilation support in cudnn. * Use the dilation parameter in the cpu backend. * More dilation support. * No support for dilation in transposed convolutions. * Add dilation to a test. * Remove a print. * Helper function.
* Remove some dead-code annotations. (#629)Laurent Mazare2023-08-273-39/+0
| | | | | | | | | * Remove some dead-code annotations. * More dead code removal. * One more. * CI fix.
* Bump the crate version + update CHANGELOG. (#628)Laurent Mazare2023-08-271-2/+2
|
* Add some group parameter to convolutions. (#566)Laurent Mazare2023-08-232-2/+4
| | | | | | | | | | | | | * Add some group parameter to convolutions. * Avoid some unnecessary groups checks. * Move the tensor convolution bits. * Properh handling of groups. * Bump the crate version. * And add a changelog.
* Bump the crates version to 0.1.2. (#522)Laurent Mazare2023-08-201-2/+2
|
* Add a simple Module trait and implement it for the various nn layers (#500)Laurent Mazare2023-08-181-1/+1
| | | | | | | * Start adding the module trait. * Use the module trait. * Implement module for qmatmul.
* Rename vec-dot to vec-ops. (#449)Laurent Mazare2023-08-151-2/+2
| | | | | | | * Rename vec-dot to vec-ops. * Also bump the crate version. * Add a currently empty readme.
* Add a cuda kernel for upsampling. (#441)Laurent Mazare2023-08-141-4/+1
| | | | | * Add a cuda kernel for upsampling. * Update for the latest tokenizers version.
* Update the repo location. (#305)Laurent Mazare2023-08-021-9/+7
|
* Add version numbers for all the candle crates (#303)Laurent Mazare2023-08-021-2/+2
| | | | | * Switch to candle-gemm for the time being. * Add the missing versions.
* Rename the candle crate to candle-core (#301)Laurent Mazare2023-08-021-1/+1
| | | | | * Rename to candle-core. * More candle-core renaming.
* Softmax numerical stability. (#267)Laurent Mazare2023-07-282-8/+5
| | | | | * Softmax numerical stability. * Fix the flash-attn test.
* Removing inner dependency on safetensors.Nicolas Patry2023-07-271-1/+1
|
* TP sharding v2Nicolas Patry2023-07-272-2/+3
|
* Micro-cleanup. (#256)Laurent Mazare2023-07-271-4/+2
|
* Simple QOL.Nicolas Patry2023-07-261-2/+10
| | | | | | | - Add ms/token on llama2.c (15ms/token on my personal machine) - Hide `Run` buttons while models are not ready - Add dummy `progress` while weights are downloading (I briefly looked at putting a real progressbar.. and nothing easy enough came up.)
* Re-organize the wasm examples (#231)Laurent Mazare2023-07-2410-0/+1342
* Move the whisper example. * More renaming. * Add llama2 as a new wasm example. * Live generation. * More of the llama wasm example. * Formatting.