summaryrefslogtreecommitdiff
path: root/candle-transformers/src/models/whisper/quantized_model.rs
Commit message (Collapse)AuthorAgeFilesLines
* feat: support microphone whisper streaming (#1678)drbh2024-02-121-0/+28
| | | | | | | | | | | | | | | * feat: support microphone whisper streaming * fix: cleanup print stmts and adjust how input is read * fix: remove incorrect comment * feat: split into new example and simplify * fix: feature flag example file * fix: fmt fixes * feat: simplify and remove redundant files
* feat: support multithread spectrogram and small perf tweaks (#1674)drbh2024-02-081-4/+4
| | | | | | | | | | | * feat: support multithread spectrogram and small perf tweaks * feat: clippy improvement for loop variable * fix: add back speed up scale down logic * fix: readd mirroring logic * feat: prefer scoped thread and simplify/improve logic/traits
* Make the whisper model cloneable (#1200)Laurent Mazare2023-10-271-0/+5
| | | | | | | * Add a quantized variant of llama2.c * Clippy fixes. * Make the whisper model cloneable.
* Move the common quantized-nn code to a shared module. (#1063)Laurent Mazare2023-10-091-42/+6
|
* Better control on the optional dequantization in QMatMul (#1049)Laurent Mazare2023-10-071-6/+5
| | | | | | | * Cosmetic change to the quantized whisper model. * Fix the dequantization. * Add the dequantize all variable.
* Add a quantized variant of whisper (#1017)Laurent Mazare2023-10-021-0/+403
* Add the quantized-whisper model. * Quantized the whisper model. * Adapt the whisper example to handle quantization. * Add the quantized flag. * Load the proper weights.