summaryrefslogtreecommitdiff
path: root/candle-examples/examples/mimi/README.md
diff options
context:
space:
mode:
Diffstat (limited to 'candle-examples/examples/mimi/README.md')
-rw-r--r--candle-examples/examples/mimi/README.md20
1 files changed, 20 insertions, 0 deletions
diff --git a/candle-examples/examples/mimi/README.md b/candle-examples/examples/mimi/README.md
new file mode 100644
index 00000000..bbcfcdb7
--- /dev/null
+++ b/candle-examples/examples/mimi/README.md
@@ -0,0 +1,20 @@
+# candle-mimi
+
+[Mimi](https://huggingface.co/kyutai/mimi) is a state of the art audio
+compression model using an encoder/decoder architecture with residual vector
+quantization. The candle implementation supports streaming meaning that it's
+possible to encode or decode a stream of audio tokens on the flight to provide
+low latency interaction with an audio model.
+
+## Running one example
+
+Generating some audio tokens from an audio files.
+```bash
+wget https://github.com/metavoiceio/metavoice-src/raw/main/assets/bria.mp3
+cargo run --example mimi --features mimi --release -- audio-to-code bria.mp3 bria.safetensors
+```
+
+And decoding the audio tokens back into a sound file.
+```bash
+cargo run --example mimi --features mimi --release -- code-to-audio bria.safetensors bria.wav
+```