summaryrefslogtreecommitdiff
path: root/candle-examples/examples/efficientvit/README.md
diff options
context:
space:
mode:
Diffstat (limited to 'candle-examples/examples/efficientvit/README.md')
-rw-r--r--candle-examples/examples/efficientvit/README.md20
1 files changed, 20 insertions, 0 deletions
diff --git a/candle-examples/examples/efficientvit/README.md b/candle-examples/examples/efficientvit/README.md
new file mode 100644
index 00000000..7a989a25
--- /dev/null
+++ b/candle-examples/examples/efficientvit/README.md
@@ -0,0 +1,20 @@
+# candle-efficientvit
+
+[EfficientViT: Memory Efficient Vision Transformer with Cascaded Group Attention](https://arxiv.org/abs/2305.07027).
+
+This candle implementation uses a pre-trained EfficientViT (from Microsoft Research Asia) network for inference.
+The classification head has been trained on the ImageNet dataset and returns the probabilities for the top-5 classes.
+
+## Running an example
+
+```
+$ cargo run --example efficientvit --release -- --image candle-examples/examples/yolo-v8/assets/bike.jpg --which m1
+
+loaded image Tensor[dims 3, 224, 224; f32]
+model built
+mountain bike, all-terrain bike, off-roader: 69.80%
+unicycle, monocycle : 13.03%
+bicycle-built-for-two, tandem bicycle, tandem: 9.28%
+crash helmet : 2.25%
+alp : 0.46%
+```