summaryrefslogtreecommitdiff
path: root/candle-examples/examples/yolo-v8
Commit message (Collapse)AuthorAgeFilesLines
* Clippy fixes for 1.81.0. (#2461)Laurent Mazare2024-09-051-1/+1
| | | | | * Clippy fixes for 1.81.0. * Another fix.
* onnx: fix pad, unsqueeze (#2317)shua2024-07-231-1/+1
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * onnx: fix pad, unsqueeze both implementations have off-by-one errors: - Pad 'reflect' cycle for eg `dim==3` is `[0,1,2,1]` which has length of 4 (or `dim*2 - 2`) not 5 (current code `dim*2 - 1`) - Unsqueeze(-1) for tensor with `dim==3` should be 3 (ie `dim+index+1`) not 2 (ie currently `dim+index`) in addition, Pad is incorrectly calculating the starting padding. If we want to pad out 2 elements to the start, and we have this cycle of indices of length 6, then we should skip 4 elements, but currently we skip 2. A more visual representation of what's going on is below: ``` pad_start: 2 data: [a,b,c,d] indices: [0, 1, 2, 3, 2, 1, 0, 1, 2, 3, 2, 1, 0, ..] // zigzag between 0..4 actual: skip [ c d| c b a b] expected: ~ skip ~ [ c b| a b c d] ``` The values between `[` and `|` are padding and the values between `|` and `]` in the example should match the original data being padded. * Fix clippy lints. --------- Co-authored-by: Laurent <laurent.mazare@gmail.com>
* Support alternative dtypes for mamba (#2036)Laurent Mazare2024-04-101-0/+0
| | | | | * Allow different dtypes in mamba. * Add a dtype flag.
* Update the image crate + use the re-exported version. (#1893)Laurent Mazare2024-03-211-22/+23
| | | | | * Update the image crate + use the re-exported version. * Update to using ab_glyph.
* Fix linspace implementation (#1358)MilkFather2023-11-231-1/+1
| | | | | | | | | | | | | * Fix linspace implementation `steps` should be strictly greater than 1 to make it consistent with the context. * Handle steps == 0 and steps == 1. * Fix rustfmt. --------- Co-authored-by: laurent <laurent.mazare@gmail.com>
* Ensure to copy data to cpu before iterating. (#1360)Marcus Asteborg2023-11-231-1/+3
|
* Fix pose estimation image path (#1326)Bernardo de Lemos2023-11-121-1/+1
|
* Add fuse-conv-bn method for Conv2d (#1196)jamjamjon2023-10-271-7/+2
| | | | | | | * Add fuse-conv-bn method for Conv2d * no unwrap * run rustfmp and clippy
* Adapt more examples to the updated safetensor api. (#947)Laurent Mazare2023-09-231-3/+1
| | | | | | | | | * Simplify the safetensor usage. * Convert more examples. * Move more examples. * Adapt stable-diffusion.
* Add tracing. (#943)Laurent Mazare2023-09-232-10/+90
|
* Readme gallery (#834)Laurent Mazare2023-09-131-1/+1
| | | | | * More readme tweaks. * Update README.md
* Segment Anything readme (#827)Laurent Mazare2023-09-121-0/+0
| | | | | | | | | * Add a readme for the segment-anything model. * Add the original image. * Clean-up the segment anything cli example. * Also print the mask id in the outputs.
* Add a readme for yolo-v8. (#824)Laurent Mazare2023-09-125-0/+47
|
* Move more models to candle-transformers (#796)Laurent Mazare2023-09-101-1/+1
| | | | | | | | | * Move dinov2. * Move efficientnet. * Move the quantized llama model. * Move segment-anything.
* Automatic mask generation (#779)Laurent Mazare2023-09-081-6/+6
| | | | | | | | | * A few more contiguous fixes for cuda. * Mask generation. * Generic bbox. * Generate all the masks.
* Simplify usage of the pool functions. (#662)Laurent Mazare2023-08-291-3/+3
| | | | | | | * Simplify usage of the pool functions. * Small tweak. * Attempt at using apply to simplify the convnet definition.
* Dilated convolutions (#657)Laurent Mazare2023-08-291-0/+1
| | | | | | | | | | | | | | | | | | | * Add the dilation parameter. * Restore the basic optimizer example. * Dilation support in cudnn. * Use the dilation parameter in the cpu backend. * More dilation support. * No support for dilation in transposed convolutions. * Add dilation to a test. * Remove a print. * Helper function.
* Remove some dead-code annotations. (#629)Laurent Mazare2023-08-271-1/+0
| | | | | | | | | * Remove some dead-code annotations. * More dead code removal. * One more. * CI fix.
* Cleanup the pose reporting code. (#605)Laurent Mazare2023-08-251-65/+58
|
* Add some configurable legend for yolo detection. (#603)Laurent Mazare2023-08-252-1/+43
| | | | | * Add some configurable legend for yolo detection. * Clippyness.
* Move the yolo model bits in a separate file. (#602)Laurent Mazare2023-08-252-747/+795
| | | | | | | * Move the yolo model bits in a separate file. * Improve the drawing. * Bugfix.
* More support for pose estimation in yolo-v8. (#599)Laurent Mazare2023-08-251-15/+161
| | | | | * More support for pose estimation in yolo-v8. * Support both object detection and pose-estimation in the yolo-v8 example.
* Add the pose estimation head for yolo. (#589)Laurent Mazare2023-08-241-6/+104
| | | | | | | | | | | * Add the pose estimation head for yolo. * Properly handle the added position dimensions. * Integrate the pose estimation head in the forward pass. * Renaming. * Fix for pose estimation.
* Add some group parameter to convolutions. (#566)Laurent Mazare2023-08-231-1/+5
| | | | | | | | | | | | | * Add some group parameter to convolutions. * Avoid some unnecessary groups checks. * Move the tensor convolution bits. * Properh handling of groups. * Bump the crate version. * And add a changelog.
* Improve the aspect ratio handling on yolo-v8. (#549)Laurent Mazare2023-08-221-14/+35
| | | | | * Fix the aspect ratio handling in yolo-v8. * Typo.
* Move the yolo shared bits to a common place. (#548)Laurent Mazare2023-08-222-132/+34
| | | | | | | * Move the yolo shared bits to a common place. * Share more code. * Configurable thresholds.
* Sketch the yolo wasm example. (#546)Laurent Mazare2023-08-221-4/+0
| | | | | | | | | | | | | | | * Sketch the yolo wasm example. * Web ui. * Get the web ui to work. * UI tweaks. * More UI tweaks. * Use the natural width/height. * Add a link to the hf space in the readme.
* Use the yolo-v8 weights from the hub. (#544)Laurent Mazare2023-08-211-4/+30
| | | | | * Use the weights from the hub. * Add to the readme.
* Yolo v8 fixes (#542)Laurent Mazare2023-08-211-14/+15
| | | | | | | | | | | * Fixes for the yolo-v8 layout. * Bugfixes. * Another silly bugfix. * Remove the hf-hub dependency. * Remove the transformers dependency.
* Add yolo v8 as an example (#541)Laurent Mazare2023-08-212-0/+861
* Sketching yolo-v8. * Get the model to load. * yolo-v8 forward pass. * Complete(?) the forward pass. * Fix some shape issues. * Add the missing padding. * Process the predictions.