summaryrefslogtreecommitdiff
path: root/candle-examples/examples/reinforcement-learning/ddpg.rs
Commit message (Collapse)AuthorAgeFilesLines
* Remove some unused macros. (#2618)Laurent Mazare2024-11-151-3/+5
| | | | | * Remove some unused macros. * More unused fixes.
* Detach the tensors on batch-norm eval. (#1702)Laurent Mazare2024-02-131-1/+1
| | | | | | | | | | | | | * Detach the tensors on batch-norm eval. * Fix pyo3 bindings. * Black tweak. * Formatting. * Also update the pyo3-onnx formatting. * Apply black.
* Add Policy Gradient to Reinforcement Learning examples (#1500)s-casci2023-12-301-0/+105
| | | | | | | | | | | | | | | * added policy_gradient, modified main, ddpg and README * fixed typo in README * removed unnecessary imports * small refactor * Use clap for picking up the subcommand to run. --------- Co-authored-by: Laurent <laurent.mazare@gmail.com>
* Add DDPG and fix Gym wrapper (#1207)Travis Hammond2023-10-281-0/+451
* Fix Gym wrapper - It was returning things in the wrong order - Gym now differentiates between terminated and truncated * Add DDPG * Apply fixes * Remove Result annotations * Also remove Vec annotation * rustfmt * Various small improvements (avoid cloning, mutability, get clippy to pass, ...) --------- Co-authored-by: Travis Hammond <travis.hammond@alexanderthamm.com> Co-authored-by: Laurent <laurent.mazare@gmail.com>