# Implementation Plan — DonkeyCar RL Autoresearch

Agent: read this at the start of every iteration. Pick the first unchecked task in the current active wave. Mark done immediately after commit.


## Wave 1: Real Training Foundation — COMPLETE

All tasks done. Phase 1 champion achieved genuine forward driving.

## Wave 2: Track Completion — COMPLETE

All top three Phase 2 models complete the full track. Champion: Trial 20 — n_steer=3, n_throttle=5, lr=0.000225, 13k steps. Driving style: right lane, very stable; completes the full track in ~2874 steps. Key finding: n_steer=3 > n_steer=4 (fewer bins = more decisive = less oscillation).
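The n_steer/n_throttle hyperparameters above describe a discretized action space. A minimal sketch of how such a grid could be built, assuming a [-1, 1] steering range and [0, 1] throttle range (the helper names and ranges are illustrative, not the project's actual code):

```python
# Illustrative sketch of the discrete action grid implied by n_steer=3,
# n_throttle=5: each discrete action index maps to one (steering, throttle)
# pair. The [-1, 1] steering and [0, 1] throttle ranges are assumptions.
import itertools


def _linspace(lo, hi, n):
    """Evenly spaced values from lo to hi inclusive."""
    if n == 1:
        return [(lo + hi) / 2]
    return [lo + i * (hi - lo) / (n - 1) for i in range(n)]


def build_action_grid(n_steer=3, n_throttle=5):
    steers = _linspace(-1.0, 1.0, n_steer)      # hard left, straight, hard right
    throttles = _linspace(0.0, 1.0, n_throttle)
    return list(itertools.product(steers, throttles))
```

With n_steer=3 the only steering choices are hard left, straight, and hard right, which is consistent with the "fewer bins = more decisive" finding: there is no cluster of near-straight bins to oscillate between.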


## Wave 3: Behavioral Control & Speed Optimization

Goal: Control driving style (lane position, oscillation), measure lap time, optimize for speed. Gate: Phase 2 champion completes the full track (DONE). Status: 🟠 In progress

### Stream 3A: Enhanced Evaluator + Metrics

- [ ] 3A-01 — Update champion to Phase 2 Trial 20
- [ ] 3A-02 — Add lap time measurement to evaluate_champion.py
- [ ] 3A-03 — Add steering oscillation metric (std of steering actions per episode)
- [ ] 3A-04 — Add lane position histogram (distribution of CTE values)
- [ ] 3A-05 — Save eval summary to outerloop-results/eval_summary.jsonl
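The 3A-03, 3A-04, and 3A-05 metrics can be sketched as plain functions over per-episode logs. The function names and summary schema below are illustrative, not the actual evaluate_champion.py API:

```python
# Sketch of the Stream 3A episode metrics, assuming the evaluator collects
# per-step steering actions and CTE values into plain Python lists.
import json
import statistics
from collections import Counter


def steering_oscillation(steering_actions):
    """3A-03: std of steering actions over one episode (higher = more wobble)."""
    if len(steering_actions) < 2:
        return 0.0
    return statistics.stdev(steering_actions)


def lane_histogram(cte_values, bin_width=0.5):
    """3A-04: histogram of cross-track error, bucketed into bin_width bins."""
    return Counter(round(cte / bin_width) * bin_width for cte in cte_values)


def append_eval_summary(path, summary):
    """3A-05: append one JSON line per evaluation run (JSONL format)."""
    with open(path, "a") as f:
        f.write(json.dumps(summary) + "\n")
```

JSONL (one JSON object per line, opened in append mode) keeps the eval history greppable across autoresearch iterations without rewriting the whole file.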

### Stream 3B: Behavioral Reward Variants

- [ ] 3B-01 — LanePositionWrapper: reward = 1 - abs(cte - target)/max_cte with configurable target CTE offset
- [ ] 3B-02 — AntiOscillationWrapper: adds a penalty for rapid steering changes (smoothness reward)
- [ ] 3B-03 — AsymmetricCTEWrapper: penalizes left-of-center more heavily (enforces the right-lane rule)
- [ ] 3B-04 — Tests for all three wrappers (no simulator required)
- [ ] 3B-05 — Integrate wrapper selection into autoresearch_controller.py via a --behavior flag
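A minimal sketch of the 3B-02 idea, assuming the classic gym step signature `(obs, reward, done, info)`, a `[steering, throttle]` action layout, and an assumed `smooth_coef` penalty weight — not the project's actual wrapper:

```python
# Sketch of an anti-oscillation reward wrapper: subtract a penalty
# proportional to the change in steering between consecutive steps.
# smooth_coef and the [steering, throttle] action layout are assumptions.
class AntiOscillationWrapper:
    """Penalize rapid steering changes to encourage smooth driving."""

    def __init__(self, env, smooth_coef=0.5):
        self.env = env
        self.smooth_coef = smooth_coef
        self._last_steer = None

    def reset(self):
        self._last_steer = None
        return self.env.reset()

    def step(self, action):
        steer = action[0]
        obs, reward, done, info = self.env.step(action)
        if self._last_steer is not None:
            # Large step-to-step steering deltas reduce the reward.
            reward -= self.smooth_coef * abs(steer - self._last_steer)
        self._last_steer = steer
        return obs, reward, done, info
```

Because the penalty depends only on the action stream, this wrapper can be unit-tested against a dummy env with a constant reward, satisfying the "no simulator required" constraint of 3B-04.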

### Stream 3C: Speed Optimization

- [ ] 3C-01 — Measure actual lap time using last_lap_time from sim info dict
- [ ] 3C-02 — Update reward to incorporate lap time: reward += lap_bonus if lap_completed
- [ ] 3C-03 — Run targeted autoresearch starting from Phase 2 champion checkpoint
- [ ] 3C-04 — Fine-tuning: load Phase 2 champion weights, continue training with speed reward
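One possible shape for the 3C-02 lap bonus, assuming lap completion is detected by a change in the sim's last_lap_time field (per 3C-01); the bonus_scale value and inverse-time form are assumptions, not decided design:

```python
# Sketch of lap-time reward shaping: when last_lap_time changes, a lap
# just completed, so add a bonus that grows as lap time shrinks.
# bonus_scale and the change-detection heuristic are assumptions.
def lap_bonus_reward(reward, info, prev_lap_time, bonus_scale=100.0):
    """Return (shaped_reward, new_prev_lap_time).

    Call once per step with the sim's info dict and the last seen lap time.
    """
    lap_time = info.get("last_lap_time", 0.0)
    if lap_time and lap_time != prev_lap_time:
        # Faster laps earn a larger bonus (inverse-time shaping).
        reward += bonus_scale / lap_time
    return reward, lap_time
```

Making the bonus inversely proportional to lap time (rather than a flat constant) gives the fine-tuned policy a gradient toward faster laps, not just toward finishing.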

### Stream 3D: Multi-Track Generalization

- [ ] 3D-01 — Evaluate champion on 2nd track (e.g., donkey-mountain-track-v0)
- [ ] 3D-02 — Track-agnostic training: alternate episodes between 2 tracks
- [ ] 3D-03 — Measure generalization gap (train_track vs unseen_track reward)
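The 3D-03 gap reduces to a small helper once mean episode rewards on both tracks are available; the relative-gap normalization below is one possible convention, not a fixed project choice:

```python
# Sketch of the generalization-gap metric: absolute and relative drop in
# mean episode reward from the training track to an unseen track.
def generalization_gap(train_reward, unseen_reward):
    """Return (absolute_gap, relative_gap)."""
    gap = train_reward - unseen_reward
    rel = gap / abs(train_reward) if train_reward else float("inf")
    return gap, rel
```

Reporting the relative gap alongside the absolute one makes results comparable across reward variants whose raw scales differ (e.g. with and without the lap bonus).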

## Wave 4: Racing (future)

Goal: Fastest possible lap on any track. Gate: Wave 3 complete; multi-track generalization proven. Status: ⏸️ Not started

- [ ] 4-01 — Pure lap time reward (replace the CTE-based reward with a time-based one)
- [ ] 4-02 — Head-to-head: autoresearch champion vs human-tuned config
- [ ] 4-03 — Research paper / writeup structure

## Notes

- Phase 2 key finding: n_steer=3 outperforms n_steer=4 (counterintuitive — fewer bins = better)
- CTE symmetry: the reward is symmetric, so the car picks the left or right lane based on random NN init
- Track ends! The track has a physical finish — runs end on track completion, not timeout
- Reward v4 (base × efficiency × speed) successfully eliminated all circular-driving exploits
- Champion model path: agent/models/champion/model.zip (Trial 20, Phase 2)