donkeycar-rl-autoresearch/agent/models/exp22-generated-pair-warm-v6
Paul Huliganga 138c65270f feat(exp22): add solid-hit/wedge/high-CTE exploit fixes and generated-pair warm experiments
- reward_wrapper: detect barrier/wall/tree solid hits, terminate on head-on impact
  or 4 sustained solid-hit frames; prevents car wedging against invisible barriers
- reward_wrapper: add low-speed/wedge termination — kills episode when car is pinned
  motionless (below threshold, no displacement) after grace period
- reward_wrapper: high-CTE exploit fix — return -0.25 immediately when CTE >
  max_cte_terminate (not after patience), so PPO cannot collect positive speed
  rewards while driving the large outside-road circle
- tests: 23 passing unit tests covering all new termination paths
- exp20/21/22: add parallel DummyVecEnv experiments on generated_road+generated_track
  with warm-start from champion model; exp22 is current active run
- SESSION_HANDOFF.md: live handoff doc for next session continuity

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-05 14:46:13 -04:00
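
The three reward_wrapper termination paths described in the commit message can be sketched roughly as follows. This is a minimal illustration, not the repository's code: the class name `ExploitGuard`, the `check` signature, the reason strings, and every threshold except the -0.25 penalty and the 4-frame solid-hit limit are assumptions. The commit message also does not say whether the high-CTE step ends the episode, so this sketch only overrides the reward and leaves termination to whatever patience logic already exists.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class ExploitGuard:
    """Hypothetical stand-in for the checks named in the commit message."""
    max_cte_terminate: float = 4.0   # assumed value
    cte_penalty: float = -0.25       # from the commit message
    solid_hit_limit: int = 4         # from the commit message
    min_speed: float = 0.1           # assumed value
    min_displacement: float = 0.05   # assumed value
    grace_steps: int = 20            # assumed value
    _hit_streak: int = 0             # consecutive solid-hit frames so far
    _steps: int = 0                  # steps taken in the current episode

    def reset(self) -> None:
        """Clear per-episode counters."""
        self._hit_streak = 0
        self._steps = 0

    def check(self, reward: float, cte: float, speed: float,
              displacement: float, solid_hit: bool = False,
              head_on: bool = False) -> Tuple[float, bool, str]:
        """Return (reward, terminated, reason) for one environment step."""
        self._steps += 1
        # The streak counts only consecutive solid-hit frames.
        self._hit_streak = self._hit_streak + 1 if solid_hit else 0

        # Solid hits: end on a head-on impact or on a sustained contact.
        if solid_hit and head_on:
            return reward, True, "head_on_impact"
        if self._hit_streak >= self.solid_hit_limit:
            return reward, True, "sustained_solid_hit"

        # High CTE: override the reward at once, so speed rewards cannot
        # accumulate while the car is far off the road.
        if abs(cte) > self.max_cte_terminate:
            return self.cte_penalty, False, "high_cte"

        # Wedge: after the grace period, a slow car with no displacement
        # is treated as pinned.
        pinned = speed < self.min_speed and displacement < self.min_displacement
        if self._steps > self.grace_steps and pinned:
            return reward, True, "wedged"

        return reward, False, ""
```

In a Gym-style wrapper, `check` would be called once per `step` with values taken from the simulator's `info` dict, and `reset` once per episode; which `info` keys carry the hit and speed data is not stated in the commit message.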
current.pid
run_2026-04-28_2132_openfix.log
run_2026-04-28_211553.log
run_2026-04-28_211650.log
run_2026-05-05_141211_barrierfix.log
run_2026-05-05_141343_barrierfix.log
run_2026-05-05_141929_strictcte.log
run_2026-05-05_142620_wedgefix.log
run_2026-05-05_restart.log
run_session.log