donkeycar-rl-autoresearch/agent/models/exp26-warmstart
Paul Huliganga 8de4838c6b feat(exp26): warm-start training from exp25 best_model (300k steps)
Loads exp25 best_model (381r @ 80k) to skip early exploration. Runs 300k
steps on generated_road with road regen every 10k steps. Python-side hit
check is now active (added late in exp25, not loaded then). Final cross-model
eval: exp26 best (9/10 full eps, 381.2r mean) — top performer.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-14 15:32:16 -04:00
..
current.pid feat(exp26): warm-start training from exp25 best_model (300k steps) 2026-05-14 15:32:16 -04:00
run_2026-05-06_073652_warmstart.log feat(exp26): warm-start training from exp25 best_model (300k steps) 2026-05-14 15:32:16 -04:00