donkeycar-rl-autoresearch/agent/models/exp29-wave4-finetune
Paul Huliganga 1d53bf613f feat(exp29): fine-tune wave4-trial-0009 on generated track (continuous actions)
Warm-starts from wave4-trial-0009/model.zip (best mini-monaco model, completed
laps). Fine-tunes on generated track with continuous Box action space preserved
(no DiscretizedActionWrapper) at LR=0.00005. 50k steps, checkpoint every 5k,
zero-shot mini-monaco eval at end.

Tests whether additional generated-track exposure improves corner handling on
mini-monaco without catastrophic forgetting of driving skill.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-14 15:32:43 -04:00
..
current.pid feat(exp29): fine-tune wave4-trial-0009 on generated track (continuous actions) 2026-05-14 15:32:43 -04:00
run_2026-05-06_225559_wave4_finetune.log feat(exp29): fine-tune wave4-trial-0009 on generated track (continuous actions) 2026-05-14 15:32:43 -04:00
stdout.log feat(exp29): fine-tune wave4-trial-0009 on generated track (continuous actions) 2026-05-14 15:32:43 -04:00