Commit Graph

1 Commit

Author SHA1 Message Date
Paul Huliganga 1d53bf613f feat(exp29): fine-tune wave4-trial-0009 on generated track (continuous actions)
Warm-starts from wave4-trial-0009/model.zip (the best mini-monaco model; it
completed laps). Fine-tunes on the generated track at LR=0.00005, keeping the
continuous Box action space (no DiscretizedActionWrapper). Runs 50k steps with
a checkpoint every 5k, then a zero-shot mini-monaco eval at the end.

Tests whether additional generated-track exposure improves corner handling on
mini-monaco without catastrophic forgetting of driving skill.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-14 15:32:43 -04:00
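
For reference, the run parameters stated in the commit message can be captured in a small config object. This is only a sketch: the class name `FineTuneConfig` and its field names are hypothetical and are not taken from the repository. It also assumes checkpoints land on exact multiples of 5k, including the final step at 50k, which the commit message does not confirm.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class FineTuneConfig:
    # Values come from the commit message; the field names are hypothetical.
    warm_start_path: str = "wave4-trial-0009/model.zip"
    learning_rate: float = 0.00005
    total_steps: int = 50_000
    checkpoint_every: int = 5_000
    # False means the continuous Box action space is kept
    # (no DiscretizedActionWrapper is applied).
    discretize_actions: bool = False

    def checkpoint_steps(self) -> List[int]:
        """Step counts at which a checkpoint would be written.

        Assumes checkpoints fall on exact multiples of checkpoint_every,
        including the final step.
        """
        return list(
            range(self.checkpoint_every, self.total_steps + 1, self.checkpoint_every)
        )


config = FineTuneConfig()
steps = config.checkpoint_steps()
print(len(steps), steps[0], steps[-1])  # 10 5000 50000
```

Under these assumptions the run would produce ten checkpoints. Each one could be evaluated on mini-monaco to see where forgetting sets in, if it does.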