Warm-starts from exp26/best_model (best road model) and fine-tunes on donkey-generated-track-v0 (shadows, trees) at LR=0.00005. Adds N_THROTTLE=3 variable throttle to force learning corner braking. 50k steps, eval on mini-monaco (zero-shot) at completion. Goal: visual diversity + throttle variation → better mini-monaco generalization. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> |
||
|---|---|---|
| .. | ||
| run_2026-05-06_223031_gentrack_finetune.log | ||
| run_2026-05-06_223604_gentrack_finetune.log | ||
| run_2026-05-06_224117_gentrack_finetune.log | ||
| run_2026-05-06_224220_gentrack_finetune.log | ||
| stdout.log | ||