Commit Graph

3 Commits

Author SHA1 Message Date
Paul Huliganga b504b89b2a feat: add exp17 parallel DummyVecEnv 450k training + strategy docs
- exp17_parallel_450k.py: parallel two-track training (generated_track:9091,
  mountain_track:9093), 450k steps, v6 reward, HOST=localhost
- DECISIONS.md: ADR-025 (parallel strategy) and ADR-026 (mountain friction fix)
- docs/STATE.md: updated to April 2026 state with current champions and strategy
- docs/TEST_HISTORY.md: mountain friction fix notes + Exp 17 full design
- outerloop-results: exp14 finetune logs and robust mountain eval results

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-28 02:42:20 -04:00
Paul Huliganga a6831459dd docs: STATE.md updated with April 16 test results
Key findings:
- Trial 9: drives generated_track (3/3) AND mini_monaco zero-shot (40s laps)
- Trial 19: drives generated_track (2/3)
- Trial 3: corrupted, policy-only recovery still crashes at ~104 steps
- Generated_track lighting variation per episode may be key to generalisation
- Phase 2 champion: confirmed still drives generated_road perfectly

Agent: pi
Tests: 102 passed
Tests-Added: 0
TypeScript: N/A
2026-04-16 20:45:45 -04:00
Paul Huliganga 792b6734f7 docs: STATE.md — full project state as of April 16 end of Wave 4
Documents all 25 trial results, known models, what is confirmed vs
unknown, and the 6 pending verification tests agreed with user.

Agent: pi
Tests: 102 passed
Tests-Added: 0
TypeScript: N/A
2026-04-16 20:17:41 -04:00