Commit Graph

  • 6e9546cd22 save: all experiment scripts moved from /tmp to agent/experiments/ [master] Paul Huliganga 2026-04-18 21:30:08 -0400
  • de7b9bc302 fix: multitrack_runner must use VecTransposeImage(DummyVecEnv) not plain wrap_env Paul Huliganga 2026-04-18 18:33:40 -0400
  • fecba1dd35 docs: TEST_HISTORY Exp10 plan added Paul Huliganga 2026-04-18 17:59:07 -0400
  • b19dcc8b80 feat: run_eval.py — standard eval runner with persistent logging Paul Huliganga 2026-04-18 15:32:36 -0400
  • eb4fd39056 docs: TEST_HISTORY updated with Exp8 results and Exp9 plan Paul Huliganga 2026-04-18 13:40:45 -0400
  • 041481916d docs: TEST_HISTORY.md — comprehensive record of all experiments Paul Huliganga 2026-04-18 11:18:53 -0400
  • 47d8e5b346 fix: short-lap exploit now TERMINATES the episode, not just penalises Paul Huliganga 2026-04-18 10:42:23 -0400
  • 10719b4ff6 fix: save numbered checkpoint every segment, never overwrite Paul Huliganga 2026-04-17 22:10:37 -0400
  • fc01057c14 docs: ADR-017 — always save best model, never just latest Paul Huliganga 2026-04-17 16:03:59 -0400
  • 4f77b8a468 fix: always save and return the BEST model, not the last one Paul Huliganga 2026-04-17 14:45:37 -0400
  • 0b5ce6ab7e docs: ARCHITECTURE.md — complete system architecture guide Paul Huliganga 2026-04-17 14:06:38 -0400
  • b8a13dea81 feat: v5 reward — speed × CTE-quality, drop efficiency term Paul Huliganga 2026-04-17 13:25:38 -0400
  • a6831459dd docs: STATE.md updated with April 16 test results Paul Huliganga 2026-04-16 20:45:45 -0400
  • 792b6734f7 docs: STATE.md — full project state as of April 16 end of Wave 4 Paul Huliganga 2026-04-16 20:17:41 -0400
  • 619188bf17 wave3: autoresearch trial 25 results Paul Huliganga 2026-04-16 20:01:55 -0400
  • c8c17e2e46 wave3: autoresearch trial 25 results Paul Huliganga 2026-04-16 20:01:51 -0400
  • a3a49fbcaf feat: eval_on_track.py — proper zero-shot eval on any track Paul Huliganga 2026-04-16 19:47:56 -0400
  • a5577fb3e7 feat: shuttle-exploit detection in mini_monaco eval Paul Huliganga 2026-04-16 17:29:30 -0400
  • 96c49dd057 wave3: autoresearch trial 20 results Paul Huliganga 2026-04-16 14:10:06 -0400
  • 45b057e9c1 wave3: autoresearch trial 15 results Paul Huliganga 2026-04-16 08:43:17 -0400
  • 0505de7e63 wave3: autoresearch trial 10 results Paul Huliganga 2026-04-16 03:31:41 -0400
  • b00f63dfbc fix: save_dir not in scope inside train_multitrack — crashed every trial Paul Huliganga 2026-04-15 22:47:29 -0400
  • ff8bdd8b8a docs: ADR-013 through ADR-016 — decisions that were lost to context compaction Paul Huliganga 2026-04-15 22:34:48 -0400
  • a9eed2faa3 fix: restart with verified config + seed GP with overnight 1943 result Paul Huliganga 2026-04-15 22:26:53 -0400
  • e61ebc5b38 fix: prevent trial timeouts losing all data Paul Huliganga 2026-04-15 21:54:50 -0400
  • 5714a96bfb wave3: autoresearch trial 5 results Paul Huliganga 2026-04-15 17:08:50 -0400
  • c10e56d894 fix: cap total_timesteps at 120k to prevent 2hr timeout Paul Huliganga 2026-04-15 16:30:07 -0400
  • f9f6a09744 fix: StuckTerminationWrapper + deque import + 102 tests Paul Huliganga 2026-04-15 09:17:27 -0400
  • 5d1227833d fix: close short-lap circle exploit and cap segment eval episode length Paul Huliganga 2026-04-15 09:06:25 -0400
  • 1be95b7c82 wave3: autoresearch trial 5 results Paul Huliganga 2026-04-15 07:15:57 -0400
  • 860e3d6610 fix: fresh PPO verbose=0 suppressed all training output — set verbose=1 Paul Huliganga 2026-04-14 22:44:22 -0400
  • 7534527722 Wave 4: scratch training on generated_track + mountain_track, zero-shot mini_monaco Paul Huliganga 2026-04-14 22:40:38 -0400
  • 650f893d2d fix: complete LR override — must patch lr_schedule, not just param_groups Paul Huliganga 2026-04-14 21:27:43 -0400
  • 298cd1790a fix: LR override was not reaching the optimizer — all trials ran at 0.000225 Paul Huliganga 2026-04-14 20:37:48 -0400
  • 2a747bb97c wave3: autoresearch trial 5 results Paul Huliganga 2026-04-14 18:22:44 -0400
  • 349396f967 fix: stream runner output in real-time instead of buffering Paul Huliganga 2026-04-14 15:13:10 -0400
  • 7ed2456896 fix: remove Warren from test set — indoor carpet, broken done condition Paul Huliganga 2026-04-14 13:47:28 -0400
  • 86657a26b8 wave3: fix track-switch bug (viewer not raw socket) + shorten trial budgets Paul Huliganga 2026-04-14 13:29:49 -0400
  • 4ca5304a71 wave3: add multi-track autoresearch system (83 tests passing) Paul Huliganga 2026-04-14 12:47:12 -0400
  • 26251c7d0c results: complete multi-track generalization baseline — 1/10 tracks drivable pre-Wave3 Paul Huliganga 2026-04-14 11:31:08 -0400
  • 5a626c87be feat: comprehensive multi-track evaluation script + research log updates Paul Huliganga 2026-04-14 10:11:47 -0400
  • ce120393af fix: track switching via unwrapped viewer.exit_scene() — automatic scene changes work Paul Huliganga 2026-04-14 10:04:15 -0400
  • 0fbd15a941 eval: multi-track generalization test — all 3 models drive new road + generated track Paul Huliganga 2026-04-14 09:50:28 -0400
  • e68d618d29 feat: Phase 3 — behavioral control, enhanced evaluator, 53 tests Paul Huliganga 2026-04-14 09:28:43 -0400
  • cfd1f843a4 autoresearch: phase1 trial 20 results Paul Huliganga 2026-04-14 04:35:49 -0400
  • 5114a95a74 autoresearch: phase1 trial 20 results Paul Huliganga 2026-04-14 04:35:45 -0400
  • 52b8a4a10e autoresearch: phase1 trial 15 results Paul Huliganga 2026-04-14 02:56:38 -0400
  • 6c8c5b25a9 autoresearch: phase1 trial 10 results Paul Huliganga 2026-04-14 00:56:14 -0400
  • 2d6fe2c962 autoresearch: phase1 trial 5 results Paul Huliganga 2026-04-13 22:46:54 -0400
  • c8a495dd22 fix: reward v4 — full sim bypass kills circular driving at root Paul Huliganga 2026-04-13 20:56:32 -0400
  • 7b8830f0cb milestone: Phase 1 complete — genuine driving confirmed; launch Phase 2 corner learning Paul Huliganga 2026-04-13 19:33:06 -0400
  • cb82121e98 autoresearch: phase1 trial 50 results Paul Huliganga 2026-04-13 19:18:00 -0400
  • 3cbe4bd26e autoresearch: phase1 trial 50 results Paul Huliganga 2026-04-13 19:17:56 -0400
  • 4c9b68dd47 autoresearch: phase1 trial 40 results Paul Huliganga 2026-04-13 18:15:31 -0400
  • ed65cf5997 autoresearch: phase1 trial 30 results Paul Huliganga 2026-04-13 17:28:19 -0400
  • 29a45e017b autoresearch: phase1 trial 20 results Paul Huliganga 2026-04-13 16:38:17 -0400
  • caf91c9fe6 autoresearch: phase1 trial 10 results Paul Huliganga 2026-04-13 16:00:23 -0400
  • 87cff0c9b7 autoresearch: phase1 trial 40 results Paul Huliganga 2026-04-13 15:28:05 -0400
  • 1734e1359e autoresearch: phase1 trial 30 results Paul Huliganga 2026-04-13 15:13:21 -0400
  • 362c616457 autoresearch: phase1 trial 20 results Paul Huliganga 2026-04-13 14:41:55 -0400
  • cdb7b80494 autoresearch: phase1 trial 10 results Paul Huliganga 2026-04-13 14:07:58 -0400
  • fcb6ea1ac2 fix: path-efficiency reward (v3) defeats circular driving exploit Paul Huliganga 2026-04-13 13:36:17 -0400
  • d25bc71008 autoresearch: phase1 trial 10 results Paul Huliganga 2026-04-13 13:11:06 -0400
  • 5e93dae316 fix: hack-proof reward shaping + reward hacking detection + research log Paul Huliganga 2026-04-13 12:27:48 -0400
  • 0c6263352b autoresearch: phase1 trial 10 results Paul Huliganga 2026-04-13 12:01:17 -0400
  • 8c9fd76c68 fix: reduce timesteps to 1k-5k for Phase 1 CPU training; add sim health/stuck detection; fix PPO throttle clamp Paul Huliganga 2026-04-13 11:17:08 -0400
  • c804189dd0 feat: Wave 1 complete — real PPO training, model save, GP+UCB autoresearch, 37 tests passing Paul Huliganga 2026-04-13 10:03:15 -0400
  • 083326a497 AUTORESEARCH: 300 total trials complete - best mean_reward=141.85 at n_steer=8, n_throttle=5, lr=0.00202 Paul Huliganga 2026-04-13 01:56:06 -0400
  • 3446e5f7c1 AUTORESEARCH: 100 trials complete - best mean_reward=114.56 at n_steer=8, n_throttle=4, lr=0.00208 Paul Huliganga 2026-04-13 01:13:20 -0400
  • bb9e6d9105 AUTORESEARCH: Full Karpathy-style GP+UCB meta-controller, clean base data, fixed all paths, ready to run Paul Huliganga 2026-04-13 00:52:00 -0400
  • 4a4e61d463 CLEAN: Robust multi-episode RL runner, no legacy save/model logic; outer loop points to project dir runner. Paul Huliganga 2026-04-13 00:28:45 -0400
  • c98bc7ef38 Initial commit Paul Huliganga 2026-04-12 23:44:36 -0400
  • 2cadd1a78a Initial commit: stable RL sweep runner, legacy and new scripts, full docs included Paul Huliganga 2026-04-12 22:57:50 -0400