This website requires JavaScript.
6e9546cd22
save: all experiment scripts moved from /tmp to agent/experiments/
master
Paul Huliganga
2026-04-18 21:30:08 -0400
de7b9bc302
fix: multitrack_runner must use VecTransposeImage(DummyVecEnv) not plain wrap_env
Paul Huliganga
2026-04-18 18:33:40 -0400
fecba1dd35
docs: TEST_HISTORY Exp10 plan added
Paul Huliganga
2026-04-18 17:59:07 -0400
b19dcc8b80
feat: run_eval.py — standard eval runner with persistent logging
Paul Huliganga
2026-04-18 15:32:36 -0400
eb4fd39056
docs: TEST_HISTORY updated with Exp8 results and Exp9 plan
Paul Huliganga
2026-04-18 13:40:45 -0400
041481916d
docs: TEST_HISTORY.md — comprehensive record of all experiments
Paul Huliganga
2026-04-18 11:18:53 -0400
47d8e5b346
fix: short-lap exploit now TERMINATES the episode, not just penalises
Paul Huliganga
2026-04-18 10:42:23 -0400
10719b4ff6
fix: save numbered checkpoint every segment, never overwrite
Paul Huliganga
2026-04-17 22:10:37 -0400
fc01057c14
docs: ADR-017 — always save best model, never just latest
Paul Huliganga
2026-04-17 16:03:59 -0400
4f77b8a468
fix: always save and return the BEST model, not the last one
Paul Huliganga
2026-04-17 14:45:37 -0400
0b5ce6ab7e
docs: ARCHITECTURE.md — complete system architecture guide
Paul Huliganga
2026-04-17 14:06:38 -0400
b8a13dea81
feat: v5 reward — speed × CTE-quality, drop efficiency term
Paul Huliganga
2026-04-17 13:25:38 -0400
a6831459dd
docs: STATE.md updated with April 16 test results
Paul Huliganga
2026-04-16 20:45:45 -0400
792b6734f7
docs: STATE.md — full project state as of April 16 end of Wave 4
Paul Huliganga
2026-04-16 20:17:41 -0400
619188bf17
wave3: autoresearch trial 25 results
Paul Huliganga
2026-04-16 20:01:55 -0400
c8c17e2e46
wave3: autoresearch trial 25 results
Paul Huliganga
2026-04-16 20:01:51 -0400
a3a49fbcaf
feat: eval_on_track.py — proper zero-shot eval on any track
Paul Huliganga
2026-04-16 19:47:56 -0400
a5577fb3e7
feat: shuttle-exploit detection in mini_monaco eval
Paul Huliganga
2026-04-16 17:29:30 -0400
96c49dd057
wave3: autoresearch trial 20 results
Paul Huliganga
2026-04-16 14:10:06 -0400
45b057e9c1
wave3: autoresearch trial 15 results
Paul Huliganga
2026-04-16 08:43:17 -0400
0505de7e63
wave3: autoresearch trial 10 results
Paul Huliganga
2026-04-16 03:31:41 -0400
b00f63dfbc
fix: save_dir not in scope inside train_multitrack — crashed every trial
Paul Huliganga
2026-04-15 22:47:29 -0400
ff8bdd8b8a
docs: ADR-013 through ADR-016 — decisions that were lost to context compaction
Paul Huliganga
2026-04-15 22:34:48 -0400
a9eed2faa3
fix: restart with verified config + seed GP with overnight 1943 result
Paul Huliganga
2026-04-15 22:26:53 -0400
e61ebc5b38
fix: prevent trial timeouts losing all data
Paul Huliganga
2026-04-15 21:54:50 -0400
5714a96bfb
wave3: autoresearch trial 5 results
Paul Huliganga
2026-04-15 17:08:50 -0400
c10e56d894
fix: cap total_timesteps at 120k to prevent 2hr timeout
Paul Huliganga
2026-04-15 16:30:07 -0400
f9f6a09744
fix: StuckTerminationWrapper + deque import + 102 tests
Paul Huliganga
2026-04-15 09:17:27 -0400
5d1227833d
fix: close short-lap circle exploit and cap segment eval episode length
Paul Huliganga
2026-04-15 09:06:25 -0400
1be95b7c82
wave3: autoresearch trial 5 results
Paul Huliganga
2026-04-15 07:15:57 -0400
860e3d6610
fix: fresh PPO verbose=0 suppressed all training output — set verbose=1
Paul Huliganga
2026-04-14 22:44:22 -0400
7534527722
Wave 4: scratch training on generated_track + mountain_track, zero-shot mini_monaco
Paul Huliganga
2026-04-14 22:40:38 -0400
650f893d2d
fix: complete LR override — must patch lr_schedule, not just param_groups
Paul Huliganga
2026-04-14 21:27:43 -0400
298cd1790a
fix: LR override was not reaching the optimizer — all trials ran at 0.000225
Paul Huliganga
2026-04-14 20:37:48 -0400
2a747bb97c
wave3: autoresearch trial 5 results
Paul Huliganga
2026-04-14 18:22:44 -0400
349396f967
fix: stream runner output in real-time instead of buffering
Paul Huliganga
2026-04-14 15:13:10 -0400
7ed2456896
fix: remove Warren from test set — indoor carpet, broken done condition
Paul Huliganga
2026-04-14 13:47:28 -0400
86657a26b8
wave3: fix track-switch bug (viewer not raw socket) + shorten trial budgets
Paul Huliganga
2026-04-14 13:29:49 -0400
4ca5304a71
wave3: add multi-track autoresearch system (83 tests passing)
Paul Huliganga
2026-04-14 12:47:12 -0400
26251c7d0c
results: complete multi-track generalization baseline — 1/10 tracks drivable pre-Wave3
Paul Huliganga
2026-04-14 11:31:08 -0400
5a626c87be
feat: comprehensive multi-track evaluation script + research log updates
Paul Huliganga
2026-04-14 10:11:47 -0400
ce120393af
fix: track switching via unwrapped viewer.exit_scene() — automatic scene changes work
Paul Huliganga
2026-04-14 10:04:15 -0400
0fbd15a941
eval: multi-track generalization test — all 3 models drive new road + generated track
Paul Huliganga
2026-04-14 09:50:28 -0400
e68d618d29
feat: Phase 3 — behavioral control, enhanced evaluator, 53 tests
Paul Huliganga
2026-04-14 09:28:43 -0400
cfd1f843a4
autoresearch: phase1 trial 20 results
Paul Huliganga
2026-04-14 04:35:49 -0400
5114a95a74
autoresearch: phase1 trial 20 results
Paul Huliganga
2026-04-14 04:35:45 -0400
52b8a4a10e
autoresearch: phase1 trial 15 results
Paul Huliganga
2026-04-14 02:56:38 -0400
6c8c5b25a9
autoresearch: phase1 trial 10 results
Paul Huliganga
2026-04-14 00:56:14 -0400
2d6fe2c962
autoresearch: phase1 trial 5 results
Paul Huliganga
2026-04-13 22:46:54 -0400
c8a495dd22
fix: reward v4 — full sim bypass kills circular driving at root
Paul Huliganga
2026-04-13 20:56:32 -0400
7b8830f0cb
milestone: Phase 1 complete — genuine driving confirmed; launch Phase 2 corner learning
Paul Huliganga
2026-04-13 19:33:06 -0400
cb82121e98
autoresearch: phase1 trial 50 results
Paul Huliganga
2026-04-13 19:18:00 -0400
3cbe4bd26e
autoresearch: phase1 trial 50 results
Paul Huliganga
2026-04-13 19:17:56 -0400
4c9b68dd47
autoresearch: phase1 trial 40 results
Paul Huliganga
2026-04-13 18:15:31 -0400
ed65cf5997
autoresearch: phase1 trial 30 results
Paul Huliganga
2026-04-13 17:28:19 -0400
29a45e017b
autoresearch: phase1 trial 20 results
Paul Huliganga
2026-04-13 16:38:17 -0400
caf91c9fe6
autoresearch: phase1 trial 10 results
Paul Huliganga
2026-04-13 16:00:23 -0400
87cff0c9b7
autoresearch: phase1 trial 40 results
Paul Huliganga
2026-04-13 15:28:05 -0400
1734e1359e
autoresearch: phase1 trial 30 results
Paul Huliganga
2026-04-13 15:13:21 -0400
362c616457
autoresearch: phase1 trial 20 results
Paul Huliganga
2026-04-13 14:41:55 -0400
cdb7b80494
autoresearch: phase1 trial 10 results
Paul Huliganga
2026-04-13 14:07:58 -0400
fcb6ea1ac2
fix: path-efficiency reward (v3) defeats circular driving exploit
Paul Huliganga
2026-04-13 13:36:17 -0400
d25bc71008
autoresearch: phase1 trial 10 results
Paul Huliganga
2026-04-13 13:11:06 -0400
5e93dae316
fix: hack-proof reward shaping + reward hacking detection + research log
Paul Huliganga
2026-04-13 12:27:48 -0400
0c6263352b
autoresearch: phase1 trial 10 results
Paul Huliganga
2026-04-13 12:01:17 -0400
8c9fd76c68
fix: reduce timesteps to 1k-5k for Phase 1 CPU training; add sim health/stuck detection; fix PPO throttle clamp
Paul Huliganga
2026-04-13 11:17:08 -0400
c804189dd0
feat: Wave 1 complete — real PPO training, model save, GP+UCB autoresearch, 37 tests passing
Paul Huliganga
2026-04-13 10:03:15 -0400
083326a497
AUTORESEARCH: 300 total trials complete - best mean_reward=141.85 at n_steer=8, n_throttle=5, lr=0.00202
Paul Huliganga
2026-04-13 01:56:06 -0400
3446e5f7c1
AUTORESEARCH: 100 trials complete - best mean_reward=114.56 at n_steer=8, n_throttle=4, lr=0.00208
Paul Huliganga
2026-04-13 01:13:20 -0400
bb9e6d9105
AUTORESEARCH: Full Karpathy-style GP+UCB meta-controller, clean base data, fixed all paths, ready to run
Paul Huliganga
2026-04-13 00:52:00 -0400
4a4e61d463
CLEAN: Robust multi-episode RL runner, no legacy save/model logic; outer loop points to project dir runner.
Paul Huliganga
2026-04-13 00:28:45 -0400
c98bc7ef38
Initial commit
Paul Huliganga
2026-04-12 23:44:36 -0400
2cadd1a78a
Initial commit: stable RL sweep runner, legacy and new scripts, full docs included
Paul Huliganga
2026-04-12 22:57:50 -0400