donkeycar-rl-autoresearch/agent/outerloop-results/autoresearch_phase4_log.txt

[2026-04-14 22:40:44] =================================================================
[2026-04-14 22:40:44] [Wave4] Multi-Track Autoresearch — GP+UCB Generalization Search
[2026-04-14 22:40:44] [Wave4] Training tracks : generated_track, mountain_track  (no generated_road, no warm-start)
[2026-04-14 22:40:44] [Wave4] Test tracks     : mini_monaco only (zero-shot; warren removed — broken done condition)
[2026-04-14 22:40:44] [Wave4] Max trials      : 25 | kappa=2.0 | push every 5
[2026-04-14 22:40:44] [Wave4] Results file    : /home/paulh/projects/donkeycar-rl-autoresearch/agent/outerloop-results/autoresearch_results_phase4.jsonl
[2026-04-14 22:40:44] [Wave4] Champion dir    : /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-champion
[2026-04-14 22:40:44] [Wave4] Warm start      : NONE (training from scratch each trial)
[2026-04-14 22:40:44] =================================================================
[2026-04-14 22:40:44] [Wave4] Loaded 0 existing Phase 3 results.
[2026-04-14 22:40:44] [Wave4] No Wave 3 champion yet.
[2026-04-14 22:40:44] [Wave4] Starting from trial 1.
[2026-04-14 22:40:44]
[Wave4] ========== Trial 1/25 ==========
[2026-04-14 22:40:44] [Wave4] Seed trial 1/2: using hardcoded params.
[2026-04-14 22:40:44] [Wave4] Proposed params: {'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-14 22:40:46] [Wave4] Launching trial 1: {'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-14 22:40:46] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 80000 --steps-per-switch 6000 --learning-rate 0.0003 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0001
[2026-04-14 22:44:24] =================================================================
[2026-04-14 22:44:24] [Wave4] Multi-Track Autoresearch — GP+UCB Generalization Search
[2026-04-14 22:44:24] [Wave4] Training tracks : generated_track, mountain_track  (no generated_road, no warm-start)
[2026-04-14 22:44:24] [Wave4] Test tracks     : mini_monaco only (zero-shot; warren removed — broken done condition)
[2026-04-14 22:44:24] [Wave4] Max trials      : 25 | kappa=2.0 | push every 5
[2026-04-14 22:44:24] [Wave4] Results file    : /home/paulh/projects/donkeycar-rl-autoresearch/agent/outerloop-results/autoresearch_results_phase4.jsonl
[2026-04-14 22:44:24] [Wave4] Champion dir    : /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-champion
[2026-04-14 22:44:24] [Wave4] Warm start      : NONE (training from scratch each trial)
[2026-04-14 22:44:24] =================================================================
[2026-04-14 22:44:24] [Wave4] Loaded 0 existing Phase 3 results.
[2026-04-14 22:44:24] [Wave4] No Wave 3 champion yet.
[2026-04-14 22:44:24] [Wave4] Starting from trial 1.
[2026-04-14 22:44:24]
[Wave4] ========== Trial 1/25 ==========
[2026-04-14 22:44:24] [Wave4] Seed trial 1/2: using hardcoded params.
[2026-04-14 22:44:24] [Wave4] Proposed params: {'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-14 22:44:26] [Wave4] Launching trial 1: {'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-14 22:44:26] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 80000 --steps-per-switch 6000 --learning-rate 0.0003 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0001
[2026-04-15 00:02:45] [Wave4] Trial 1 finished in 4699.3s, rc=0
[2026-04-15 00:02:45] [Wave4] Parsed: combined=45.6693 mini_monaco=45.6693
[2026-04-15 00:02:45] [Champion] 🏆 NEW BEST! Trial 1: score=45.67 (mini_monaco=45.7) params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 00:02:45] [Wave4] ===== Trial 1 Summary =====
[2026-04-15 00:02:45]   GP data points : 1
[2026-04-15 00:02:45]   Wave4 Champion: trial=1 score=45.67 params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 00:02:45]   Top 5:
[2026-04-15 00:02:45]     score=45.67  params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 00:02:47]
[Wave4] ========== Trial 2/25 ==========
[2026-04-15 00:02:47] [Wave4] Seed trial 2/2: using hardcoded params.
[2026-04-15 00:02:47] [Wave4] Proposed params: {'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 00:02:49] [Wave4] Launching trial 2: {'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 00:02:49] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 80000 --steps-per-switch 6000 --learning-rate 0.001 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0002
[2026-04-15 01:21:38] [Wave4] Trial 2 finished in 4728.4s, rc=0
[2026-04-15 01:21:38] [Wave4] Parsed: combined=222.0731 mini_monaco=222.0731
[2026-04-15 01:21:38] [Champion] 🏆 NEW BEST! Trial 2: score=222.07 (mini_monaco=222.1) params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 01:21:38] [Wave4] ===== Trial 2 Summary =====
[2026-04-15 01:21:38]   GP data points : 2
[2026-04-15 01:21:38]   Wave4 Champion: trial=2 score=222.07 params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 01:21:38]   Top 5:
[2026-04-15 01:21:38]     score=222.07  params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 01:21:38]     score=45.67  params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 01:21:40]
[Wave4] ========== Trial 3/25 ==========
[2026-04-15 01:21:40] [Wave4] Only 2 results — using random proposal.
[2026-04-15 01:21:40] [Wave4] Proposed params: {'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 01:21:42] [Wave4] Launching trial 3: {'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 01:21:42] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 157743 --steps-per-switch 17499 --learning-rate 0.0006852550685205609 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0003
[2026-04-15 03:15:46] [Wave4] Trial 3 finished in 6843.7s, rc=0
[2026-04-15 03:15:46] [Wave4] Parsed: combined=1943.1038 mini_monaco=1943.1038
[2026-04-15 03:15:46] [Champion] 🏆 NEW BEST! Trial 3: score=1943.10 (mini_monaco=1943.1) params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 03:15:46] [Wave4] ===== Trial 3 Summary =====
[2026-04-15 03:15:46]   GP data points : 3
[2026-04-15 03:15:46]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 03:15:46]   Top 5:
[2026-04-15 03:15:46]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 03:15:46]     score=222.07  params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 03:15:46]     score=45.67  params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 03:15:48]
[Wave4] ========== Trial 4/25 ==========
[2026-04-15 03:15:48] [Wave4] GP UCB top-5 proposals:
[2026-04-15 03:15:48]   UCB=2.4560 mu=0.8788 σ=0.7886 params={'learning_rate': 0.0003250095463348546, 'steps_per_switch': 19054, 'total_timesteps': 197116}
[2026-04-15 03:15:48]   UCB=2.4518 mu=0.8393 σ=0.8062 params={'learning_rate': 0.00121703003154963, 'steps_per_switch': 16951, 'total_timesteps': 180865}
[2026-04-15 03:15:48]   UCB=2.4512 mu=0.7637 σ=0.8437 params={'learning_rate': 0.00036067077082995895, 'steps_per_switch': 16532, 'total_timesteps': 211219}
[2026-04-15 03:15:48]   UCB=2.4501 mu=0.9283 σ=0.7609 params={'learning_rate': 0.0005325315186424085, 'steps_per_switch': 18992, 'total_timesteps': 205595}
[2026-04-15 03:15:48]   UCB=2.4492 mu=0.9106 σ=0.7693 params={'learning_rate': 0.001163360064352729, 'steps_per_switch': 19652, 'total_timesteps': 151744}
[2026-04-15 03:15:48] [Wave4] Proposed params: {'learning_rate': 0.0003250095463348546, 'steps_per_switch': 19054, 'total_timesteps': 197116}
[2026-04-15 03:15:50] [Wave4] Launching trial 4: {'learning_rate': 0.0003250095463348546, 'steps_per_switch': 19054, 'total_timesteps': 197116}
[2026-04-15 03:15:50] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 197116 --steps-per-switch 19054 --learning-rate 0.0003250095463348546 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0004
[2026-04-15 05:15:51] [Wave4] Trial 4 TIMED OUT — killing runner.
[2026-04-15 05:15:51] [Wave4] Trial 4 finished in 7200.5s, rc=-9
[2026-04-15 05:15:51] [Wave4] Parsed: combined=None mini_monaco=None
[2026-04-15 05:15:51] [Wave4] ⚠️  No test score parsed — defaulting to 0.0
[2026-04-15 05:15:51] [Wave4] combined_test_score=0 — excluded from GP (crash/timeout).
[2026-04-15 05:15:51] [Wave4] ===== Trial 4 Summary =====
[2026-04-15 05:15:51]   GP data points : 3
[2026-04-15 05:15:51]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 05:15:51]   Top 5:
[2026-04-15 05:15:51]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 05:15:51]     score=222.07  params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 05:15:51]     score=45.67  params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 05:15:53]
[Wave4] ========== Trial 5/25 ==========
[2026-04-15 05:15:53] [Wave4] GP UCB top-5 proposals:
[2026-04-15 05:15:53]   UCB=2.4597 mu=0.8329 σ=0.8134 params={'learning_rate': 0.0003927960467617446, 'steps_per_switch': 19892, 'total_timesteps': 201785}
[2026-04-15 05:15:53]   UCB=2.4568 mu=0.8585 σ=0.7991 params={'learning_rate': 0.0011330710879806035, 'steps_per_switch': 18089, 'total_timesteps': 193054}
[2026-04-15 05:15:53]   UCB=2.4560 mu=0.7832 σ=0.8364 params={'learning_rate': 0.0006110661120319741, 'steps_per_switch': 17141, 'total_timesteps': 219583}
[2026-04-15 05:15:53]   UCB=2.4560 mu=0.8338 σ=0.8111 params={'learning_rate': 0.000602366907571214, 'steps_per_switch': 16527, 'total_timesteps': 215069}
[2026-04-15 05:15:53]   UCB=2.4522 mu=0.8120 σ=0.8201 params={'learning_rate': 0.0004035684210100053, 'steps_per_switch': 16067, 'total_timesteps': 208387}
[2026-04-15 05:15:53] [Wave4] Proposed params: {'learning_rate': 0.0003927960467617446, 'steps_per_switch': 19892, 'total_timesteps': 201785}
[2026-04-15 05:15:55] [Wave4] Launching trial 5: {'learning_rate': 0.0003927960467617446, 'steps_per_switch': 19892, 'total_timesteps': 201785}
[2026-04-15 05:15:55] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 201785 --steps-per-switch 19892 --learning-rate 0.0003927960467617446 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0005
[2026-04-15 07:15:57] [Wave4] Trial 5 TIMED OUT — killing runner.
[2026-04-15 07:15:57] [Wave4] Trial 5 finished in 7202.3s, rc=-9
[2026-04-15 07:15:57] [Wave4] Parsed: combined=None mini_monaco=None
[2026-04-15 07:15:57] [Wave4] ⚠️  No test score parsed — defaulting to 0.0
[2026-04-15 07:15:57] [Wave4] combined_test_score=0 — excluded from GP (crash/timeout).
[2026-04-15 07:15:57] [Wave4] ===== Trial 5 Summary =====
[2026-04-15 07:15:57]   GP data points : 3
[2026-04-15 07:15:57]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 07:15:57]   Top 5:
[2026-04-15 07:15:57]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 07:15:57]     score=222.07  params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 07:15:57]     score=45.67  params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 07:15:59] [Wave4] ✅ Git push complete after trial 5
[2026-04-15 07:16:01]
[Wave4] ========== Trial 6/25 ==========
[2026-04-15 07:16:01] [Wave4] GP UCB top-5 proposals:
[2026-04-15 07:16:01]   UCB=2.4565 mu=0.8712 σ=0.7926 params={'learning_rate': 0.0011062087200910864, 'steps_per_switch': 18318, 'total_timesteps': 194470}
[2026-04-15 07:16:01]   UCB=2.4485 mu=0.9338 σ=0.7573 params={'learning_rate': 0.0004307107164246544, 'steps_per_switch': 19141, 'total_timesteps': 199878}
[2026-04-15 07:16:01]   UCB=2.4478 mu=0.8840 σ=0.7819 params={'learning_rate': 0.00041215765557335777, 'steps_per_switch': 16229, 'total_timesteps': 203707}
[2026-04-15 07:16:01]   UCB=2.4468 mu=0.8283 σ=0.8092 params={'learning_rate': 0.0009928039664024839, 'steps_per_switch': 19629, 'total_timesteps': 113788}
[2026-04-15 07:16:01]   UCB=2.4456 mu=0.9298 σ=0.7579 params={'learning_rate': 0.0002412156295150517, 'steps_per_switch': 19116, 'total_timesteps': 179367}
[2026-04-15 07:16:01] [Wave4] Proposed params: {'learning_rate': 0.0011062087200910864, 'steps_per_switch': 18318, 'total_timesteps': 194470}
[2026-04-15 07:16:03] [Wave4] Launching trial 6: {'learning_rate': 0.0011062087200910864, 'steps_per_switch': 18318, 'total_timesteps': 194470}
[2026-04-15 07:16:03] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 194470 --steps-per-switch 18318 --learning-rate 0.0011062087200910864 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0006
[2026-04-15 09:06:26] =================================================================
[2026-04-15 09:06:26] [Wave4] Multi-Track Autoresearch — GP+UCB Generalization Search
[2026-04-15 09:06:26] [Wave4] Training tracks : generated_track, mountain_track  (no generated_road, no warm-start)
[2026-04-15 09:06:26] [Wave4] Test tracks     : mini_monaco only (zero-shot; warren removed — broken done condition)
[2026-04-15 09:06:26] [Wave4] Max trials      : 25 | kappa=2.0 | push every 5
[2026-04-15 09:06:26] [Wave4] Results file    : /home/paulh/projects/donkeycar-rl-autoresearch/agent/outerloop-results/autoresearch_results_phase4.jsonl
[2026-04-15 09:06:26] [Wave4] Champion dir    : /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-champion
[2026-04-15 09:06:26] [Wave4] Warm start      : NONE (training from scratch each trial)
[2026-04-15 09:06:26] =================================================================
[2026-04-15 09:06:26] [Wave4] Loaded 0 existing Phase 3 results.
[2026-04-15 09:06:26] [Wave4] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 09:06:26] [Wave4] Starting from trial 1.
[2026-04-15 09:06:26]
[Wave4] ========== Trial 1/25 ==========
[2026-04-15 09:06:26] [Wave4] Seed trial 1/2: using hardcoded params.
[2026-04-15 09:06:26] [Wave4] Proposed params: {'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 09:06:28] [Wave4] Launching trial 1: {'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 09:06:28] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 80000 --steps-per-switch 6000 --learning-rate 0.0003 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0001
[2026-04-15 09:17:28] =================================================================
[2026-04-15 09:17:28] [Wave4] Multi-Track Autoresearch — GP+UCB Generalization Search
[2026-04-15 09:17:28] [Wave4] Training tracks : generated_track, mountain_track  (no generated_road, no warm-start)
[2026-04-15 09:17:28] [Wave4] Test tracks     : mini_monaco only (zero-shot; warren removed — broken done condition)
[2026-04-15 09:17:28] [Wave4] Max trials      : 25 | kappa=2.0 | push every 5
[2026-04-15 09:17:28] [Wave4] Results file    : /home/paulh/projects/donkeycar-rl-autoresearch/agent/outerloop-results/autoresearch_results_phase4.jsonl
[2026-04-15 09:17:28] [Wave4] Champion dir    : /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-champion
[2026-04-15 09:17:28] [Wave4] Warm start      : NONE (training from scratch each trial)
[2026-04-15 09:17:28] =================================================================
[2026-04-15 09:17:28] [Wave4] Loaded 0 existing Phase 3 results.
[2026-04-15 09:17:28] [Wave4] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 09:17:28] [Wave4] Starting from trial 1.
[2026-04-15 09:17:28]
[Wave4] ========== Trial 1/25 ==========
[2026-04-15 09:17:28] [Wave4] Seed trial 1/2: using hardcoded params.
[2026-04-15 09:17:28] [Wave4] Proposed params: {'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 09:17:30] [Wave4] Launching trial 1: {'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 09:17:30] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 80000 --steps-per-switch 6000 --learning-rate 0.0003 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0001
[2026-04-15 10:33:27] [Wave4] Trial 1 finished in 4557.0s, rc=0
[2026-04-15 10:33:27] [Wave4] Parsed: combined=42.2964 mini_monaco=42.2964
[2026-04-15 10:33:27] [Wave4] ===== Trial 1 Summary =====
[2026-04-15 10:33:27]   GP data points : 1
[2026-04-15 10:33:27]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 10:33:27]   Top 5:
[2026-04-15 10:33:27]     score=42.30  params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 10:33:29]
[Wave4] ========== Trial 2/25 ==========
[2026-04-15 10:33:29] [Wave4] Seed trial 2/2: using hardcoded params.
[2026-04-15 10:33:29] [Wave4] Proposed params: {'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 10:33:31] [Wave4] Launching trial 2: {'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 10:33:31] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 80000 --steps-per-switch 6000 --learning-rate 0.001 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0002
[2026-04-15 11:39:05] [Wave4] Trial 2 finished in 3934.0s, rc=0
[2026-04-15 11:39:05] [Wave4] Parsed: combined=93.3894 mini_monaco=93.3894
[2026-04-15 11:39:05] [Wave4] ===== Trial 2 Summary =====
[2026-04-15 11:39:05]   GP data points : 2
[2026-04-15 11:39:05]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 11:39:05]   Top 5:
[2026-04-15 11:39:05]     score=93.39  params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 11:39:05]     score=42.30  params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 11:39:07]
[Wave4] ========== Trial 3/25 ==========
[2026-04-15 11:39:07] [Wave4] Only 2 results — using random proposal.
[2026-04-15 11:39:07] [Wave4] Proposed params: {'learning_rate': 0.0008162408849407889, 'steps_per_switch': 8441, 'total_timesteps': 140634}
[2026-04-15 11:39:09] [Wave4] Launching trial 3: {'learning_rate': 0.0008162408849407889, 'steps_per_switch': 8441, 'total_timesteps': 140634}
[2026-04-15 11:39:09] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 140634 --steps-per-switch 8441 --learning-rate 0.0008162408849407889 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0003
[2026-04-15 13:39:32] [Wave4] Trial 3 TIMED OUT — killing runner.
[2026-04-15 13:39:32] [Wave4] Trial 3 finished in 7222.7s, rc=-9
[2026-04-15 13:39:32] [Wave4] Parsed: combined=None mini_monaco=None
[2026-04-15 13:39:32] [Wave4] ⚠️  No test score parsed — defaulting to 0.0
[2026-04-15 13:39:32] [Wave4] combined_test_score=0 — excluded from GP (crash/timeout).
[2026-04-15 13:39:32] [Wave4] ===== Trial 3 Summary =====
[2026-04-15 13:39:32]   GP data points : 2
[2026-04-15 13:39:32]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 13:39:32]   Top 5:
[2026-04-15 13:39:32]     score=93.39  params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 13:39:32]     score=42.30  params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 13:39:34]
[Wave4] ========== Trial 4/25 ==========
[2026-04-15 13:39:34] [Wave4] Only 2 results — using random proposal.
[2026-04-15 13:39:34] [Wave4] Proposed params: {'learning_rate': 0.00020853884350577402, 'steps_per_switch': 19927, 'total_timesteps': 138928}
[2026-04-15 13:39:36] [Wave4] Launching trial 4: {'learning_rate': 0.00020853884350577402, 'steps_per_switch': 19927, 'total_timesteps': 138928}
[2026-04-15 13:39:36] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 138928 --steps-per-switch 19927 --learning-rate 0.00020853884350577402 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0004
[2026-04-15 15:39:37] [Wave4] Trial 4 TIMED OUT — killing runner.
[2026-04-15 15:39:37] [Wave4] Trial 4 finished in 7200.7s, rc=-9
[2026-04-15 15:39:37] [Wave4] Parsed: combined=None mini_monaco=None
[2026-04-15 15:39:37] [Wave4] ⚠️  No test score parsed — defaulting to 0.0
[2026-04-15 15:39:37] [Wave4] combined_test_score=0 — excluded from GP (crash/timeout).
[2026-04-15 15:39:37] [Wave4] ===== Trial 4 Summary =====
[2026-04-15 15:39:37]   GP data points : 2
[2026-04-15 15:39:37]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 15:39:37]   Top 5:
[2026-04-15 15:39:37]     score=93.39  params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 15:39:37]     score=42.30  params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 15:39:39]
[Wave4] ========== Trial 5/25 ==========
[2026-04-15 15:39:39] [Wave4] Only 2 results — using random proposal.
[2026-04-15 15:39:39] [Wave4] Proposed params: {'learning_rate': 0.0007517877668650138, 'steps_per_switch': 9368, 'total_timesteps': 104878}
[2026-04-15 15:39:41] [Wave4] Launching trial 5: {'learning_rate': 0.0007517877668650138, 'steps_per_switch': 9368, 'total_timesteps': 104878}
[2026-04-15 15:39:41] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 104878 --steps-per-switch 9368 --learning-rate 0.0007517877668650138 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0005
[2026-04-15 17:08:50] [Wave4] Trial 5 finished in 5348.8s, rc=0
[2026-04-15 17:08:50] [Wave4] Parsed: combined=31.73 mini_monaco=31.73
[2026-04-15 17:08:50] [Wave4] ===== Trial 5 Summary =====
[2026-04-15 17:08:50]   GP data points : 3
[2026-04-15 17:08:50]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 17:08:50]   Top 5:
[2026-04-15 17:08:50]     score=93.39  params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 17:08:50]     score=42.30  params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 17:08:50]     score=31.73  params={'learning_rate': 0.0007517877668650138, 'steps_per_switch': 9368, 'total_timesteps': 104878}
[2026-04-15 17:08:51] [Wave4] ✅ Git push complete after trial 5
[2026-04-15 17:08:53]
[Wave4] ========== Trial 6/25 ==========
[2026-04-15 17:08:53] [Wave4] GP UCB top-5 proposals:
[2026-04-15 17:08:53]   UCB=2.9352 mu=1.3419 σ=0.7966 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-15 17:08:53]   UCB=2.8442 mu=1.0827 σ=0.8808 params={'learning_rate': 0.0017357820530198068, 'steps_per_switch': 5009, 'total_timesteps': 87890}
[2026-04-15 17:08:53]   UCB=2.8258 mu=1.0947 σ=0.8655 params={'learning_rate': 0.001668571948240882, 'steps_per_switch': 4814, 'total_timesteps': 101589}
[2026-04-15 17:08:53]   UCB=2.8168 mu=0.9976 σ=0.9096 params={'learning_rate': 0.001788300003253932, 'steps_per_switch': 4215, 'total_timesteps': 80751}
[2026-04-15 17:08:53]   UCB=2.8147 mu=1.5860 σ=0.6144 params={'learning_rate': 0.0012975326127189415, 'steps_per_switch': 3961, 'total_timesteps': 100527}
[2026-04-15 17:08:53] [Wave4] Proposed params: {'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-15 17:08:55] [Wave4] Launching trial 6: {'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-15 17:08:55] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 79532 --steps-per-switch 5524 --learning-rate 0.0016223486895735558 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0006
[2026-04-15 18:19:30] [Wave4] Trial 6 finished in 4235.3s, rc=0
[2026-04-15 18:19:30] [Wave4] Parsed: combined=176.6721 mini_monaco=176.6721
[2026-04-15 18:19:31] [Wave4] ===== Trial 6 Summary =====
[2026-04-15 18:19:31]   GP data points : 4
[2026-04-15 18:19:31]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 18:19:31]   Top 5:
[2026-04-15 18:19:31]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-15 18:19:31]     score=93.39  params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 18:19:31]     score=42.30  params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 18:19:31]     score=31.73  params={'learning_rate': 0.0007517877668650138, 'steps_per_switch': 9368, 'total_timesteps': 104878}
[2026-04-15 18:19:33]
[Wave4] ========== Trial 7/25 ==========
[2026-04-15 18:19:33] [Wave4] GP UCB top-5 proposals:
[2026-04-15 18:19:33]   UCB=2.5520 mu=1.1319 σ=0.7101 params={'learning_rate': 0.001779556625962812, 'steps_per_switch': 4226, 'total_timesteps': 123651}
[2026-04-15 18:19:33]   UCB=2.5115 mu=0.7590 σ=0.8763 params={'learning_rate': 0.0017725367196782225, 'steps_per_switch': 3941, 'total_timesteps': 145375}
[2026-04-15 18:19:33]   UCB=2.4988 mu=1.1176 σ=0.6906 params={'learning_rate': 0.0018568268138302447, 'steps_per_switch': 6910, 'total_timesteps': 119000}
[2026-04-15 18:19:33]   UCB=2.4973 mu=0.8571 σ=0.8201 params={'learning_rate': 0.0019597767383017994, 'steps_per_switch': 9245, 'total_timesteps': 113699}
[2026-04-15 18:19:33]   UCB=2.4874 mu=1.2498 σ=0.6188 params={'learning_rate': 0.001739834862935009, 'steps_per_switch': 4764, 'total_timesteps': 117423}
[2026-04-15 18:19:33] [Wave4] Proposed params: {'learning_rate': 0.001779556625962812, 'steps_per_switch': 4226, 'total_timesteps': 123651}
[2026-04-15 18:19:35] [Wave4] Launching trial 7: {'learning_rate': 0.001779556625962812, 'steps_per_switch': 4226, 'total_timesteps': 123651}
[2026-04-15 18:19:35] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 123651 --steps-per-switch 4226 --learning-rate 0.001779556625962812 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0007
[2026-04-15 20:19:38] [Wave4] Trial 7 TIMED OUT — killing runner.
[2026-04-15 20:19:38] [Wave4] Trial 7 finished in 7203.4s, rc=-9
[2026-04-15 20:19:38] [Wave4] Parsed: combined=None mini_monaco=None
[2026-04-15 20:19:38] [Wave4] ⚠️  No test score parsed — defaulting to 0.0
[2026-04-15 20:19:38] [Wave4] combined_test_score=0 — excluded from GP (crash/timeout).
[2026-04-15 20:19:38] [Wave4] ===== Trial 7 Summary =====
[2026-04-15 20:19:38]   GP data points : 4
[2026-04-15 20:19:38]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 20:19:38]   Top 5:
[2026-04-15 20:19:38]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-15 20:19:38]     score=93.39  params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 20:19:38]     score=42.30  params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 20:19:38]     score=31.73  params={'learning_rate': 0.0007517877668650138, 'steps_per_switch': 9368, 'total_timesteps': 104878}
[2026-04-15 20:19:40]
[Wave4] ========== Trial 8/25 ==========
[2026-04-15 20:19:40] [Wave4] GP UCB top-5 proposals:
[2026-04-15 20:19:40]   UCB=2.5481 mu=1.2081 σ=0.6700 params={'learning_rate': 0.0019521225364821895, 'steps_per_switch': 4185, 'total_timesteps': 110996}
[2026-04-15 20:19:40]   UCB=2.5357 mu=0.8887 σ=0.8235 params={'learning_rate': 0.001901474046587741, 'steps_per_switch': 5568, 'total_timesteps': 136179}
[2026-04-15 20:19:40]   UCB=2.5188 mu=1.1408 σ=0.6890 params={'learning_rate': 0.0018359364507444984, 'steps_per_switch': 5746, 'total_timesteps': 122356}
[2026-04-15 20:19:40]   UCB=2.4994 mu=0.7140 σ=0.8927 params={'learning_rate': 0.0019730433529852004, 'steps_per_switch': 4578, 'total_timesteps': 144668}
[2026-04-15 20:19:40]   UCB=2.4934 mu=1.4169 σ=0.5382 params={'learning_rate': 0.0018644207617691767, 'steps_per_switch': 3071, 'total_timesteps': 75515}
[2026-04-15 20:19:40] [Wave4] Proposed params: {'learning_rate': 0.0019521225364821895, 'steps_per_switch': 4185, 'total_timesteps': 110996}
[2026-04-15 20:19:42] [Wave4] Launching trial 8: {'learning_rate': 0.0019521225364821895, 'steps_per_switch': 4185, 'total_timesteps': 110996}
[2026-04-15 20:19:42] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 110996 --steps-per-switch 4185 --learning-rate 0.0019521225364821895 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0008
[2026-04-15 22:19:42] [Wave4] Trial 8 TIMED OUT — killing runner.
[2026-04-15 22:19:42] [Wave4] Trial 8 finished in 7200.2s, rc=-9
[2026-04-15 22:19:42] [Wave4] Parsed: combined=None mini_monaco=None
[2026-04-15 22:19:42] [Wave4] ⚠️  No test score parsed — defaulting to 0.0
[2026-04-15 22:19:42] [Wave4] combined_test_score=0 — excluded from GP (crash/timeout).
[2026-04-15 22:19:42] [Wave4] ===== Trial 8 Summary =====
[2026-04-15 22:19:42]   GP data points : 4
[2026-04-15 22:19:42]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 22:19:42]   Top 5:
[2026-04-15 22:19:42]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-15 22:19:42]     score=93.39  params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 22:19:42]     score=42.30  params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 22:19:42]     score=31.73  params={'learning_rate': 0.0007517877668650138, 'steps_per_switch': 9368, 'total_timesteps': 104878}
[2026-04-15 22:19:44]
[Wave4] ========== Trial 9/25 ==========
[2026-04-15 22:19:44] [Wave4] GP UCB top-5 proposals:
[2026-04-15 22:19:44]   UCB=2.5432 mu=1.2960 σ=0.6236 params={'learning_rate': 0.001989433235306402, 'steps_per_switch': 3575, 'total_timesteps': 65457}
[2026-04-15 22:19:44]   UCB=2.5396 mu=1.1027 σ=0.7185 params={'learning_rate': 0.001979671928972082, 'steps_per_switch': 8884, 'total_timesteps': 73116}
[2026-04-15 22:19:44]   UCB=2.5350 mu=1.2006 σ=0.6672 params={'learning_rate': 0.0019228314496482347, 'steps_per_switch': 4960, 'total_timesteps': 115468}
[2026-04-15 22:19:44]   UCB=2.5256 mu=1.1804 σ=0.6726 params={'learning_rate': 0.0016231147459723914, 'steps_per_switch': 3259, 'total_timesteps': 115708}
[2026-04-15 22:19:44]   UCB=2.5201 mu=0.9834 σ=0.7683 params={'learning_rate': 0.0016588035055714473, 'steps_per_switch': 4314, 'total_timesteps': 131749}
[2026-04-15 22:19:44] [Wave4] Proposed params: {'learning_rate': 0.001989433235306402, 'steps_per_switch': 3575, 'total_timesteps': 65457}
[2026-04-15 22:19:46] [Wave4] Launching trial 9: {'learning_rate': 0.001989433235306402, 'steps_per_switch': 3575, 'total_timesteps': 65457}
[2026-04-15 22:19:46] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 65457 --steps-per-switch 3575 --learning-rate 0.001989433235306402 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0009
[2026-04-15 22:23:21] [Wave4] Trial 9 finished in 215.0s, rc=101
[2026-04-15 22:23:21] [Wave4] Parsed: combined=None mini_monaco=None
[2026-04-15 22:23:21] [Wave4] ⚠️  No test score parsed — defaulting to 0.0
[2026-04-15 22:23:21] [Wave4] combined_test_score=0 — excluded from GP (crash/timeout).
[2026-04-15 22:23:21] [Wave4] ===== Trial 9 Summary =====
[2026-04-15 22:23:21]   GP data points : 4
[2026-04-15 22:23:21]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 22:23:21]   Top 5:
[2026-04-15 22:23:21]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-15 22:23:21]     score=93.39  params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 22:23:21]     score=42.30  params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 22:23:21]     score=31.73  params={'learning_rate': 0.0007517877668650138, 'steps_per_switch': 9368, 'total_timesteps': 104878}
[2026-04-15 22:23:23]
[Wave4] ========== Trial 10/25 ==========
[2026-04-15 22:23:23] [Wave4] GP UCB top-5 proposals:
[2026-04-15 22:23:23]   UCB=2.5794 mu=0.9857 σ=0.7969 params={'learning_rate': 0.00192547022313727, 'steps_per_switch': 3237, 'total_timesteps': 124659}
[2026-04-15 22:23:23]   UCB=2.5191 mu=1.3579 σ=0.5806 params={'learning_rate': 0.0019414376395480834, 'steps_per_switch': 3402, 'total_timesteps': 69220}
[2026-04-15 22:23:23]   UCB=2.5097 mu=0.7258 σ=0.8919 params={'learning_rate': 0.0019051112417148412, 'steps_per_switch': 3607, 'total_timesteps': 144368}
[2026-04-15 22:23:23]   UCB=2.4894 mu=1.2599 σ=0.6148 params={'learning_rate': 0.001905194185221269, 'steps_per_switch': 5874, 'total_timesteps': 111439}
[2026-04-15 22:23:23]   UCB=2.4776 mu=1.1168 σ=0.6804 params={'learning_rate': 0.0017822503576577222, 'steps_per_switch': 6596, 'total_timesteps': 121681}
[2026-04-15 22:23:23] [Wave4] Proposed params: {'learning_rate': 0.00192547022313727, 'steps_per_switch': 3237, 'total_timesteps': 124659}
[2026-04-15 22:23:25] [Wave4] Launching trial 10: {'learning_rate': 0.00192547022313727, 'steps_per_switch': 3237, 'total_timesteps': 124659}
[2026-04-15 22:23:25] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 124659 --steps-per-switch 3237 --learning-rate 0.00192547022313727 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0010
[2026-04-15 22:26:54] =================================================================
[2026-04-15 22:26:54] [Wave4] Multi-Track Autoresearch — GP+UCB Generalization Search
[2026-04-15 22:26:54] [Wave4] Training tracks : generated_track, mountain_track  (no generated_road, no warm-start)
[2026-04-15 22:26:54] [Wave4] Test tracks     : mini_monaco only (zero-shot; warren removed — broken done condition)
[2026-04-15 22:26:54] [Wave4] Max trials      : 25 | kappa=2.0 | push every 5
[2026-04-15 22:26:54] [Wave4] Results file    : /home/paulh/projects/donkeycar-rl-autoresearch/agent/outerloop-results/autoresearch_results_phase4.jsonl
[2026-04-15 22:26:54] [Wave4] Champion dir    : /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-champion
[2026-04-15 22:26:54] [Wave4] Warm start      : NONE (training from scratch each trial)
[2026-04-15 22:26:54] =================================================================
[2026-04-15 22:26:54] [Wave4] Loaded 5 existing Phase 3 results.
[2026-04-15 22:26:54] [Wave4] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 22:26:54] [Wave4] Starting from trial 6.
[2026-04-15 22:26:54]
[Wave4] ========== Trial 6/25 ==========
[2026-04-15 22:26:54] [Wave4] GP UCB top-5 proposals:
[2026-04-15 22:26:54]   UCB=2.8029 mu=1.3217 σ=0.7406 params={'learning_rate': 0.0009434282949002715, 'steps_per_switch': 14966, 'total_timesteps': 83094}
[2026-04-15 22:26:54]   UCB=2.7637 mu=1.4556 σ=0.6540 params={'learning_rate': 0.001016649027182601, 'steps_per_switch': 14757, 'total_timesteps': 85809}
[2026-04-15 22:26:54]   UCB=2.7344 mu=1.1173 σ=0.8085 params={'learning_rate': 0.000525489856531106, 'steps_per_switch': 14503, 'total_timesteps': 81150}
[2026-04-15 22:26:54]   UCB=2.7210 mu=1.0163 σ=0.8523 params={'learning_rate': 0.000448503297396427, 'steps_per_switch': 14723, 'total_timesteps': 80477}
[2026-04-15 22:26:54]   UCB=2.6726 mu=0.9116 σ=0.8805 params={'learning_rate': 0.0011227428004033503, 'steps_per_switch': 14832, 'total_timesteps': 81442}
[2026-04-15 22:26:54] [Wave4] Proposed params: {'learning_rate': 0.0009434282949002715, 'steps_per_switch': 14966, 'total_timesteps': 83094}
[2026-04-15 22:26:56] [Wave4] Launching trial 6: {'learning_rate': 0.0009434282949002715, 'steps_per_switch': 14966, 'total_timesteps': 83094}
[2026-04-15 22:26:56] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 83094 --steps-per-switch 14966 --learning-rate 0.0009434282949002715 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0006
[2026-04-15 22:38:49] [Wave4] Trial 6 finished in 712.6s, rc=101
[2026-04-15 22:38:49] [Wave4] Parsed: combined=None mini_monaco=None
[2026-04-15 22:38:49] [Wave4] ⚠️  No test score parsed — defaulting to 0.0
[2026-04-15 22:38:49] [Wave4] combined_test_score=0 — excluded from GP (crash/timeout).
[2026-04-15 22:38:49] [Wave4] ===== Trial 6 Summary =====
[2026-04-15 22:38:49]   GP data points : 5
[2026-04-15 22:38:49]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 22:38:49]   Top 5:
[2026-04-15 22:38:49]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-15 22:38:49]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-15 22:38:49]     score=93.39  params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 22:38:49]     score=42.30  params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 22:38:49]     score=31.73  params={'learning_rate': 0.0007517877668650138, 'steps_per_switch': 9368, 'total_timesteps': 104878}
[2026-04-15 22:38:51]
[Wave4] ========== Trial 7/25 ==========
[2026-04-15 22:38:51] [Wave4] GP UCB top-5 proposals:
[2026-04-15 22:38:51]   UCB=2.8131 mu=1.4171 σ=0.6980 params={'learning_rate': 0.0004346662414818233, 'steps_per_switch': 14860, 'total_timesteps': 83818}
[2026-04-15 22:38:51]   UCB=2.7278 mu=1.2363 σ=0.7458 params={'learning_rate': 0.00020210801580883826, 'steps_per_switch': 14145, 'total_timesteps': 88060}
[2026-04-15 22:38:51]   UCB=2.7159 mu=1.0082 σ=0.8538 params={'learning_rate': 0.00012352374581522935, 'steps_per_switch': 14555, 'total_timesteps': 84897}
[2026-04-15 22:38:51]   UCB=2.6982 mu=1.4815 σ=0.6084 params={'learning_rate': 0.0010645009917759616, 'steps_per_switch': 14511, 'total_timesteps': 89107}
[2026-04-15 22:38:51]   UCB=2.6948 mu=1.1351 σ=0.7798 params={'learning_rate': 0.0012021332143154963, 'steps_per_switch': 14377, 'total_timesteps': 87121}
[2026-04-15 22:38:51] [Wave4] Proposed params: {'learning_rate': 0.0004346662414818233, 'steps_per_switch': 14860, 'total_timesteps': 83818}
[2026-04-15 22:38:53] [Wave4] Launching trial 7: {'learning_rate': 0.0004346662414818233, 'steps_per_switch': 14860, 'total_timesteps': 83818}
[2026-04-15 22:38:53] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 83818 --steps-per-switch 14860 --learning-rate 0.0004346662414818233 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0007
[2026-04-15 22:47:30] =================================================================
[2026-04-15 22:47:30] [Wave4] Multi-Track Autoresearch — GP+UCB Generalization Search
[2026-04-15 22:47:30] [Wave4] Training tracks : generated_track, mountain_track  (no generated_road, no warm-start)
[2026-04-15 22:47:30] [Wave4] Test tracks     : mini_monaco only (zero-shot; warren removed — broken done condition)
[2026-04-15 22:47:30] [Wave4] Max trials      : 25 | kappa=2.0 | push every 5
[2026-04-15 22:47:30] [Wave4] Results file    : /home/paulh/projects/donkeycar-rl-autoresearch/agent/outerloop-results/autoresearch_results_phase4.jsonl
[2026-04-15 22:47:30] [Wave4] Champion dir    : /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-champion
[2026-04-15 22:47:30] [Wave4] Warm start      : NONE (training from scratch each trial)
[2026-04-15 22:47:30] =================================================================
[2026-04-15 22:47:30] [Wave4] Loaded 6 existing Phase 3 results.
[2026-04-15 22:47:30] [Wave4] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 22:47:30] [Wave4] Starting from trial 7.
[2026-04-15 22:47:30]
[Wave4] ========== Trial 7/25 ==========
[2026-04-15 22:47:30] [Wave4] GP UCB top-5 proposals:
[2026-04-15 22:47:30]   UCB=3.2602 mu=2.0535 σ=0.6034 params={'learning_rate': 0.0003074676335173842, 'steps_per_switch': 14103, 'total_timesteps': 89493}
[2026-04-15 22:47:30]   UCB=3.2334 mu=1.9885 σ=0.6224 params={'learning_rate': 0.00030011572199819647, 'steps_per_switch': 13772, 'total_timesteps': 89937}
[2026-04-15 22:47:30]   UCB=3.1559 mu=2.0419 σ=0.5570 params={'learning_rate': 0.0003574517736244094, 'steps_per_switch': 14971, 'total_timesteps': 88076}
[2026-04-15 22:47:30]   UCB=2.5661 mu=1.3681 σ=0.5990 params={'learning_rate': 0.00039989051435482747, 'steps_per_switch': 12483, 'total_timesteps': 89578}
[2026-04-15 22:47:30]   UCB=2.5147 mu=0.7976 σ=0.8585 params={'learning_rate': 0.00013529496950197116, 'steps_per_switch': 12536, 'total_timesteps': 85905}
[2026-04-15 22:47:30] [Wave4] Proposed params: {'learning_rate': 0.0003074676335173842, 'steps_per_switch': 14103, 'total_timesteps': 89493}
[2026-04-15 22:47:32] [Wave4] Launching trial 7: {'learning_rate': 0.0003074676335173842, 'steps_per_switch': 14103, 'total_timesteps': 89493}
[2026-04-15 22:47:32] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 89493 --steps-per-switch 14103 --learning-rate 0.0003074676335173842 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0007
[2026-04-15 23:45:56] [Wave4] Trial 7 finished in 3503.6s, rc=0
[2026-04-15 23:45:56] [Wave4] Parsed: combined=80.9467 mini_monaco=80.9467
[2026-04-15 23:45:56] [Wave4] ===== Trial 7 Summary =====
[2026-04-15 23:45:56]   GP data points : 7
[2026-04-15 23:45:56]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 23:45:56]   Top 5:
[2026-04-15 23:45:56]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-15 23:45:56]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-15 23:45:56]     score=93.39  params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 23:45:56]     score=80.95  params={'learning_rate': 0.0003074676335173842, 'steps_per_switch': 14103, 'total_timesteps': 89493}
[2026-04-15 23:45:56]     score=42.30  params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 23:45:58]
[Wave4] ========== Trial 8/25 ==========
[2026-04-15 23:45:58] [Wave4] GP UCB top-5 proposals:
[2026-04-15 23:45:58]   UCB=2.8761 mu=2.3811 σ=0.2475 params={'learning_rate': 0.0008478421211922181, 'steps_per_switch': 14326, 'total_timesteps': 89798}
[2026-04-15 23:45:58]   UCB=2.7487 mu=1.8283 σ=0.4602 params={'learning_rate': 0.0010473338684621142, 'steps_per_switch': 13978, 'total_timesteps': 89106}
[2026-04-15 23:45:58]   UCB=2.7282 mu=1.1574 σ=0.7854 params={'learning_rate': 0.0013992142565926369, 'steps_per_switch': 14988, 'total_timesteps': 89592}
[2026-04-15 23:45:58]   UCB=2.1012 mu=1.4334 σ=0.3339 params={'learning_rate': 0.0008049497388184544, 'steps_per_switch': 13467, 'total_timesteps': 88845}
[2026-04-15 23:45:58]   UCB=2.0136 mu=0.1079 σ=0.9528 params={'learning_rate': 0.0017586582805663546, 'steps_per_switch': 14607, 'total_timesteps': 89073}
[2026-04-15 23:45:58] [Wave4] Proposed params: {'learning_rate': 0.0008478421211922181, 'steps_per_switch': 14326, 'total_timesteps': 89798}
[2026-04-15 23:46:00] [Wave4] Launching trial 8: {'learning_rate': 0.0008478421211922181, 'steps_per_switch': 14326, 'total_timesteps': 89798}
[2026-04-15 23:46:00] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 89798 --steps-per-switch 14326 --learning-rate 0.0008478421211922181 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0008
[2026-04-16 00:48:42] [Wave4] Trial 8 finished in 3762.1s, rc=0
[2026-04-16 00:48:42] [Wave4] Parsed: combined=116.2588 mini_monaco=116.2588
[2026-04-16 00:48:42] [Wave4] ===== Trial 8 Summary =====
[2026-04-16 00:48:42]   GP data points : 8
[2026-04-16 00:48:42]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 00:48:42]   Top 5:
[2026-04-16 00:48:42]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 00:48:42]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 00:48:42]     score=116.26  params={'learning_rate': 0.0008478421211922181, 'steps_per_switch': 14326, 'total_timesteps': 89798}
[2026-04-16 00:48:42]     score=93.39  params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-16 00:48:42]     score=80.95  params={'learning_rate': 0.0003074676335173842, 'steps_per_switch': 14103, 'total_timesteps': 89493}
[2026-04-16 00:48:44]
[Wave4] ========== Trial 9/25 ==========
[2026-04-16 00:48:44] [Wave4] GP UCB top-5 proposals:
[2026-04-16 00:48:44]   UCB=2.6052 mu=1.5807 σ=0.5122 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 00:48:44]   UCB=2.5103 mu=1.1029 σ=0.7037 params={'learning_rate': 0.0007143303795900681, 'steps_per_switch': 5097, 'total_timesteps': 88077}
[2026-04-16 00:48:44]   UCB=2.3990 mu=0.8072 σ=0.7959 params={'learning_rate': 0.00027589394711147663, 'steps_per_switch': 5263, 'total_timesteps': 89853}
[2026-04-16 00:48:44]   UCB=2.3635 mu=0.6759 σ=0.8438 params={'learning_rate': 0.0007616025453782968, 'steps_per_switch': 3774, 'total_timesteps': 88582}
[2026-04-16 00:48:44]   UCB=2.3265 mu=0.9173 σ=0.7046 params={'learning_rate': 0.0011226509565016517, 'steps_per_switch': 5495, 'total_timesteps': 88334}
[2026-04-16 00:48:44] [Wave4] Proposed params: {'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 00:48:46] [Wave4] Launching trial 9: {'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 00:48:46] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 89893 --steps-per-switch 6851 --learning-rate 0.0007252855740444645 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0009
[2026-04-16 02:11:13] [Wave4] Trial 9 finished in 4947.1s, rc=0
[2026-04-16 02:11:13] [Wave4] Parsed: combined=1435.0401 mini_monaco=1435.0401
[2026-04-16 02:11:13] [Wave4] ===== Trial 9 Summary =====
[2026-04-16 02:11:13]   GP data points : 9
[2026-04-16 02:11:13]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 02:11:13]   Top 5:
[2026-04-16 02:11:13]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 02:11:13]     score=1435.04  params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 02:11:13]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 02:11:13]     score=116.26  params={'learning_rate': 0.0008478421211922181, 'steps_per_switch': 14326, 'total_timesteps': 89798}
[2026-04-16 02:11:13]     score=93.39  params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-16 02:11:15]
[Wave4] ========== Trial 10/25 ==========
[2026-04-16 02:11:15] [Wave4] GP UCB top-5 proposals:
[2026-04-16 02:11:15]   UCB=2.0934 mu=0.9205 σ=0.5865 params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 02:11:15]   UCB=1.9984 mu=0.3915 σ=0.8034 params={'learning_rate': 0.00150594219964486, 'steps_per_switch': 5446, 'total_timesteps': 89972}
[2026-04-16 02:11:15]   UCB=1.9795 mu=-0.0207 σ=1.0001 params={'learning_rate': 0.00187204813121039, 'steps_per_switch': 11474, 'total_timesteps': 60629}
[2026-04-16 02:11:15]   UCB=1.9792 mu=-0.0197 σ=0.9994 params={'learning_rate': 0.0006516393606134399, 'steps_per_switch': 14385, 'total_timesteps': 60164}
[2026-04-16 02:11:15]   UCB=1.9789 mu=-0.0200 σ=0.9995 params={'learning_rate': 0.001964829385849987, 'steps_per_switch': 10248, 'total_timesteps': 61072}
[2026-04-16 02:11:15] [Wave4] Proposed params: {'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 02:11:17] [Wave4] Launching trial 10: {'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 02:11:17] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 89501 --steps-per-switch 4587 --learning-rate 0.0010578566658808422 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0010
[2026-04-16 03:31:41] [Wave4] Trial 10 finished in 4823.9s, rc=0
[2026-04-16 03:31:41] [Wave4] Parsed: combined=141.3547 mini_monaco=141.3547
[2026-04-16 03:31:41] [Wave4] ===== Trial 10 Summary =====
[2026-04-16 03:31:41]   GP data points : 10
[2026-04-16 03:31:41]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 03:31:41]   Top 5:
[2026-04-16 03:31:41]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 03:31:41]     score=1435.04  params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 03:31:41]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 03:31:41]     score=141.35  params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 03:31:41]     score=116.26  params={'learning_rate': 0.0008478421211922181, 'steps_per_switch': 14326, 'total_timesteps': 89798}
[2026-04-16 03:31:42] [Wave4] ✅ Git push complete after trial 10
[2026-04-16 03:31:44]
[Wave4] ========== Trial 11/25 ==========
[2026-04-16 03:31:44] [Wave4] GP UCB top-5 proposals:
[2026-04-16 03:31:44]   UCB=2.3190 mu=1.6261 σ=0.3465 params={'learning_rate': 0.0004453206607211525, 'steps_per_switch': 6345, 'total_timesteps': 89491}
[2026-04-16 03:31:44]   UCB=2.1595 mu=1.1793 σ=0.4901 params={'learning_rate': 0.00046964743340115834, 'steps_per_switch': 4776, 'total_timesteps': 89856}
[2026-04-16 03:31:44]   UCB=2.0941 mu=0.7922 σ=0.6510 params={'learning_rate': 0.0003410660041988607, 'steps_per_switch': 4166, 'total_timesteps': 89400}
[2026-04-16 03:31:44]   UCB=2.0525 mu=0.6500 σ=0.7012 params={'learning_rate': 0.00010060597179209699, 'steps_per_switch': 6926, 'total_timesteps': 89756}
[2026-04-16 03:31:44]   UCB=2.0368 mu=0.7629 σ=0.6370 params={'learning_rate': 0.00012830012767488515, 'steps_per_switch': 6077, 'total_timesteps': 88482}
[2026-04-16 03:31:44] [Wave4] Proposed params: {'learning_rate': 0.0004453206607211525, 'steps_per_switch': 6345, 'total_timesteps': 89491}
[2026-04-16 03:31:46] [Wave4] Launching trial 11: {'learning_rate': 0.0004453206607211525, 'steps_per_switch': 6345, 'total_timesteps': 89491}
[2026-04-16 03:31:46] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 89491 --steps-per-switch 6345 --learning-rate 0.0004453206607211525 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0011
[2026-04-16 04:47:26] [Wave4] Trial 11 finished in 4539.4s, rc=0
[2026-04-16 04:47:26] [Wave4] Parsed: combined=85.169 mini_monaco=85.169
[2026-04-16 04:47:26] [Wave4] ===== Trial 11 Summary =====
[2026-04-16 04:47:26]   GP data points : 11
[2026-04-16 04:47:26]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 04:47:26]   Top 5:
[2026-04-16 04:47:26]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 04:47:26]     score=1435.04  params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 04:47:26]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 04:47:26]     score=141.35  params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 04:47:26]     score=116.26  params={'learning_rate': 0.0008478421211922181, 'steps_per_switch': 14326, 'total_timesteps': 89798}
[2026-04-16 04:47:28]
[Wave4] ========== Trial 12/25 ==========
[2026-04-16 04:47:28] [Wave4] GP UCB top-5 proposals:
[2026-04-16 04:47:28]   UCB=2.2712 mu=2.0464 σ=0.1124 params={'learning_rate': 0.0008597472207162354, 'steps_per_switch': 6936, 'total_timesteps': 89806}
[2026-04-16 04:47:28]   UCB=2.1972 mu=0.3727 σ=0.9123 params={'learning_rate': 0.00011512144682169134, 'steps_per_switch': 7128, 'total_timesteps': 67694}
[2026-04-16 04:47:28]   UCB=2.1921 mu=0.3206 σ=0.9357 params={'learning_rate': 0.00011280326159121539, 'steps_per_switch': 3550, 'total_timesteps': 67913}
[2026-04-16 04:47:28]   UCB=2.1765 mu=1.4350 σ=0.3708 params={'learning_rate': 0.00048039814596614487, 'steps_per_switch': 14975, 'total_timesteps': 84957}
[2026-04-16 04:47:28]   UCB=2.1664 mu=0.3542 σ=0.9061 params={'learning_rate': 0.00022405059432424194, 'steps_per_switch': 5228, 'total_timesteps': 67529}
[2026-04-16 04:47:28] [Wave4] Proposed params: {'learning_rate': 0.0008597472207162354, 'steps_per_switch': 6936, 'total_timesteps': 89806}
[2026-04-16 04:47:30] [Wave4] Launching trial 12: {'learning_rate': 0.0008597472207162354, 'steps_per_switch': 6936, 'total_timesteps': 89806}
[2026-04-16 04:47:30] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 89806 --steps-per-switch 6936 --learning-rate 0.0008597472207162354 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0012
[2026-04-16 06:01:18] [Wave4] Trial 12 finished in 4428.4s, rc=0
[2026-04-16 06:01:18] [Wave4] Parsed: combined=131.9914 mini_monaco=131.9914
[2026-04-16 06:01:18] [Wave4] ===== Trial 12 Summary =====
[2026-04-16 06:01:18]   GP data points : 12
[2026-04-16 06:01:18]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 06:01:18]   Top 5:
[2026-04-16 06:01:18]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 06:01:18]     score=1435.04  params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 06:01:18]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 06:01:18]     score=141.35  params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 06:01:18]     score=131.99  params={'learning_rate': 0.0008597472207162354, 'steps_per_switch': 6936, 'total_timesteps': 89806}
[2026-04-16 06:01:20]
[Wave4] ========== Trial 13/25 ==========
[2026-04-16 06:01:20] [Wave4] GP UCB top-5 proposals:
[2026-04-16 06:01:20]   UCB=3.6412 mu=2.1560 σ=0.7426 params={'learning_rate': 0.0019124682919611855, 'steps_per_switch': 3574, 'total_timesteps': 73915}
[2026-04-16 06:01:20]   UCB=3.5658 mu=2.1987 σ=0.6835 params={'learning_rate': 0.0018623702999958462, 'steps_per_switch': 4072, 'total_timesteps': 73804}
[2026-04-16 06:01:20]   UCB=3.5079 mu=2.0033 σ=0.7523 params={'learning_rate': 0.001647357412665117, 'steps_per_switch': 3844, 'total_timesteps': 72144}
[2026-04-16 06:01:20]   UCB=3.4793 mu=2.2411 σ=0.6191 params={'learning_rate': 0.0017259517065612783, 'steps_per_switch': 3332, 'total_timesteps': 76276}
[2026-04-16 06:01:20]   UCB=3.4632 mu=1.7809 σ=0.8411 params={'learning_rate': 0.0018817829021946482, 'steps_per_switch': 4363, 'total_timesteps': 70258}
[2026-04-16 06:01:20] [Wave4] Proposed params: {'learning_rate': 0.0019124682919611855, 'steps_per_switch': 3574, 'total_timesteps': 73915}
[2026-04-16 06:01:22] [Wave4] Launching trial 13: {'learning_rate': 0.0019124682919611855, 'steps_per_switch': 3574, 'total_timesteps': 73915}
[2026-04-16 06:01:22] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 73915 --steps-per-switch 3574 --learning-rate 0.0019124682919611855 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0013
[2026-04-16 07:01:28] [Wave4] Trial 13 finished in 3605.7s, rc=0
[2026-04-16 07:01:28] [Wave4] Parsed: combined=86.8189 mini_monaco=86.8189
[2026-04-16 07:01:28] [Wave4] ===== Trial 13 Summary =====
[2026-04-16 07:01:28]   GP data points : 13
[2026-04-16 07:01:28]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 07:01:28]   Top 5:
[2026-04-16 07:01:28]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 07:01:28]     score=1435.04  params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 07:01:28]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 07:01:28]     score=141.35  params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 07:01:28]     score=131.99  params={'learning_rate': 0.0008597472207162354, 'steps_per_switch': 6936, 'total_timesteps': 89806}
[2026-04-16 07:01:30]
[Wave4] ========== Trial 14/25 ==========
[2026-04-16 07:01:30] [Wave4] GP UCB top-5 proposals:
[2026-04-16 07:01:30]   UCB=3.4396 mu=1.7713 σ=0.8341 params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 07:01:30]   UCB=3.4124 mu=1.7531 σ=0.8296 params={'learning_rate': 0.00020849841344963834, 'steps_per_switch': 6509, 'total_timesteps': 69502}
[2026-04-16 07:01:30]   UCB=3.3878 mu=1.7622 σ=0.8128 params={'learning_rate': 0.000483907016003038, 'steps_per_switch': 7464, 'total_timesteps': 70587}
[2026-04-16 07:01:30]   UCB=3.3445 mu=1.9911 σ=0.6767 params={'learning_rate': 0.00041395529913323, 'steps_per_switch': 7143, 'total_timesteps': 72719}
[2026-04-16 07:01:30]   UCB=3.3380 mu=1.6221 σ=0.8579 params={'learning_rate': 0.0004908056727355321, 'steps_per_switch': 4967, 'total_timesteps': 69146}
[2026-04-16 07:01:30] [Wave4] Proposed params: {'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 07:01:32] [Wave4] Launching trial 14: {'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 07:01:32] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 69359 --steps-per-switch 5448 --learning-rate 0.00033896311195395496 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0014
[2026-04-16 07:54:33] [Wave4] Trial 14 finished in 3181.0s, rc=0
[2026-04-16 07:54:33] [Wave4] Parsed: combined=1573.0703 mini_monaco=1573.0703
[2026-04-16 07:54:33] [Wave4] ===== Trial 14 Summary =====
[2026-04-16 07:54:33]   GP data points : 14
[2026-04-16 07:54:33]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 07:54:33]   Top 5:
[2026-04-16 07:54:33]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 07:54:33]     score=1573.07  params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 07:54:33]     score=1435.04  params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 07:54:33]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 07:54:33]     score=141.35  params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 07:54:35]
[Wave4] ========== Trial 15/25 ==========
[2026-04-16 07:54:35] [Wave4] GP UCB top-5 proposals:
[2026-04-16 07:54:35]   UCB=2.8282 mu=1.6797 σ=0.5742 params={'learning_rate': 0.0003994526493374, 'steps_per_switch': 7747, 'total_timesteps': 70559}
[2026-04-16 07:54:35]   UCB=2.7924 mu=1.4316 σ=0.6804 params={'learning_rate': 0.0005427837302668084, 'steps_per_switch': 8198, 'total_timesteps': 71093}
[2026-04-16 07:54:35]   UCB=2.7732 mu=1.5025 σ=0.6353 params={'learning_rate': 0.000186376586240458, 'steps_per_switch': 7967, 'total_timesteps': 71123}
[2026-04-16 07:54:35]   UCB=2.7719 mu=1.2938 σ=0.7390 params={'learning_rate': 0.0002182534990992253, 'steps_per_switch': 8697, 'total_timesteps': 71541}
[2026-04-16 07:54:35]   UCB=2.7193 mu=1.1369 σ=0.7912 params={'learning_rate': 0.00011645020756444703, 'steps_per_switch': 8822, 'total_timesteps': 69675}
[2026-04-16 07:54:35] [Wave4] Proposed params: {'learning_rate': 0.0003994526493374, 'steps_per_switch': 7747, 'total_timesteps': 70559}
[2026-04-16 07:54:37] [Wave4] Launching trial 15: {'learning_rate': 0.0003994526493374, 'steps_per_switch': 7747, 'total_timesteps': 70559}
[2026-04-16 07:54:37] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 70559 --steps-per-switch 7747 --learning-rate 0.0003994526493374 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0015
[2026-04-16 08:43:17] [Wave4] Trial 15 finished in 2920.0s, rc=0
[2026-04-16 08:43:17] [Wave4] Parsed: combined=111.2516 mini_monaco=111.2516
[2026-04-16 08:43:17] [Wave4] ===== Trial 15 Summary =====
[2026-04-16 08:43:17]   GP data points : 15
[2026-04-16 08:43:17]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 08:43:17]   Top 5:
[2026-04-16 08:43:17]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 08:43:17]     score=1573.07  params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 08:43:17]     score=1435.04  params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 08:43:17]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 08:43:17]     score=141.35  params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 08:43:19] [Wave4] ✅ Git push complete after trial 15
[2026-04-16 08:43:21]
[Wave4] ========== Trial 16/25 ==========
[2026-04-16 08:43:21] [Wave4] GP UCB top-5 proposals:
[2026-04-16 08:43:21]   UCB=3.6132 mu=2.7065 σ=0.4533 params={'learning_rate': 0.00040295487555843073, 'steps_per_switch': 3490, 'total_timesteps': 72578}
[2026-04-16 08:43:21]   UCB=3.6040 mu=2.6026 σ=0.5007 params={'learning_rate': 0.0004730969154346568, 'steps_per_switch': 3501, 'total_timesteps': 73673}
[2026-04-16 08:43:21]   UCB=3.5949 mu=2.3139 σ=0.6405 params={'learning_rate': 0.0007493214757809492, 'steps_per_switch': 3197, 'total_timesteps': 74747}
[2026-04-16 08:43:21]   UCB=3.3717 mu=2.1460 σ=0.6128 params={'learning_rate': 0.0008356081352627151, 'steps_per_switch': 3341, 'total_timesteps': 76184}
[2026-04-16 08:43:21]   UCB=3.2598 mu=1.9831 σ=0.6384 params={'learning_rate': 0.00015673628608872705, 'steps_per_switch': 3392, 'total_timesteps': 63890}
[2026-04-16 08:43:21] [Wave4] Proposed params: {'learning_rate': 0.00040295487555843073, 'steps_per_switch': 3490, 'total_timesteps': 72578}
[2026-04-16 08:43:23] [Wave4] Launching trial 16: {'learning_rate': 0.00040295487555843073, 'steps_per_switch': 3490, 'total_timesteps': 72578}
[2026-04-16 08:43:23] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 72578 --steps-per-switch 3490 --learning-rate 0.00040295487555843073 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0016
[2026-04-16 09:52:25] [Wave4] Trial 16 finished in 4142.2s, rc=0
[2026-04-16 09:52:25] [Wave4] Parsed: combined=59.9996 mini_monaco=59.9996
[2026-04-16 09:52:25] [Wave4] ===== Trial 16 Summary =====
[2026-04-16 09:52:25]   GP data points : 16
[2026-04-16 09:52:25]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 09:52:25]   Top 5:
[2026-04-16 09:52:25]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 09:52:25]     score=1573.07  params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 09:52:25]     score=1435.04  params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 09:52:25]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 09:52:25]     score=141.35  params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 09:52:27]
[Wave4] ========== Trial 17/25 ==========
[2026-04-16 09:52:27] [Wave4] GP UCB top-5 proposals:
[2026-04-16 09:52:27]   UCB=4.4175 mu=3.0206 σ=0.6985 params={'learning_rate': 0.00011371306784012085, 'steps_per_switch': 4558, 'total_timesteps': 61409}
[2026-04-16 09:52:27]   UCB=3.8442 mu=2.8720 σ=0.4861 params={'learning_rate': 0.00015521967796620274, 'steps_per_switch': 5888, 'total_timesteps': 64258}
[2026-04-16 09:52:27]   UCB=3.8027 mu=2.6781 σ=0.5623 params={'learning_rate': 0.0001942087695905749, 'steps_per_switch': 3887, 'total_timesteps': 63994}
[2026-04-16 09:52:27]   UCB=3.7728 mu=2.8641 σ=0.4544 params={'learning_rate': 0.00019296340532126325, 'steps_per_switch': 4356, 'total_timesteps': 65008}
[2026-04-16 09:52:27]   UCB=3.7728 mu=2.4744 σ=0.6492 params={'learning_rate': 0.0004244302343635027, 'steps_per_switch': 4871, 'total_timesteps': 61656}
[2026-04-16 09:52:27] [Wave4] Proposed params: {'learning_rate': 0.00011371306784012085, 'steps_per_switch': 4558, 'total_timesteps': 61409}
[2026-04-16 09:52:29] [Wave4] Launching trial 17: {'learning_rate': 0.00011371306784012085, 'steps_per_switch': 4558, 'total_timesteps': 61409}
[2026-04-16 09:52:29] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 61409 --steps-per-switch 4558 --learning-rate 0.00011371306784012085 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0017
[2026-04-16 10:54:36] [Wave4] Trial 17 finished in 3727.6s, rc=0
[2026-04-16 10:54:36] [Wave4] Parsed: combined=106.1213 mini_monaco=106.1213
[2026-04-16 10:54:36] [Wave4] ===== Trial 17 Summary =====
[2026-04-16 10:54:36]   GP data points : 17
[2026-04-16 10:54:36]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 10:54:36]   Top 5:
[2026-04-16 10:54:36]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 10:54:36]     score=1573.07  params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 10:54:36]     score=1435.04  params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 10:54:36]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 10:54:36]     score=141.35  params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 10:54:38]
[Wave4] ========== Trial 18/25 ==========
[2026-04-16 10:54:38] [Wave4] GP UCB top-5 proposals:
[2026-04-16 10:54:38]   UCB=2.4022 mu=1.7625 σ=0.3199 params={'learning_rate': 0.00014739138140971148, 'steps_per_switch': 5692, 'total_timesteps': 73312}
[2026-04-16 10:54:38]   UCB=2.1766 mu=1.8937 σ=0.1414 params={'learning_rate': 0.0003092758146459376, 'steps_per_switch': 5733, 'total_timesteps': 72147}
[2026-04-16 10:54:38]   UCB=2.0283 mu=1.8693 σ=0.0795 params={'learning_rate': 0.00038035350750885823, 'steps_per_switch': 5423, 'total_timesteps': 70634}
[2026-04-16 10:54:38]   UCB=2.0025 mu=0.0985 σ=0.9520 params={'learning_rate': 0.0019422789362836088, 'steps_per_switch': 9820, 'total_timesteps': 71591}
[2026-04-16 10:54:38]   UCB=1.9979 mu=1.7539 σ=0.1220 params={'learning_rate': 0.0002958405723304147, 'steps_per_switch': 5704, 'total_timesteps': 67124}
[2026-04-16 10:54:38] [Wave4] Proposed params: {'learning_rate': 0.00014739138140971148, 'steps_per_switch': 5692, 'total_timesteps': 73312}
[2026-04-16 10:54:40] [Wave4] Launching trial 18: {'learning_rate': 0.00014739138140971148, 'steps_per_switch': 5692, 'total_timesteps': 73312}
[2026-04-16 10:54:40] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 73312 --steps-per-switch 5692 --learning-rate 0.00014739138140971148 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0018
[2026-04-16 11:58:07] [Wave4] Trial 18 finished in 3806.9s, rc=0
[2026-04-16 11:58:07] [Wave4] Parsed: combined=115.6774 mini_monaco=115.6774
[2026-04-16 11:58:07] [Wave4] ===== Trial 18 Summary =====
[2026-04-16 11:58:07]   GP data points : 18
[2026-04-16 11:58:07]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 11:58:07]   Top 5:
[2026-04-16 11:58:07]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 11:58:07]     score=1573.07  params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 11:58:07]     score=1435.04  params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 11:58:07]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 11:58:07]     score=141.35  params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 11:58:09]
[Wave4] ========== Trial 19/25 ==========
[2026-04-16 11:58:09] [Wave4] GP UCB top-5 proposals:
[2026-04-16 11:58:09]   UCB=4.2331 mu=3.1539 σ=0.5396 params={'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 11:58:09]   UCB=3.9367 mu=2.3643 σ=0.7862 params={'learning_rate': 0.000973356801504352, 'steps_per_switch': 4731, 'total_timesteps': 64636}
[2026-04-16 11:58:09]   UCB=3.8093 mu=2.3419 σ=0.7337 params={'learning_rate': 0.0009048073949917447, 'steps_per_switch': 4137, 'total_timesteps': 65312}
[2026-04-16 11:58:09]   UCB=3.7049 mu=2.1579 σ=0.7735 params={'learning_rate': 0.0008513107736880863, 'steps_per_switch': 6388, 'total_timesteps': 62234}
[2026-04-16 11:58:09]   UCB=3.6513 mu=2.2696 σ=0.6908 params={'learning_rate': 0.0006744614059620855, 'steps_per_switch': 4569, 'total_timesteps': 60715}
[2026-04-16 11:58:09] [Wave4] Proposed params: {'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 11:58:11] [Wave4] Launching trial 19: {'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 11:58:11] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 64179 --steps-per-switch 4747 --learning-rate 0.0006672844816013197 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0019
[2026-04-16 13:10:23] [Wave4] Trial 19 finished in 4331.6s, rc=0
[2026-04-16 13:10:23] [Wave4] Parsed: combined=230.9827 mini_monaco=230.9827
[2026-04-16 13:10:23] [Wave4] ===== Trial 19 Summary =====
[2026-04-16 13:10:23]   GP data points : 19
[2026-04-16 13:10:23]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 13:10:23]   Top 5:
[2026-04-16 13:10:23]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 13:10:23]     score=1573.07  params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 13:10:23]     score=1435.04  params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 13:10:23]     score=230.98  params={'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 13:10:23]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 13:10:25]
[Wave4] ========== Trial 20/25 ==========
[2026-04-16 13:10:25] [Wave4] GP UCB top-5 proposals:
[2026-04-16 13:10:25]   UCB=2.2341 mu=1.6242 σ=0.3049 params={'learning_rate': 0.0007104769870597334, 'steps_per_switch': 5822, 'total_timesteps': 73160}
[2026-04-16 13:10:25]   UCB=2.2038 mu=1.8331 σ=0.1853 params={'learning_rate': 0.000688892492740156, 'steps_per_switch': 5559, 'total_timesteps': 78911}
[2026-04-16 13:10:25]   UCB=2.2000 mu=1.6830 σ=0.2585 params={'learning_rate': 0.0006312216002702526, 'steps_per_switch': 7125, 'total_timesteps': 80150}
[2026-04-16 13:10:25]   UCB=2.1950 mu=1.5842 σ=0.3054 params={'learning_rate': 0.0009544613332763077, 'steps_per_switch': 3097, 'total_timesteps': 89206}
[2026-04-16 13:10:25]   UCB=2.0924 mu=1.5207 σ=0.2859 params={'learning_rate': 0.0006628882948205277, 'steps_per_switch': 6497, 'total_timesteps': 73747}
[2026-04-16 13:10:25] [Wave4] Proposed params: {'learning_rate': 0.0007104769870597334, 'steps_per_switch': 5822, 'total_timesteps': 73160}
[2026-04-16 13:10:27] [Wave4] Launching trial 20: {'learning_rate': 0.0007104769870597334, 'steps_per_switch': 5822, 'total_timesteps': 73160}
[2026-04-16 13:10:27] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 73160 --steps-per-switch 5822 --learning-rate 0.0007104769870597334 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0020
[2026-04-16 14:10:06] [Wave4] Trial 20 finished in 3578.9s, rc=0
[2026-04-16 14:10:06] [Wave4] Parsed: combined=20.9587 mini_monaco=20.9587
[2026-04-16 14:10:06] [Wave4] ===== Trial 20 Summary =====
[2026-04-16 14:10:06]   GP data points : 20
[2026-04-16 14:10:06]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 14:10:06]   Top 5:
[2026-04-16 14:10:06]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 14:10:06]     score=1573.07  params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 14:10:06]     score=1435.04  params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 14:10:06]     score=230.98  params={'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 14:10:06]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 14:10:08] [Wave4] ✅ Git push complete after trial 20
[2026-04-16 14:10:10]
[Wave4] ========== Trial 21/25 ==========
[2026-04-16 14:10:10] [Wave4] GP UCB top-5 proposals:
[2026-04-16 14:10:10]   UCB=2.6140 mu=2.2621 σ=0.1760 params={'learning_rate': 0.0006240263317913531, 'steps_per_switch': 6248, 'total_timesteps': 81430}
[2026-04-16 14:10:10]   UCB=2.4442 mu=2.0635 σ=0.1903 params={'learning_rate': 0.0008059149094586216, 'steps_per_switch': 5668, 'total_timesteps': 84976}
[2026-04-16 14:10:10]   UCB=2.4147 mu=2.0379 σ=0.1884 params={'learning_rate': 0.0005106809671527766, 'steps_per_switch': 7044, 'total_timesteps': 86259}
[2026-04-16 14:10:10]   UCB=2.2869 mu=1.1164 σ=0.5852 params={'learning_rate': 0.001932234805776913, 'steps_per_switch': 3270, 'total_timesteps': 81124}
[2026-04-16 14:10:10]   UCB=2.1881 mu=1.7486 σ=0.2198 params={'learning_rate': 0.0006026876192105071, 'steps_per_switch': 14736, 'total_timesteps': 86970}
[2026-04-16 14:10:10] [Wave4] Proposed params: {'learning_rate': 0.0006240263317913531, 'steps_per_switch': 6248, 'total_timesteps': 81430}
[2026-04-16 14:10:12] [Wave4] Launching trial 21: {'learning_rate': 0.0006240263317913531, 'steps_per_switch': 6248, 'total_timesteps': 81430}
[2026-04-16 14:10:12] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 81430 --steps-per-switch 6248 --learning-rate 0.0006240263317913531 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0021
[2026-04-16 15:41:17] [Wave4] Trial 21 finished in 5465.6s, rc=0
[2026-04-16 15:41:17] [Wave4] Parsed: combined=86.1012 mini_monaco=86.1012
[2026-04-16 15:41:17] [Wave4] ===== Trial 21 Summary =====
[2026-04-16 15:41:17]   GP data points : 21
[2026-04-16 15:41:17]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 15:41:17]   Top 5:
[2026-04-16 15:41:17]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 15:41:17]     score=1573.07  params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 15:41:17]     score=1435.04  params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 15:41:17]     score=230.98  params={'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 15:41:17]     score=176.67  params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 15:41:19]
[Wave4] ========== Trial 22/25 ==========
[2026-04-16 15:41:19] [Wave4] GP UCB top-5 proposals:
[2026-04-16 15:41:19]   UCB=2.9276 mu=1.9840 σ=0.4718 params={'learning_rate': 0.0010680884757571496, 'steps_per_switch': 3269, 'total_timesteps': 81912}
[2026-04-16 15:41:19]   UCB=2.8979 mu=1.8081 σ=0.5449 params={'learning_rate': 0.0014475574559088072, 'steps_per_switch': 3057, 'total_timesteps': 77220}
[2026-04-16 15:41:19]   UCB=2.7394 mu=1.7758 σ=0.4818 params={'learning_rate': 0.0010706637693188694, 'steps_per_switch': 3430, 'total_timesteps': 79924}
[2026-04-16 15:41:19]   UCB=2.7361 mu=1.9188 σ=0.4086 params={'learning_rate': 0.0003137950754813831, 'steps_per_switch': 7355, 'total_timesteps': 63965}
[2026-04-16 15:41:19]   UCB=2.6672 mu=1.6715 σ=0.4978 params={'learning_rate': 0.0013764156246110833, 'steps_per_switch': 3205, 'total_timesteps': 83531}
[2026-04-16 15:41:19] [Wave4] Proposed params: {'learning_rate': 0.0010680884757571496, 'steps_per_switch': 3269, 'total_timesteps': 81912}
[2026-04-16 15:41:21] [Wave4] Launching trial 22: {'learning_rate': 0.0010680884757571496, 'steps_per_switch': 3269, 'total_timesteps': 81912}
[2026-04-16 15:41:21] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 81912 --steps-per-switch 3269 --learning-rate 0.0010680884757571496 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0022
[2026-04-16 17:13:22] [Wave4] Trial 22 finished in 5520.7s, rc=0
[2026-04-16 17:13:22] [Wave4] Parsed: combined=193.2341 mini_monaco=193.2341
[2026-04-16 17:13:22] [Wave4] ===== Trial 22 Summary =====
[2026-04-16 17:13:22]   GP data points : 22
[2026-04-16 17:13:22]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 17:13:22]   Top 5:
[2026-04-16 17:13:22]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 17:13:22]     score=1573.07  params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 17:13:22]     score=1435.04  params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 17:13:22]     score=230.98  params={'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 17:13:22]     score=193.23  params={'learning_rate': 0.0010680884757571496, 'steps_per_switch': 3269, 'total_timesteps': 81912}
[2026-04-16 17:13:24]
[Wave4] ========== Trial 23/25 ==========
[2026-04-16 17:13:24] [Wave4] GP UCB top-5 proposals:
[2026-04-16 17:13:24]   UCB=2.5284 mu=2.2838 σ=0.1223 params={'learning_rate': 0.0003194840316570061, 'steps_per_switch': 4507, 'total_timesteps': 66965}
[2026-04-16 17:13:24]   UCB=2.4192 mu=1.9135 σ=0.2529 params={'learning_rate': 0.0005202514734965685, 'steps_per_switch': 6358, 'total_timesteps': 64937}
[2026-04-16 17:13:24]   UCB=2.3057 mu=2.0781 σ=0.1138 params={'learning_rate': 0.00028748663178259926, 'steps_per_switch': 4629, 'total_timesteps': 64690}
[2026-04-16 17:13:24]   UCB=2.2310 mu=1.9780 σ=0.1265 params={'learning_rate': 0.0005977134566763185, 'steps_per_switch': 7805, 'total_timesteps': 89342}
[2026-04-16 17:13:24]   UCB=2.1094 mu=1.5506 σ=0.2794 params={'learning_rate': 0.0001403862307566014, 'steps_per_switch': 6141, 'total_timesteps': 64341}
[2026-04-16 17:13:24] [Wave4] Proposed params: {'learning_rate': 0.0003194840316570061, 'steps_per_switch': 4507, 'total_timesteps': 66965}
[2026-04-16 17:13:26] [Wave4] Launching trial 23: {'learning_rate': 0.0003194840316570061, 'steps_per_switch': 4507, 'total_timesteps': 66965}
[2026-04-16 17:13:26] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 66965 --steps-per-switch 4507 --learning-rate 0.0003194840316570061 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0023
[2026-04-16 18:26:11] [Wave4] Trial 23 finished in 4365.0s, rc=0
[2026-04-16 18:26:11] [Wave4] Parsed: combined=150.773 mini_monaco=150.773
[2026-04-16 18:26:11] [Wave4] ===== Trial 23 Summary =====
[2026-04-16 18:26:11]   GP data points : 23
[2026-04-16 18:26:11]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 18:26:11]   Top 5:
[2026-04-16 18:26:11]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 18:26:11]     score=1573.07  params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 18:26:11]     score=1435.04  params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 18:26:11]     score=230.98  params={'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 18:26:11]     score=193.23  params={'learning_rate': 0.0010680884757571496, 'steps_per_switch': 3269, 'total_timesteps': 81912}
[2026-04-16 18:26:13]
[Wave4] ========== Trial 24/25 ==========
[2026-04-16 18:26:13] [Wave4] GP UCB top-5 proposals:
[2026-04-16 18:26:13]   UCB=7.5247 mu=6.2951 σ=0.6148 params={'learning_rate': 0.0003596012644521025, 'steps_per_switch': 8013, 'total_timesteps': 60054}
[2026-04-16 18:26:13]   UCB=6.0685 mu=4.6817 σ=0.6934 params={'learning_rate': 0.0001643952288780874, 'steps_per_switch': 9023, 'total_timesteps': 61369}
[2026-04-16 18:26:13]   UCB=5.8942 mu=5.3342 σ=0.2800 params={'learning_rate': 0.00036103199241236257, 'steps_per_switch': 6908, 'total_timesteps': 64716}
[2026-04-16 18:26:13]   UCB=5.6180 mu=4.5299 σ=0.5441 params={'learning_rate': 0.0006214434613010804, 'steps_per_switch': 7433, 'total_timesteps': 61101}
[2026-04-16 18:26:13]   UCB=5.2906 mu=4.8505 σ=0.2200 params={'learning_rate': 0.000363303441736895, 'steps_per_switch': 6643, 'total_timesteps': 65509}
[2026-04-16 18:26:13] [Wave4] Proposed params: {'learning_rate': 0.0003596012644521025, 'steps_per_switch': 8013, 'total_timesteps': 60054}
[2026-04-16 18:26:15] [Wave4] Launching trial 24: {'learning_rate': 0.0003596012644521025, 'steps_per_switch': 8013, 'total_timesteps': 60054}
[2026-04-16 18:26:15] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 60054 --steps-per-switch 8013 --learning-rate 0.0003596012644521025 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0024
[2026-04-16 19:09:39] [Wave4] Trial 24 finished in 2604.0s, rc=0
[2026-04-16 19:09:39] [Wave4] Parsed: combined=37.824 mini_monaco=37.824
[2026-04-16 19:09:39] [Wave4] ===== Trial 24 Summary =====
[2026-04-16 19:09:39]   GP data points : 24
[2026-04-16 19:09:39]   Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 19:09:39]   Top 5:
[2026-04-16 19:09:39]     score=1943.10  params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 19:09:39]     score=1573.07  params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 19:09:39]     score=1435.04  params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 19:09:39]     score=230.98  params={'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 19:09:39]     score=193.23  params={'learning_rate': 0.0010680884757571496, 'steps_per_switch': 3269, 'total_timesteps': 81912}
[2026-04-16 19:09:41]
[Wave4] ========== Trial 25/25 ==========
[2026-04-16 19:09:41] [Wave4] GP UCB top-5 proposals:
[2026-04-16 19:09:41]   UCB=2.8514 mu=2.5523 σ=0.1495 params={'learning_rate': 0.0003128257557719074, 'steps_per_switch': 6836, 'total_timesteps': 62683}
[2026-04-16 19:09:41]   UCB=2.6831 mu=2.3606 σ=0.1612 params={'learning_rate': 0.0004069124594262667, 'steps_per_switch': 6035, 'total_timesteps': 62228}
[2026-04-16 19:09:41]   UCB=2.5561 mu=2.1065 σ=0.2248 params={'learning_rate': 0.00011678403675177319, 'steps_per_switch': 6295, 'total_timesteps': 60168}
[2026-04-16 19:09:41]   UCB=2.4849 mu=2.1243 σ=0.1803 params={'learning_rate': 0.0002227611562953955, 'steps_per_switch': 6484, 'total_timesteps': 60292}
[2026-04-16 19:09:41]   UCB=2.3964 mu=2.0203 σ=0.1880 params={'learning_rate': 0.0004890619502018291, 'steps_per_switch': 6956, 'total_timesteps': 63892}
[2026-04-16 19:09:41] [Wave4] Proposed params: {'learning_rate': 0.0003128257557719074, 'steps_per_switch': 6836, 'total_timesteps': 62683}
[2026-04-16 19:09:43] [Wave4] Launching trial 25: {'learning_rate': 0.0003128257557719074, 'steps_per_switch': 6836, 'total_timesteps': 62683}
[2026-04-16 19:09:43] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 62683 --steps-per-switch 6836 --learning-rate 0.0003128257557719074 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0025