donkeycar-rl-autoresearch/agent/outerloop-results/autoresearch_phase4_log.txt

[2026-04-14 22:40:44] =================================================================
[2026-04-14 22:40:44] [Wave4] Multi-Track Autoresearch — GP+UCB Generalization Search
[2026-04-14 22:40:44] [Wave4] Training tracks : generated_track, mountain_track (no generated_road, no warm-start)
[2026-04-14 22:40:44] [Wave4] Test tracks : mini_monaco only (zero-shot; warren removed — broken done condition)
[2026-04-14 22:40:44] [Wave4] Max trials : 25 | kappa=2.0 | push every 5
[2026-04-14 22:40:44] [Wave4] Results file : /home/paulh/projects/donkeycar-rl-autoresearch/agent/outerloop-results/autoresearch_results_phase4.jsonl
[2026-04-14 22:40:44] [Wave4] Champion dir : /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-champion
[2026-04-14 22:40:44] [Wave4] Warm start : NONE (training from scratch each trial)
[2026-04-14 22:40:44] =================================================================
[2026-04-14 22:40:44] [Wave4] Loaded 0 existing Phase 3 results.
[2026-04-14 22:40:44] [Wave4] No Wave 3 champion yet.
[2026-04-14 22:40:44] [Wave4] Starting from trial 1.
[2026-04-14 22:40:44]
[Wave4] ========== Trial 1/25 ==========
[2026-04-14 22:40:44] [Wave4] Seed trial 1/2: using hardcoded params.
[2026-04-14 22:40:44] [Wave4] Proposed params: {'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-14 22:40:46] [Wave4] Launching trial 1: {'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-14 22:40:46] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 80000 --steps-per-switch 6000 --learning-rate 0.0003 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0001
[2026-04-14 22:44:24] =================================================================
[2026-04-14 22:44:24] [Wave4] Multi-Track Autoresearch — GP+UCB Generalization Search
[2026-04-14 22:44:24] [Wave4] Training tracks : generated_track, mountain_track (no generated_road, no warm-start)
[2026-04-14 22:44:24] [Wave4] Test tracks : mini_monaco only (zero-shot; warren removed — broken done condition)
[2026-04-14 22:44:24] [Wave4] Max trials : 25 | kappa=2.0 | push every 5
[2026-04-14 22:44:24] [Wave4] Results file : /home/paulh/projects/donkeycar-rl-autoresearch/agent/outerloop-results/autoresearch_results_phase4.jsonl
[2026-04-14 22:44:24] [Wave4] Champion dir : /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-champion
[2026-04-14 22:44:24] [Wave4] Warm start : NONE (training from scratch each trial)
[2026-04-14 22:44:24] =================================================================
[2026-04-14 22:44:24] [Wave4] Loaded 0 existing Phase 3 results.
[2026-04-14 22:44:24] [Wave4] No Wave 3 champion yet.
[2026-04-14 22:44:24] [Wave4] Starting from trial 1.
[2026-04-14 22:44:24]
[Wave4] ========== Trial 1/25 ==========
[2026-04-14 22:44:24] [Wave4] Seed trial 1/2: using hardcoded params.
[2026-04-14 22:44:24] [Wave4] Proposed params: {'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-14 22:44:26] [Wave4] Launching trial 1: {'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-14 22:44:26] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 80000 --steps-per-switch 6000 --learning-rate 0.0003 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0001
[2026-04-15 00:02:45] [Wave4] Trial 1 finished in 4699.3s, rc=0
[2026-04-15 00:02:45] [Wave4] Parsed: combined=45.6693 mini_monaco=45.6693
[2026-04-15 00:02:45] [Champion] 🏆 NEW BEST! Trial 1: score=45.67 (mini_monaco=45.7) params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 00:02:45] [Wave4] ===== Trial 1 Summary =====
[2026-04-15 00:02:45] GP data points : 1
[2026-04-15 00:02:45] Wave4 Champion: trial=1 score=45.67 params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 00:02:45] Top 5:
[2026-04-15 00:02:45] score=45.67 params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 00:02:47]
[Wave4] ========== Trial 2/25 ==========
[2026-04-15 00:02:47] [Wave4] Seed trial 2/2: using hardcoded params.
[2026-04-15 00:02:47] [Wave4] Proposed params: {'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 00:02:49] [Wave4] Launching trial 2: {'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 00:02:49] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 80000 --steps-per-switch 6000 --learning-rate 0.001 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0002
[2026-04-15 01:21:38] [Wave4] Trial 2 finished in 4728.4s, rc=0
[2026-04-15 01:21:38] [Wave4] Parsed: combined=222.0731 mini_monaco=222.0731
[2026-04-15 01:21:38] [Champion] 🏆 NEW BEST! Trial 2: score=222.07 (mini_monaco=222.1) params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 01:21:38] [Wave4] ===== Trial 2 Summary =====
[2026-04-15 01:21:38] GP data points : 2
[2026-04-15 01:21:38] Wave4 Champion: trial=2 score=222.07 params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 01:21:38] Top 5:
[2026-04-15 01:21:38] score=222.07 params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 01:21:38] score=45.67 params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 01:21:40]
[Wave4] ========== Trial 3/25 ==========
[2026-04-15 01:21:40] [Wave4] Only 2 results — using random proposal.
[2026-04-15 01:21:40] [Wave4] Proposed params: {'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 01:21:42] [Wave4] Launching trial 3: {'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 01:21:42] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 157743 --steps-per-switch 17499 --learning-rate 0.0006852550685205609 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0003
[2026-04-15 03:15:46] [Wave4] Trial 3 finished in 6843.7s, rc=0
[2026-04-15 03:15:46] [Wave4] Parsed: combined=1943.1038 mini_monaco=1943.1038
[2026-04-15 03:15:46] [Champion] 🏆 NEW BEST! Trial 3: score=1943.10 (mini_monaco=1943.1) params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 03:15:46] [Wave4] ===== Trial 3 Summary =====
[2026-04-15 03:15:46] GP data points : 3
[2026-04-15 03:15:46] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 03:15:46] Top 5:
[2026-04-15 03:15:46] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 03:15:46] score=222.07 params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 03:15:46] score=45.67 params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 03:15:48]
[Wave4] ========== Trial 4/25 ==========
[2026-04-15 03:15:48] [Wave4] GP UCB top-5 proposals:
[2026-04-15 03:15:48] UCB=2.4560 mu=0.8788 σ=0.7886 params={'learning_rate': 0.0003250095463348546, 'steps_per_switch': 19054, 'total_timesteps': 197116}
[2026-04-15 03:15:48] UCB=2.4518 mu=0.8393 σ=0.8062 params={'learning_rate': 0.00121703003154963, 'steps_per_switch': 16951, 'total_timesteps': 180865}
[2026-04-15 03:15:48] UCB=2.4512 mu=0.7637 σ=0.8437 params={'learning_rate': 0.00036067077082995895, 'steps_per_switch': 16532, 'total_timesteps': 211219}
[2026-04-15 03:15:48] UCB=2.4501 mu=0.9283 σ=0.7609 params={'learning_rate': 0.0005325315186424085, 'steps_per_switch': 18992, 'total_timesteps': 205595}
[2026-04-15 03:15:48] UCB=2.4492 mu=0.9106 σ=0.7693 params={'learning_rate': 0.001163360064352729, 'steps_per_switch': 19652, 'total_timesteps': 151744}
[2026-04-15 03:15:48] [Wave4] Proposed params: {'learning_rate': 0.0003250095463348546, 'steps_per_switch': 19054, 'total_timesteps': 197116}
[2026-04-15 03:15:50] [Wave4] Launching trial 4: {'learning_rate': 0.0003250095463348546, 'steps_per_switch': 19054, 'total_timesteps': 197116}
[2026-04-15 03:15:50] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 197116 --steps-per-switch 19054 --learning-rate 0.0003250095463348546 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0004
[2026-04-15 05:15:51] [Wave4] Trial 4 TIMED OUT — killing runner.
[2026-04-15 05:15:51] [Wave4] Trial 4 finished in 7200.5s, rc=-9
[2026-04-15 05:15:51] [Wave4] Parsed: combined=None mini_monaco=None
[2026-04-15 05:15:51] [Wave4] ⚠️ No test score parsed — defaulting to 0.0
[2026-04-15 05:15:51] [Wave4] combined_test_score=0 — excluded from GP (crash/timeout).
[2026-04-15 05:15:51] [Wave4] ===== Trial 4 Summary =====
[2026-04-15 05:15:51] GP data points : 3
[2026-04-15 05:15:51] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 05:15:51] Top 5:
[2026-04-15 05:15:51] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 05:15:51] score=222.07 params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 05:15:51] score=45.67 params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 05:15:53]
[Wave4] ========== Trial 5/25 ==========
[2026-04-15 05:15:53] [Wave4] GP UCB top-5 proposals:
[2026-04-15 05:15:53] UCB=2.4597 mu=0.8329 σ=0.8134 params={'learning_rate': 0.0003927960467617446, 'steps_per_switch': 19892, 'total_timesteps': 201785}
[2026-04-15 05:15:53] UCB=2.4568 mu=0.8585 σ=0.7991 params={'learning_rate': 0.0011330710879806035, 'steps_per_switch': 18089, 'total_timesteps': 193054}
[2026-04-15 05:15:53] UCB=2.4560 mu=0.7832 σ=0.8364 params={'learning_rate': 0.0006110661120319741, 'steps_per_switch': 17141, 'total_timesteps': 219583}
[2026-04-15 05:15:53] UCB=2.4560 mu=0.8338 σ=0.8111 params={'learning_rate': 0.000602366907571214, 'steps_per_switch': 16527, 'total_timesteps': 215069}
[2026-04-15 05:15:53] UCB=2.4522 mu=0.8120 σ=0.8201 params={'learning_rate': 0.0004035684210100053, 'steps_per_switch': 16067, 'total_timesteps': 208387}
[2026-04-15 05:15:53] [Wave4] Proposed params: {'learning_rate': 0.0003927960467617446, 'steps_per_switch': 19892, 'total_timesteps': 201785}
[2026-04-15 05:15:55] [Wave4] Launching trial 5: {'learning_rate': 0.0003927960467617446, 'steps_per_switch': 19892, 'total_timesteps': 201785}
[2026-04-15 05:15:55] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 201785 --steps-per-switch 19892 --learning-rate 0.0003927960467617446 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0005
[2026-04-15 07:15:57] [Wave4] Trial 5 TIMED OUT — killing runner.
[2026-04-15 07:15:57] [Wave4] Trial 5 finished in 7202.3s, rc=-9
[2026-04-15 07:15:57] [Wave4] Parsed: combined=None mini_monaco=None
[2026-04-15 07:15:57] [Wave4] ⚠️ No test score parsed — defaulting to 0.0
[2026-04-15 07:15:57] [Wave4] combined_test_score=0 — excluded from GP (crash/timeout).
[2026-04-15 07:15:57] [Wave4] ===== Trial 5 Summary =====
[2026-04-15 07:15:57] GP data points : 3
[2026-04-15 07:15:57] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 07:15:57] Top 5:
[2026-04-15 07:15:57] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 07:15:57] score=222.07 params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 07:15:57] score=45.67 params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 07:15:59] [Wave4] ✅ Git push complete after trial 5
[2026-04-15 07:16:01]
[Wave4] ========== Trial 6/25 ==========
[2026-04-15 07:16:01] [Wave4] GP UCB top-5 proposals:
[2026-04-15 07:16:01] UCB=2.4565 mu=0.8712 σ=0.7926 params={'learning_rate': 0.0011062087200910864, 'steps_per_switch': 18318, 'total_timesteps': 194470}
[2026-04-15 07:16:01] UCB=2.4485 mu=0.9338 σ=0.7573 params={'learning_rate': 0.0004307107164246544, 'steps_per_switch': 19141, 'total_timesteps': 199878}
[2026-04-15 07:16:01] UCB=2.4478 mu=0.8840 σ=0.7819 params={'learning_rate': 0.00041215765557335777, 'steps_per_switch': 16229, 'total_timesteps': 203707}
[2026-04-15 07:16:01] UCB=2.4468 mu=0.8283 σ=0.8092 params={'learning_rate': 0.0009928039664024839, 'steps_per_switch': 19629, 'total_timesteps': 113788}
[2026-04-15 07:16:01] UCB=2.4456 mu=0.9298 σ=0.7579 params={'learning_rate': 0.0002412156295150517, 'steps_per_switch': 19116, 'total_timesteps': 179367}
[2026-04-15 07:16:01] [Wave4] Proposed params: {'learning_rate': 0.0011062087200910864, 'steps_per_switch': 18318, 'total_timesteps': 194470}
[2026-04-15 07:16:03] [Wave4] Launching trial 6: {'learning_rate': 0.0011062087200910864, 'steps_per_switch': 18318, 'total_timesteps': 194470}
[2026-04-15 07:16:03] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 194470 --steps-per-switch 18318 --learning-rate 0.0011062087200910864 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0006
[2026-04-15 09:06:26] =================================================================
[2026-04-15 09:06:26] [Wave4] Multi-Track Autoresearch — GP+UCB Generalization Search
[2026-04-15 09:06:26] [Wave4] Training tracks : generated_track, mountain_track (no generated_road, no warm-start)
[2026-04-15 09:06:26] [Wave4] Test tracks : mini_monaco only (zero-shot; warren removed — broken done condition)
[2026-04-15 09:06:26] [Wave4] Max trials : 25 | kappa=2.0 | push every 5
[2026-04-15 09:06:26] [Wave4] Results file : /home/paulh/projects/donkeycar-rl-autoresearch/agent/outerloop-results/autoresearch_results_phase4.jsonl
[2026-04-15 09:06:26] [Wave4] Champion dir : /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-champion
[2026-04-15 09:06:26] [Wave4] Warm start : NONE (training from scratch each trial)
[2026-04-15 09:06:26] =================================================================
[2026-04-15 09:06:26] [Wave4] Loaded 0 existing Phase 3 results.
[2026-04-15 09:06:26] [Wave4] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 09:06:26] [Wave4] Starting from trial 1.
[2026-04-15 09:06:26]
[Wave4] ========== Trial 1/25 ==========
[2026-04-15 09:06:26] [Wave4] Seed trial 1/2: using hardcoded params.
[2026-04-15 09:06:26] [Wave4] Proposed params: {'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 09:06:28] [Wave4] Launching trial 1: {'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 09:06:28] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 80000 --steps-per-switch 6000 --learning-rate 0.0003 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0001
[2026-04-15 09:17:28] =================================================================
[2026-04-15 09:17:28] [Wave4] Multi-Track Autoresearch — GP+UCB Generalization Search
[2026-04-15 09:17:28] [Wave4] Training tracks : generated_track, mountain_track (no generated_road, no warm-start)
[2026-04-15 09:17:28] [Wave4] Test tracks : mini_monaco only (zero-shot; warren removed — broken done condition)
[2026-04-15 09:17:28] [Wave4] Max trials : 25 | kappa=2.0 | push every 5
[2026-04-15 09:17:28] [Wave4] Results file : /home/paulh/projects/donkeycar-rl-autoresearch/agent/outerloop-results/autoresearch_results_phase4.jsonl
[2026-04-15 09:17:28] [Wave4] Champion dir : /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-champion
[2026-04-15 09:17:28] [Wave4] Warm start : NONE (training from scratch each trial)
[2026-04-15 09:17:28] =================================================================
[2026-04-15 09:17:28] [Wave4] Loaded 0 existing Phase 3 results.
[2026-04-15 09:17:28] [Wave4] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 09:17:28] [Wave4] Starting from trial 1.
[2026-04-15 09:17:28]
[Wave4] ========== Trial 1/25 ==========
[2026-04-15 09:17:28] [Wave4] Seed trial 1/2: using hardcoded params.
[2026-04-15 09:17:28] [Wave4] Proposed params: {'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 09:17:30] [Wave4] Launching trial 1: {'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 09:17:30] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 80000 --steps-per-switch 6000 --learning-rate 0.0003 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0001
[2026-04-15 10:33:27] [Wave4] Trial 1 finished in 4557.0s, rc=0
[2026-04-15 10:33:27] [Wave4] Parsed: combined=42.2964 mini_monaco=42.2964
[2026-04-15 10:33:27] [Wave4] ===== Trial 1 Summary =====
[2026-04-15 10:33:27] GP data points : 1
[2026-04-15 10:33:27] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 10:33:27] Top 5:
[2026-04-15 10:33:27] score=42.30 params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 10:33:29]
[Wave4] ========== Trial 2/25 ==========
[2026-04-15 10:33:29] [Wave4] Seed trial 2/2: using hardcoded params.
[2026-04-15 10:33:29] [Wave4] Proposed params: {'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 10:33:31] [Wave4] Launching trial 2: {'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 10:33:31] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 80000 --steps-per-switch 6000 --learning-rate 0.001 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0002
[2026-04-15 11:39:05] [Wave4] Trial 2 finished in 3934.0s, rc=0
[2026-04-15 11:39:05] [Wave4] Parsed: combined=93.3894 mini_monaco=93.3894
[2026-04-15 11:39:05] [Wave4] ===== Trial 2 Summary =====
[2026-04-15 11:39:05] GP data points : 2
[2026-04-15 11:39:05] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 11:39:05] Top 5:
[2026-04-15 11:39:05] score=93.39 params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 11:39:05] score=42.30 params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 11:39:07]
[Wave4] ========== Trial 3/25 ==========
[2026-04-15 11:39:07] [Wave4] Only 2 results — using random proposal.
[2026-04-15 11:39:07] [Wave4] Proposed params: {'learning_rate': 0.0008162408849407889, 'steps_per_switch': 8441, 'total_timesteps': 140634}
[2026-04-15 11:39:09] [Wave4] Launching trial 3: {'learning_rate': 0.0008162408849407889, 'steps_per_switch': 8441, 'total_timesteps': 140634}
[2026-04-15 11:39:09] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 140634 --steps-per-switch 8441 --learning-rate 0.0008162408849407889 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0003
[2026-04-15 13:39:32] [Wave4] Trial 3 TIMED OUT — killing runner.
[2026-04-15 13:39:32] [Wave4] Trial 3 finished in 7222.7s, rc=-9
[2026-04-15 13:39:32] [Wave4] Parsed: combined=None mini_monaco=None
[2026-04-15 13:39:32] [Wave4] ⚠️ No test score parsed — defaulting to 0.0
[2026-04-15 13:39:32] [Wave4] combined_test_score=0 — excluded from GP (crash/timeout).
[2026-04-15 13:39:32] [Wave4] ===== Trial 3 Summary =====
[2026-04-15 13:39:32] GP data points : 2
[2026-04-15 13:39:32] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 13:39:32] Top 5:
[2026-04-15 13:39:32] score=93.39 params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 13:39:32] score=42.30 params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 13:39:34]
[Wave4] ========== Trial 4/25 ==========
[2026-04-15 13:39:34] [Wave4] Only 2 results — using random proposal.
[2026-04-15 13:39:34] [Wave4] Proposed params: {'learning_rate': 0.00020853884350577402, 'steps_per_switch': 19927, 'total_timesteps': 138928}
[2026-04-15 13:39:36] [Wave4] Launching trial 4: {'learning_rate': 0.00020853884350577402, 'steps_per_switch': 19927, 'total_timesteps': 138928}
[2026-04-15 13:39:36] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 138928 --steps-per-switch 19927 --learning-rate 0.00020853884350577402 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0004
[2026-04-15 15:39:37] [Wave4] Trial 4 TIMED OUT — killing runner.
[2026-04-15 15:39:37] [Wave4] Trial 4 finished in 7200.7s, rc=-9
[2026-04-15 15:39:37] [Wave4] Parsed: combined=None mini_monaco=None
[2026-04-15 15:39:37] [Wave4] ⚠️ No test score parsed — defaulting to 0.0
[2026-04-15 15:39:37] [Wave4] combined_test_score=0 — excluded from GP (crash/timeout).
[2026-04-15 15:39:37] [Wave4] ===== Trial 4 Summary =====
[2026-04-15 15:39:37] GP data points : 2
[2026-04-15 15:39:37] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 15:39:37] Top 5:
[2026-04-15 15:39:37] score=93.39 params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 15:39:37] score=42.30 params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 15:39:39]
[Wave4] ========== Trial 5/25 ==========
[2026-04-15 15:39:39] [Wave4] Only 2 results — using random proposal.
[2026-04-15 15:39:39] [Wave4] Proposed params: {'learning_rate': 0.0007517877668650138, 'steps_per_switch': 9368, 'total_timesteps': 104878}
[2026-04-15 15:39:41] [Wave4] Launching trial 5: {'learning_rate': 0.0007517877668650138, 'steps_per_switch': 9368, 'total_timesteps': 104878}
[2026-04-15 15:39:41] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 104878 --steps-per-switch 9368 --learning-rate 0.0007517877668650138 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0005
[2026-04-15 17:08:50] [Wave4] Trial 5 finished in 5348.8s, rc=0
[2026-04-15 17:08:50] [Wave4] Parsed: combined=31.73 mini_monaco=31.73
[2026-04-15 17:08:50] [Wave4] ===== Trial 5 Summary =====
[2026-04-15 17:08:50] GP data points : 3
[2026-04-15 17:08:50] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 17:08:50] Top 5:
[2026-04-15 17:08:50] score=93.39 params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 17:08:50] score=42.30 params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 17:08:50] score=31.73 params={'learning_rate': 0.0007517877668650138, 'steps_per_switch': 9368, 'total_timesteps': 104878}
[2026-04-15 17:08:51] [Wave4] ✅ Git push complete after trial 5
[2026-04-15 17:08:53]
[Wave4] ========== Trial 6/25 ==========
[2026-04-15 17:08:53] [Wave4] GP UCB top-5 proposals:
[2026-04-15 17:08:53] UCB=2.9352 mu=1.3419 σ=0.7966 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-15 17:08:53] UCB=2.8442 mu=1.0827 σ=0.8808 params={'learning_rate': 0.0017357820530198068, 'steps_per_switch': 5009, 'total_timesteps': 87890}
[2026-04-15 17:08:53] UCB=2.8258 mu=1.0947 σ=0.8655 params={'learning_rate': 0.001668571948240882, 'steps_per_switch': 4814, 'total_timesteps': 101589}
[2026-04-15 17:08:53] UCB=2.8168 mu=0.9976 σ=0.9096 params={'learning_rate': 0.001788300003253932, 'steps_per_switch': 4215, 'total_timesteps': 80751}
[2026-04-15 17:08:53] UCB=2.8147 mu=1.5860 σ=0.6144 params={'learning_rate': 0.0012975326127189415, 'steps_per_switch': 3961, 'total_timesteps': 100527}
[2026-04-15 17:08:53] [Wave4] Proposed params: {'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-15 17:08:55] [Wave4] Launching trial 6: {'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-15 17:08:55] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 79532 --steps-per-switch 5524 --learning-rate 0.0016223486895735558 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0006
[2026-04-15 18:19:30] [Wave4] Trial 6 finished in 4235.3s, rc=0
[2026-04-15 18:19:30] [Wave4] Parsed: combined=176.6721 mini_monaco=176.6721
[2026-04-15 18:19:31] [Wave4] ===== Trial 6 Summary =====
[2026-04-15 18:19:31] GP data points : 4
[2026-04-15 18:19:31] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 18:19:31] Top 5:
[2026-04-15 18:19:31] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-15 18:19:31] score=93.39 params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 18:19:31] score=42.30 params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 18:19:31] score=31.73 params={'learning_rate': 0.0007517877668650138, 'steps_per_switch': 9368, 'total_timesteps': 104878}
[2026-04-15 18:19:33]
[Wave4] ========== Trial 7/25 ==========
[2026-04-15 18:19:33] [Wave4] GP UCB top-5 proposals:
[2026-04-15 18:19:33] UCB=2.5520 mu=1.1319 σ=0.7101 params={'learning_rate': 0.001779556625962812, 'steps_per_switch': 4226, 'total_timesteps': 123651}
[2026-04-15 18:19:33] UCB=2.5115 mu=0.7590 σ=0.8763 params={'learning_rate': 0.0017725367196782225, 'steps_per_switch': 3941, 'total_timesteps': 145375}
[2026-04-15 18:19:33] UCB=2.4988 mu=1.1176 σ=0.6906 params={'learning_rate': 0.0018568268138302447, 'steps_per_switch': 6910, 'total_timesteps': 119000}
[2026-04-15 18:19:33] UCB=2.4973 mu=0.8571 σ=0.8201 params={'learning_rate': 0.0019597767383017994, 'steps_per_switch': 9245, 'total_timesteps': 113699}
[2026-04-15 18:19:33] UCB=2.4874 mu=1.2498 σ=0.6188 params={'learning_rate': 0.001739834862935009, 'steps_per_switch': 4764, 'total_timesteps': 117423}
[2026-04-15 18:19:33] [Wave4] Proposed params: {'learning_rate': 0.001779556625962812, 'steps_per_switch': 4226, 'total_timesteps': 123651}
[2026-04-15 18:19:35] [Wave4] Launching trial 7: {'learning_rate': 0.001779556625962812, 'steps_per_switch': 4226, 'total_timesteps': 123651}
[2026-04-15 18:19:35] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 123651 --steps-per-switch 4226 --learning-rate 0.001779556625962812 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0007
[2026-04-15 20:19:38] [Wave4] Trial 7 TIMED OUT — killing runner.
[2026-04-15 20:19:38] [Wave4] Trial 7 finished in 7203.4s, rc=-9
[2026-04-15 20:19:38] [Wave4] Parsed: combined=None mini_monaco=None
[2026-04-15 20:19:38] [Wave4] ⚠️ No test score parsed — defaulting to 0.0
[2026-04-15 20:19:38] [Wave4] combined_test_score=0 — excluded from GP (crash/timeout).
[2026-04-15 20:19:38] [Wave4] ===== Trial 7 Summary =====
[2026-04-15 20:19:38] GP data points : 4
[2026-04-15 20:19:38] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 20:19:38] Top 5:
[2026-04-15 20:19:38] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-15 20:19:38] score=93.39 params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 20:19:38] score=42.30 params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 20:19:38] score=31.73 params={'learning_rate': 0.0007517877668650138, 'steps_per_switch': 9368, 'total_timesteps': 104878}
[2026-04-15 20:19:40]
[Wave4] ========== Trial 8/25 ==========
[2026-04-15 20:19:40] [Wave4] GP UCB top-5 proposals:
[2026-04-15 20:19:40] UCB=2.5481 mu=1.2081 σ=0.6700 params={'learning_rate': 0.0019521225364821895, 'steps_per_switch': 4185, 'total_timesteps': 110996}
[2026-04-15 20:19:40] UCB=2.5357 mu=0.8887 σ=0.8235 params={'learning_rate': 0.001901474046587741, 'steps_per_switch': 5568, 'total_timesteps': 136179}
[2026-04-15 20:19:40] UCB=2.5188 mu=1.1408 σ=0.6890 params={'learning_rate': 0.0018359364507444984, 'steps_per_switch': 5746, 'total_timesteps': 122356}
[2026-04-15 20:19:40] UCB=2.4994 mu=0.7140 σ=0.8927 params={'learning_rate': 0.0019730433529852004, 'steps_per_switch': 4578, 'total_timesteps': 144668}
[2026-04-15 20:19:40] UCB=2.4934 mu=1.4169 σ=0.5382 params={'learning_rate': 0.0018644207617691767, 'steps_per_switch': 3071, 'total_timesteps': 75515}
[2026-04-15 20:19:40] [Wave4] Proposed params: {'learning_rate': 0.0019521225364821895, 'steps_per_switch': 4185, 'total_timesteps': 110996}
[2026-04-15 20:19:42] [Wave4] Launching trial 8: {'learning_rate': 0.0019521225364821895, 'steps_per_switch': 4185, 'total_timesteps': 110996}
[2026-04-15 20:19:42] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 110996 --steps-per-switch 4185 --learning-rate 0.0019521225364821895 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0008
[2026-04-15 22:19:42] [Wave4] Trial 8 TIMED OUT — killing runner.
[2026-04-15 22:19:42] [Wave4] Trial 8 finished in 7200.2s, rc=-9
[2026-04-15 22:19:42] [Wave4] Parsed: combined=None mini_monaco=None
[2026-04-15 22:19:42] [Wave4] ⚠️ No test score parsed — defaulting to 0.0
[2026-04-15 22:19:42] [Wave4] combined_test_score=0 — excluded from GP (crash/timeout).
[2026-04-15 22:19:42] [Wave4] ===== Trial 8 Summary =====
[2026-04-15 22:19:42] GP data points : 4
[2026-04-15 22:19:42] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 22:19:42] Top 5:
[2026-04-15 22:19:42] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-15 22:19:42] score=93.39 params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 22:19:42] score=42.30 params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 22:19:42] score=31.73 params={'learning_rate': 0.0007517877668650138, 'steps_per_switch': 9368, 'total_timesteps': 104878}
[2026-04-15 22:19:44]
[Wave4] ========== Trial 9/25 ==========
[2026-04-15 22:19:44] [Wave4] GP UCB top-5 proposals:
[2026-04-15 22:19:44] UCB=2.5432 mu=1.2960 σ=0.6236 params={'learning_rate': 0.001989433235306402, 'steps_per_switch': 3575, 'total_timesteps': 65457}
[2026-04-15 22:19:44] UCB=2.5396 mu=1.1027 σ=0.7185 params={'learning_rate': 0.001979671928972082, 'steps_per_switch': 8884, 'total_timesteps': 73116}
[2026-04-15 22:19:44] UCB=2.5350 mu=1.2006 σ=0.6672 params={'learning_rate': 0.0019228314496482347, 'steps_per_switch': 4960, 'total_timesteps': 115468}
[2026-04-15 22:19:44] UCB=2.5256 mu=1.1804 σ=0.6726 params={'learning_rate': 0.0016231147459723914, 'steps_per_switch': 3259, 'total_timesteps': 115708}
[2026-04-15 22:19:44] UCB=2.5201 mu=0.9834 σ=0.7683 params={'learning_rate': 0.0016588035055714473, 'steps_per_switch': 4314, 'total_timesteps': 131749}
[2026-04-15 22:19:44] [Wave4] Proposed params: {'learning_rate': 0.001989433235306402, 'steps_per_switch': 3575, 'total_timesteps': 65457}
[2026-04-15 22:19:46] [Wave4] Launching trial 9: {'learning_rate': 0.001989433235306402, 'steps_per_switch': 3575, 'total_timesteps': 65457}
[2026-04-15 22:19:46] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 65457 --steps-per-switch 3575 --learning-rate 0.001989433235306402 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0009
[2026-04-15 22:23:21] [Wave4] Trial 9 finished in 215.0s, rc=101
[2026-04-15 22:23:21] [Wave4] Parsed: combined=None mini_monaco=None
[2026-04-15 22:23:21] [Wave4] ⚠️ No test score parsed — defaulting to 0.0
[2026-04-15 22:23:21] [Wave4] combined_test_score=0 — excluded from GP (crash/timeout).
[2026-04-15 22:23:21] [Wave4] ===== Trial 9 Summary =====
[2026-04-15 22:23:21] GP data points : 4
[2026-04-15 22:23:21] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 22:23:21] Top 5:
[2026-04-15 22:23:21] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-15 22:23:21] score=93.39 params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 22:23:21] score=42.30 params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 22:23:21] score=31.73 params={'learning_rate': 0.0007517877668650138, 'steps_per_switch': 9368, 'total_timesteps': 104878}
[2026-04-15 22:23:23]
[Wave4] ========== Trial 10/25 ==========
[2026-04-15 22:23:23] [Wave4] GP UCB top-5 proposals:
[2026-04-15 22:23:23] UCB=2.5794 mu=0.9857 σ=0.7969 params={'learning_rate': 0.00192547022313727, 'steps_per_switch': 3237, 'total_timesteps': 124659}
[2026-04-15 22:23:23] UCB=2.5191 mu=1.3579 σ=0.5806 params={'learning_rate': 0.0019414376395480834, 'steps_per_switch': 3402, 'total_timesteps': 69220}
[2026-04-15 22:23:23] UCB=2.5097 mu=0.7258 σ=0.8919 params={'learning_rate': 0.0019051112417148412, 'steps_per_switch': 3607, 'total_timesteps': 144368}
[2026-04-15 22:23:23] UCB=2.4894 mu=1.2599 σ=0.6148 params={'learning_rate': 0.001905194185221269, 'steps_per_switch': 5874, 'total_timesteps': 111439}
[2026-04-15 22:23:23] UCB=2.4776 mu=1.1168 σ=0.6804 params={'learning_rate': 0.0017822503576577222, 'steps_per_switch': 6596, 'total_timesteps': 121681}
[2026-04-15 22:23:23] [Wave4] Proposed params: {'learning_rate': 0.00192547022313727, 'steps_per_switch': 3237, 'total_timesteps': 124659}
[2026-04-15 22:23:25] [Wave4] Launching trial 10: {'learning_rate': 0.00192547022313727, 'steps_per_switch': 3237, 'total_timesteps': 124659}
[2026-04-15 22:23:25] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 124659 --steps-per-switch 3237 --learning-rate 0.00192547022313727 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0010
[2026-04-15 22:26:54] =================================================================
[2026-04-15 22:26:54] [Wave4] Multi-Track Autoresearch — GP+UCB Generalization Search
[2026-04-15 22:26:54] [Wave4] Training tracks : generated_track, mountain_track (no generated_road, no warm-start)
[2026-04-15 22:26:54] [Wave4] Test tracks : mini_monaco only (zero-shot; warren removed — broken done condition)
[2026-04-15 22:26:54] [Wave4] Max trials : 25 | kappa=2.0 | push every 5
[2026-04-15 22:26:54] [Wave4] Results file : /home/paulh/projects/donkeycar-rl-autoresearch/agent/outerloop-results/autoresearch_results_phase4.jsonl
[2026-04-15 22:26:54] [Wave4] Champion dir : /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-champion
[2026-04-15 22:26:54] [Wave4] Warm start : NONE (training from scratch each trial)
[2026-04-15 22:26:54] =================================================================
[2026-04-15 22:26:54] [Wave4] Loaded 5 existing Phase 3 results.
[2026-04-15 22:26:54] [Wave4] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 22:26:54] [Wave4] Starting from trial 6.
[2026-04-15 22:26:54]
[Wave4] ========== Trial 6/25 ==========
[2026-04-15 22:26:54] [Wave4] GP UCB top-5 proposals:
[2026-04-15 22:26:54] UCB=2.8029 mu=1.3217 σ=0.7406 params={'learning_rate': 0.0009434282949002715, 'steps_per_switch': 14966, 'total_timesteps': 83094}
[2026-04-15 22:26:54] UCB=2.7637 mu=1.4556 σ=0.6540 params={'learning_rate': 0.001016649027182601, 'steps_per_switch': 14757, 'total_timesteps': 85809}
[2026-04-15 22:26:54] UCB=2.7344 mu=1.1173 σ=0.8085 params={'learning_rate': 0.000525489856531106, 'steps_per_switch': 14503, 'total_timesteps': 81150}
[2026-04-15 22:26:54] UCB=2.7210 mu=1.0163 σ=0.8523 params={'learning_rate': 0.000448503297396427, 'steps_per_switch': 14723, 'total_timesteps': 80477}
[2026-04-15 22:26:54] UCB=2.6726 mu=0.9116 σ=0.8805 params={'learning_rate': 0.0011227428004033503, 'steps_per_switch': 14832, 'total_timesteps': 81442}
[2026-04-15 22:26:54] [Wave4] Proposed params: {'learning_rate': 0.0009434282949002715, 'steps_per_switch': 14966, 'total_timesteps': 83094}
[2026-04-15 22:26:56] [Wave4] Launching trial 6: {'learning_rate': 0.0009434282949002715, 'steps_per_switch': 14966, 'total_timesteps': 83094}
[2026-04-15 22:26:56] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 83094 --steps-per-switch 14966 --learning-rate 0.0009434282949002715 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0006
[2026-04-15 22:38:49] [Wave4] Trial 6 finished in 712.6s, rc=101
[2026-04-15 22:38:49] [Wave4] Parsed: combined=None mini_monaco=None
[2026-04-15 22:38:49] [Wave4] ⚠️ No test score parsed — defaulting to 0.0
[2026-04-15 22:38:49] [Wave4] combined_test_score=0 — excluded from GP (crash/timeout).
[2026-04-15 22:38:49] [Wave4] ===== Trial 6 Summary =====
[2026-04-15 22:38:49] GP data points : 5
[2026-04-15 22:38:49] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 22:38:49] Top 5:
[2026-04-15 22:38:49] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-15 22:38:49] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-15 22:38:49] score=93.39 params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 22:38:49] score=42.30 params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 22:38:49] score=31.73 params={'learning_rate': 0.0007517877668650138, 'steps_per_switch': 9368, 'total_timesteps': 104878}
[2026-04-15 22:38:51]
[Wave4] ========== Trial 7/25 ==========
[2026-04-15 22:38:51] [Wave4] GP UCB top-5 proposals:
[2026-04-15 22:38:51] UCB=2.8131 mu=1.4171 σ=0.6980 params={'learning_rate': 0.0004346662414818233, 'steps_per_switch': 14860, 'total_timesteps': 83818}
[2026-04-15 22:38:51] UCB=2.7278 mu=1.2363 σ=0.7458 params={'learning_rate': 0.00020210801580883826, 'steps_per_switch': 14145, 'total_timesteps': 88060}
[2026-04-15 22:38:51] UCB=2.7159 mu=1.0082 σ=0.8538 params={'learning_rate': 0.00012352374581522935, 'steps_per_switch': 14555, 'total_timesteps': 84897}
[2026-04-15 22:38:51] UCB=2.6982 mu=1.4815 σ=0.6084 params={'learning_rate': 0.0010645009917759616, 'steps_per_switch': 14511, 'total_timesteps': 89107}
[2026-04-15 22:38:51] UCB=2.6948 mu=1.1351 σ=0.7798 params={'learning_rate': 0.0012021332143154963, 'steps_per_switch': 14377, 'total_timesteps': 87121}
[2026-04-15 22:38:51] [Wave4] Proposed params: {'learning_rate': 0.0004346662414818233, 'steps_per_switch': 14860, 'total_timesteps': 83818}
[2026-04-15 22:38:53] [Wave4] Launching trial 7: {'learning_rate': 0.0004346662414818233, 'steps_per_switch': 14860, 'total_timesteps': 83818}
[2026-04-15 22:38:53] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 83818 --steps-per-switch 14860 --learning-rate 0.0004346662414818233 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0007
[2026-04-15 22:47:30] =================================================================
[2026-04-15 22:47:30] [Wave4] Multi-Track Autoresearch — GP+UCB Generalization Search
[2026-04-15 22:47:30] [Wave4] Training tracks : generated_track, mountain_track (no generated_road, no warm-start)
[2026-04-15 22:47:30] [Wave4] Test tracks : mini_monaco only (zero-shot; warren removed — broken done condition)
[2026-04-15 22:47:30] [Wave4] Max trials : 25 | kappa=2.0 | push every 5
[2026-04-15 22:47:30] [Wave4] Results file : /home/paulh/projects/donkeycar-rl-autoresearch/agent/outerloop-results/autoresearch_results_phase4.jsonl
[2026-04-15 22:47:30] [Wave4] Champion dir : /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-champion
[2026-04-15 22:47:30] [Wave4] Warm start : NONE (training from scratch each trial)
[2026-04-15 22:47:30] =================================================================
[2026-04-15 22:47:30] [Wave4] Loaded 6 existing Phase 3 results.
[2026-04-15 22:47:30] [Wave4] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 22:47:30] [Wave4] Starting from trial 7.
[2026-04-15 22:47:30]
[Wave4] ========== Trial 7/25 ==========
[2026-04-15 22:47:30] [Wave4] GP UCB top-5 proposals:
[2026-04-15 22:47:30] UCB=3.2602 mu=2.0535 σ=0.6034 params={'learning_rate': 0.0003074676335173842, 'steps_per_switch': 14103, 'total_timesteps': 89493}
[2026-04-15 22:47:30] UCB=3.2334 mu=1.9885 σ=0.6224 params={'learning_rate': 0.00030011572199819647, 'steps_per_switch': 13772, 'total_timesteps': 89937}
[2026-04-15 22:47:30] UCB=3.1559 mu=2.0419 σ=0.5570 params={'learning_rate': 0.0003574517736244094, 'steps_per_switch': 14971, 'total_timesteps': 88076}
[2026-04-15 22:47:30] UCB=2.5661 mu=1.3681 σ=0.5990 params={'learning_rate': 0.00039989051435482747, 'steps_per_switch': 12483, 'total_timesteps': 89578}
[2026-04-15 22:47:30] UCB=2.5147 mu=0.7976 σ=0.8585 params={'learning_rate': 0.00013529496950197116, 'steps_per_switch': 12536, 'total_timesteps': 85905}
[2026-04-15 22:47:30] [Wave4] Proposed params: {'learning_rate': 0.0003074676335173842, 'steps_per_switch': 14103, 'total_timesteps': 89493}
[2026-04-15 22:47:32] [Wave4] Launching trial 7: {'learning_rate': 0.0003074676335173842, 'steps_per_switch': 14103, 'total_timesteps': 89493}
[2026-04-15 22:47:32] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 89493 --steps-per-switch 14103 --learning-rate 0.0003074676335173842 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0007
[2026-04-15 23:45:56] [Wave4] Trial 7 finished in 3503.6s, rc=0
[2026-04-15 23:45:56] [Wave4] Parsed: combined=80.9467 mini_monaco=80.9467
[2026-04-15 23:45:56] [Wave4] ===== Trial 7 Summary =====
[2026-04-15 23:45:56] GP data points : 7
[2026-04-15 23:45:56] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-15 23:45:56] Top 5:
[2026-04-15 23:45:56] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-15 23:45:56] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-15 23:45:56] score=93.39 params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 23:45:56] score=80.95 params={'learning_rate': 0.0003074676335173842, 'steps_per_switch': 14103, 'total_timesteps': 89493}
[2026-04-15 23:45:56] score=42.30 params={'learning_rate': 0.0003, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-15 23:45:58]
[Wave4] ========== Trial 8/25 ==========
[2026-04-15 23:45:58] [Wave4] GP UCB top-5 proposals:
[2026-04-15 23:45:58] UCB=2.8761 mu=2.3811 σ=0.2475 params={'learning_rate': 0.0008478421211922181, 'steps_per_switch': 14326, 'total_timesteps': 89798}
[2026-04-15 23:45:58] UCB=2.7487 mu=1.8283 σ=0.4602 params={'learning_rate': 0.0010473338684621142, 'steps_per_switch': 13978, 'total_timesteps': 89106}
[2026-04-15 23:45:58] UCB=2.7282 mu=1.1574 σ=0.7854 params={'learning_rate': 0.0013992142565926369, 'steps_per_switch': 14988, 'total_timesteps': 89592}
[2026-04-15 23:45:58] UCB=2.1012 mu=1.4334 σ=0.3339 params={'learning_rate': 0.0008049497388184544, 'steps_per_switch': 13467, 'total_timesteps': 88845}
[2026-04-15 23:45:58] UCB=2.0136 mu=0.1079 σ=0.9528 params={'learning_rate': 0.0017586582805663546, 'steps_per_switch': 14607, 'total_timesteps': 89073}
[2026-04-15 23:45:58] [Wave4] Proposed params: {'learning_rate': 0.0008478421211922181, 'steps_per_switch': 14326, 'total_timesteps': 89798}
[2026-04-15 23:46:00] [Wave4] Launching trial 8: {'learning_rate': 0.0008478421211922181, 'steps_per_switch': 14326, 'total_timesteps': 89798}
[2026-04-15 23:46:00] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 89798 --steps-per-switch 14326 --learning-rate 0.0008478421211922181 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0008
[2026-04-16 00:48:42] [Wave4] Trial 8 finished in 3762.1s, rc=0
[2026-04-16 00:48:42] [Wave4] Parsed: combined=116.2588 mini_monaco=116.2588
[2026-04-16 00:48:42] [Wave4] ===== Trial 8 Summary =====
[2026-04-16 00:48:42] GP data points : 8
[2026-04-16 00:48:42] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 00:48:42] Top 5:
[2026-04-16 00:48:42] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 00:48:42] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 00:48:42] score=116.26 params={'learning_rate': 0.0008478421211922181, 'steps_per_switch': 14326, 'total_timesteps': 89798}
[2026-04-16 00:48:42] score=93.39 params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-16 00:48:42] score=80.95 params={'learning_rate': 0.0003074676335173842, 'steps_per_switch': 14103, 'total_timesteps': 89493}
[2026-04-16 00:48:44]
[Wave4] ========== Trial 9/25 ==========
[2026-04-16 00:48:44] [Wave4] GP UCB top-5 proposals:
[2026-04-16 00:48:44] UCB=2.6052 mu=1.5807 σ=0.5122 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 00:48:44] UCB=2.5103 mu=1.1029 σ=0.7037 params={'learning_rate': 0.0007143303795900681, 'steps_per_switch': 5097, 'total_timesteps': 88077}
[2026-04-16 00:48:44] UCB=2.3990 mu=0.8072 σ=0.7959 params={'learning_rate': 0.00027589394711147663, 'steps_per_switch': 5263, 'total_timesteps': 89853}
[2026-04-16 00:48:44] UCB=2.3635 mu=0.6759 σ=0.8438 params={'learning_rate': 0.0007616025453782968, 'steps_per_switch': 3774, 'total_timesteps': 88582}
[2026-04-16 00:48:44] UCB=2.3265 mu=0.9173 σ=0.7046 params={'learning_rate': 0.0011226509565016517, 'steps_per_switch': 5495, 'total_timesteps': 88334}
[2026-04-16 00:48:44] [Wave4] Proposed params: {'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 00:48:46] [Wave4] Launching trial 9: {'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 00:48:46] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 89893 --steps-per-switch 6851 --learning-rate 0.0007252855740444645 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0009
[2026-04-16 02:11:13] [Wave4] Trial 9 finished in 4947.1s, rc=0
[2026-04-16 02:11:13] [Wave4] Parsed: combined=1435.0401 mini_monaco=1435.0401
[2026-04-16 02:11:13] [Wave4] ===== Trial 9 Summary =====
[2026-04-16 02:11:13] GP data points : 9
[2026-04-16 02:11:13] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 02:11:13] Top 5:
[2026-04-16 02:11:13] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 02:11:13] score=1435.04 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 02:11:13] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 02:11:13] score=116.26 params={'learning_rate': 0.0008478421211922181, 'steps_per_switch': 14326, 'total_timesteps': 89798}
[2026-04-16 02:11:13] score=93.39 params={'learning_rate': 0.001, 'steps_per_switch': 6000, 'total_timesteps': 80000}
[2026-04-16 02:11:15]
[Wave4] ========== Trial 10/25 ==========
[2026-04-16 02:11:15] [Wave4] GP UCB top-5 proposals:
[2026-04-16 02:11:15] UCB=2.0934 mu=0.9205 σ=0.5865 params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 02:11:15] UCB=1.9984 mu=0.3915 σ=0.8034 params={'learning_rate': 0.00150594219964486, 'steps_per_switch': 5446, 'total_timesteps': 89972}
[2026-04-16 02:11:15] UCB=1.9795 mu=-0.0207 σ=1.0001 params={'learning_rate': 0.00187204813121039, 'steps_per_switch': 11474, 'total_timesteps': 60629}
[2026-04-16 02:11:15] UCB=1.9792 mu=-0.0197 σ=0.9994 params={'learning_rate': 0.0006516393606134399, 'steps_per_switch': 14385, 'total_timesteps': 60164}
[2026-04-16 02:11:15] UCB=1.9789 mu=-0.0200 σ=0.9995 params={'learning_rate': 0.001964829385849987, 'steps_per_switch': 10248, 'total_timesteps': 61072}
[2026-04-16 02:11:15] [Wave4] Proposed params: {'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 02:11:17] [Wave4] Launching trial 10: {'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 02:11:17] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 89501 --steps-per-switch 4587 --learning-rate 0.0010578566658808422 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0010
[2026-04-16 03:31:41] [Wave4] Trial 10 finished in 4823.9s, rc=0
[2026-04-16 03:31:41] [Wave4] Parsed: combined=141.3547 mini_monaco=141.3547
[2026-04-16 03:31:41] [Wave4] ===== Trial 10 Summary =====
[2026-04-16 03:31:41] GP data points : 10
[2026-04-16 03:31:41] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 03:31:41] Top 5:
[2026-04-16 03:31:41] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 03:31:41] score=1435.04 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 03:31:41] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 03:31:41] score=141.35 params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 03:31:41] score=116.26 params={'learning_rate': 0.0008478421211922181, 'steps_per_switch': 14326, 'total_timesteps': 89798}
[2026-04-16 03:31:42] [Wave4] ✅ Git push complete after trial 10
[2026-04-16 03:31:44]
[Wave4] ========== Trial 11/25 ==========
[2026-04-16 03:31:44] [Wave4] GP UCB top-5 proposals:
[2026-04-16 03:31:44] UCB=2.3190 mu=1.6261 σ=0.3465 params={'learning_rate': 0.0004453206607211525, 'steps_per_switch': 6345, 'total_timesteps': 89491}
[2026-04-16 03:31:44] UCB=2.1595 mu=1.1793 σ=0.4901 params={'learning_rate': 0.00046964743340115834, 'steps_per_switch': 4776, 'total_timesteps': 89856}
[2026-04-16 03:31:44] UCB=2.0941 mu=0.7922 σ=0.6510 params={'learning_rate': 0.0003410660041988607, 'steps_per_switch': 4166, 'total_timesteps': 89400}
[2026-04-16 03:31:44] UCB=2.0525 mu=0.6500 σ=0.7012 params={'learning_rate': 0.00010060597179209699, 'steps_per_switch': 6926, 'total_timesteps': 89756}
[2026-04-16 03:31:44] UCB=2.0368 mu=0.7629 σ=0.6370 params={'learning_rate': 0.00012830012767488515, 'steps_per_switch': 6077, 'total_timesteps': 88482}
[2026-04-16 03:31:44] [Wave4] Proposed params: {'learning_rate': 0.0004453206607211525, 'steps_per_switch': 6345, 'total_timesteps': 89491}
[2026-04-16 03:31:46] [Wave4] Launching trial 11: {'learning_rate': 0.0004453206607211525, 'steps_per_switch': 6345, 'total_timesteps': 89491}
[2026-04-16 03:31:46] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 89491 --steps-per-switch 6345 --learning-rate 0.0004453206607211525 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0011
[2026-04-16 04:47:26] [Wave4] Trial 11 finished in 4539.4s, rc=0
[2026-04-16 04:47:26] [Wave4] Parsed: combined=85.169 mini_monaco=85.169
[2026-04-16 04:47:26] [Wave4] ===== Trial 11 Summary =====
[2026-04-16 04:47:26] GP data points : 11
[2026-04-16 04:47:26] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 04:47:26] Top 5:
[2026-04-16 04:47:26] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 04:47:26] score=1435.04 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 04:47:26] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 04:47:26] score=141.35 params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 04:47:26] score=116.26 params={'learning_rate': 0.0008478421211922181, 'steps_per_switch': 14326, 'total_timesteps': 89798}
[2026-04-16 04:47:28]
[Wave4] ========== Trial 12/25 ==========
[2026-04-16 04:47:28] [Wave4] GP UCB top-5 proposals:
[2026-04-16 04:47:28] UCB=2.2712 mu=2.0464 σ=0.1124 params={'learning_rate': 0.0008597472207162354, 'steps_per_switch': 6936, 'total_timesteps': 89806}
[2026-04-16 04:47:28] UCB=2.1972 mu=0.3727 σ=0.9123 params={'learning_rate': 0.00011512144682169134, 'steps_per_switch': 7128, 'total_timesteps': 67694}
[2026-04-16 04:47:28] UCB=2.1921 mu=0.3206 σ=0.9357 params={'learning_rate': 0.00011280326159121539, 'steps_per_switch': 3550, 'total_timesteps': 67913}
[2026-04-16 04:47:28] UCB=2.1765 mu=1.4350 σ=0.3708 params={'learning_rate': 0.00048039814596614487, 'steps_per_switch': 14975, 'total_timesteps': 84957}
[2026-04-16 04:47:28] UCB=2.1664 mu=0.3542 σ=0.9061 params={'learning_rate': 0.00022405059432424194, 'steps_per_switch': 5228, 'total_timesteps': 67529}
[2026-04-16 04:47:28] [Wave4] Proposed params: {'learning_rate': 0.0008597472207162354, 'steps_per_switch': 6936, 'total_timesteps': 89806}
[2026-04-16 04:47:30] [Wave4] Launching trial 12: {'learning_rate': 0.0008597472207162354, 'steps_per_switch': 6936, 'total_timesteps': 89806}
[2026-04-16 04:47:30] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 89806 --steps-per-switch 6936 --learning-rate 0.0008597472207162354 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0012
[2026-04-16 06:01:18] [Wave4] Trial 12 finished in 4428.4s, rc=0
[2026-04-16 06:01:18] [Wave4] Parsed: combined=131.9914 mini_monaco=131.9914
[2026-04-16 06:01:18] [Wave4] ===== Trial 12 Summary =====
[2026-04-16 06:01:18] GP data points : 12
[2026-04-16 06:01:18] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 06:01:18] Top 5:
[2026-04-16 06:01:18] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 06:01:18] score=1435.04 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 06:01:18] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 06:01:18] score=141.35 params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 06:01:18] score=131.99 params={'learning_rate': 0.0008597472207162354, 'steps_per_switch': 6936, 'total_timesteps': 89806}
[2026-04-16 06:01:20]
[Wave4] ========== Trial 13/25 ==========
[2026-04-16 06:01:20] [Wave4] GP UCB top-5 proposals:
[2026-04-16 06:01:20] UCB=3.6412 mu=2.1560 σ=0.7426 params={'learning_rate': 0.0019124682919611855, 'steps_per_switch': 3574, 'total_timesteps': 73915}
[2026-04-16 06:01:20] UCB=3.5658 mu=2.1987 σ=0.6835 params={'learning_rate': 0.0018623702999958462, 'steps_per_switch': 4072, 'total_timesteps': 73804}
[2026-04-16 06:01:20] UCB=3.5079 mu=2.0033 σ=0.7523 params={'learning_rate': 0.001647357412665117, 'steps_per_switch': 3844, 'total_timesteps': 72144}
[2026-04-16 06:01:20] UCB=3.4793 mu=2.2411 σ=0.6191 params={'learning_rate': 0.0017259517065612783, 'steps_per_switch': 3332, 'total_timesteps': 76276}
[2026-04-16 06:01:20] UCB=3.4632 mu=1.7809 σ=0.8411 params={'learning_rate': 0.0018817829021946482, 'steps_per_switch': 4363, 'total_timesteps': 70258}
[2026-04-16 06:01:20] [Wave4] Proposed params: {'learning_rate': 0.0019124682919611855, 'steps_per_switch': 3574, 'total_timesteps': 73915}
[2026-04-16 06:01:22] [Wave4] Launching trial 13: {'learning_rate': 0.0019124682919611855, 'steps_per_switch': 3574, 'total_timesteps': 73915}
[2026-04-16 06:01:22] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 73915 --steps-per-switch 3574 --learning-rate 0.0019124682919611855 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0013
[2026-04-16 07:01:28] [Wave4] Trial 13 finished in 3605.7s, rc=0
[2026-04-16 07:01:28] [Wave4] Parsed: combined=86.8189 mini_monaco=86.8189
[2026-04-16 07:01:28] [Wave4] ===== Trial 13 Summary =====
[2026-04-16 07:01:28] GP data points : 13
[2026-04-16 07:01:28] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 07:01:28] Top 5:
[2026-04-16 07:01:28] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 07:01:28] score=1435.04 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 07:01:28] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 07:01:28] score=141.35 params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 07:01:28] score=131.99 params={'learning_rate': 0.0008597472207162354, 'steps_per_switch': 6936, 'total_timesteps': 89806}
[2026-04-16 07:01:30]
[Wave4] ========== Trial 14/25 ==========
[2026-04-16 07:01:30] [Wave4] GP UCB top-5 proposals:
[2026-04-16 07:01:30] UCB=3.4396 mu=1.7713 σ=0.8341 params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 07:01:30] UCB=3.4124 mu=1.7531 σ=0.8296 params={'learning_rate': 0.00020849841344963834, 'steps_per_switch': 6509, 'total_timesteps': 69502}
[2026-04-16 07:01:30] UCB=3.3878 mu=1.7622 σ=0.8128 params={'learning_rate': 0.000483907016003038, 'steps_per_switch': 7464, 'total_timesteps': 70587}
[2026-04-16 07:01:30] UCB=3.3445 mu=1.9911 σ=0.6767 params={'learning_rate': 0.00041395529913323, 'steps_per_switch': 7143, 'total_timesteps': 72719}
[2026-04-16 07:01:30] UCB=3.3380 mu=1.6221 σ=0.8579 params={'learning_rate': 0.0004908056727355321, 'steps_per_switch': 4967, 'total_timesteps': 69146}
[2026-04-16 07:01:30] [Wave4] Proposed params: {'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 07:01:32] [Wave4] Launching trial 14: {'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 07:01:32] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 69359 --steps-per-switch 5448 --learning-rate 0.00033896311195395496 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0014
[2026-04-16 07:54:33] [Wave4] Trial 14 finished in 3181.0s, rc=0
[2026-04-16 07:54:33] [Wave4] Parsed: combined=1573.0703 mini_monaco=1573.0703
[2026-04-16 07:54:33] [Wave4] ===== Trial 14 Summary =====
[2026-04-16 07:54:33] GP data points : 14
[2026-04-16 07:54:33] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 07:54:33] Top 5:
[2026-04-16 07:54:33] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 07:54:33] score=1573.07 params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 07:54:33] score=1435.04 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 07:54:33] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 07:54:33] score=141.35 params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 07:54:35]
[Wave4] ========== Trial 15/25 ==========
[2026-04-16 07:54:35] [Wave4] GP UCB top-5 proposals:
[2026-04-16 07:54:35] UCB=2.8282 mu=1.6797 σ=0.5742 params={'learning_rate': 0.0003994526493374, 'steps_per_switch': 7747, 'total_timesteps': 70559}
[2026-04-16 07:54:35] UCB=2.7924 mu=1.4316 σ=0.6804 params={'learning_rate': 0.0005427837302668084, 'steps_per_switch': 8198, 'total_timesteps': 71093}
[2026-04-16 07:54:35] UCB=2.7732 mu=1.5025 σ=0.6353 params={'learning_rate': 0.000186376586240458, 'steps_per_switch': 7967, 'total_timesteps': 71123}
[2026-04-16 07:54:35] UCB=2.7719 mu=1.2938 σ=0.7390 params={'learning_rate': 0.0002182534990992253, 'steps_per_switch': 8697, 'total_timesteps': 71541}
[2026-04-16 07:54:35] UCB=2.7193 mu=1.1369 σ=0.7912 params={'learning_rate': 0.00011645020756444703, 'steps_per_switch': 8822, 'total_timesteps': 69675}
[2026-04-16 07:54:35] [Wave4] Proposed params: {'learning_rate': 0.0003994526493374, 'steps_per_switch': 7747, 'total_timesteps': 70559}
[2026-04-16 07:54:37] [Wave4] Launching trial 15: {'learning_rate': 0.0003994526493374, 'steps_per_switch': 7747, 'total_timesteps': 70559}
[2026-04-16 07:54:37] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 70559 --steps-per-switch 7747 --learning-rate 0.0003994526493374 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0015
[2026-04-16 08:43:17] [Wave4] Trial 15 finished in 2920.0s, rc=0
[2026-04-16 08:43:17] [Wave4] Parsed: combined=111.2516 mini_monaco=111.2516
[2026-04-16 08:43:17] [Wave4] ===== Trial 15 Summary =====
[2026-04-16 08:43:17] GP data points : 15
[2026-04-16 08:43:17] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 08:43:17] Top 5:
[2026-04-16 08:43:17] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 08:43:17] score=1573.07 params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 08:43:17] score=1435.04 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 08:43:17] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 08:43:17] score=141.35 params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 08:43:19] [Wave4] ✅ Git push complete after trial 15
[2026-04-16 08:43:21] [Wave4] ========== Trial 16/25 ==========
[2026-04-16 08:43:21] [Wave4] GP UCB top-5 proposals:
[2026-04-16 08:43:21] UCB=3.6132 mu=2.7065 σ=0.4533 params={'learning_rate': 0.00040295487555843073, 'steps_per_switch': 3490, 'total_timesteps': 72578}
[2026-04-16 08:43:21] UCB=3.6040 mu=2.6026 σ=0.5007 params={'learning_rate': 0.0004730969154346568, 'steps_per_switch': 3501, 'total_timesteps': 73673}
[2026-04-16 08:43:21] UCB=3.5949 mu=2.3139 σ=0.6405 params={'learning_rate': 0.0007493214757809492, 'steps_per_switch': 3197, 'total_timesteps': 74747}
[2026-04-16 08:43:21] UCB=3.3717 mu=2.1460 σ=0.6128 params={'learning_rate': 0.0008356081352627151, 'steps_per_switch': 3341, 'total_timesteps': 76184}
[2026-04-16 08:43:21] UCB=3.2598 mu=1.9831 σ=0.6384 params={'learning_rate': 0.00015673628608872705, 'steps_per_switch': 3392, 'total_timesteps': 63890}
[2026-04-16 08:43:21] [Wave4] Proposed params: {'learning_rate': 0.00040295487555843073, 'steps_per_switch': 3490, 'total_timesteps': 72578}
[2026-04-16 08:43:23] [Wave4] Launching trial 16: {'learning_rate': 0.00040295487555843073, 'steps_per_switch': 3490, 'total_timesteps': 72578}
[2026-04-16 08:43:23] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 72578 --steps-per-switch 3490 --learning-rate 0.00040295487555843073 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0016
[2026-04-16 09:52:25] [Wave4] Trial 16 finished in 4142.2s, rc=0
[2026-04-16 09:52:25] [Wave4] Parsed: combined=59.9996 mini_monaco=59.9996
[2026-04-16 09:52:25] [Wave4] ===== Trial 16 Summary =====
[2026-04-16 09:52:25] GP data points : 16
[2026-04-16 09:52:25] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 09:52:25] Top 5:
[2026-04-16 09:52:25] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 09:52:25] score=1573.07 params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 09:52:25] score=1435.04 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 09:52:25] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 09:52:25] score=141.35 params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 09:52:27] [Wave4] ========== Trial 17/25 ==========
[2026-04-16 09:52:27] [Wave4] GP UCB top-5 proposals:
[2026-04-16 09:52:27] UCB=4.4175 mu=3.0206 σ=0.6985 params={'learning_rate': 0.00011371306784012085, 'steps_per_switch': 4558, 'total_timesteps': 61409}
[2026-04-16 09:52:27] UCB=3.8442 mu=2.8720 σ=0.4861 params={'learning_rate': 0.00015521967796620274, 'steps_per_switch': 5888, 'total_timesteps': 64258}
[2026-04-16 09:52:27] UCB=3.8027 mu=2.6781 σ=0.5623 params={'learning_rate': 0.0001942087695905749, 'steps_per_switch': 3887, 'total_timesteps': 63994}
[2026-04-16 09:52:27] UCB=3.7728 mu=2.8641 σ=0.4544 params={'learning_rate': 0.00019296340532126325, 'steps_per_switch': 4356, 'total_timesteps': 65008}
[2026-04-16 09:52:27] UCB=3.7728 mu=2.4744 σ=0.6492 params={'learning_rate': 0.0004244302343635027, 'steps_per_switch': 4871, 'total_timesteps': 61656}
[2026-04-16 09:52:27] [Wave4] Proposed params: {'learning_rate': 0.00011371306784012085, 'steps_per_switch': 4558, 'total_timesteps': 61409}
[2026-04-16 09:52:29] [Wave4] Launching trial 17: {'learning_rate': 0.00011371306784012085, 'steps_per_switch': 4558, 'total_timesteps': 61409}
[2026-04-16 09:52:29] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 61409 --steps-per-switch 4558 --learning-rate 0.00011371306784012085 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0017
[2026-04-16 10:54:36] [Wave4] Trial 17 finished in 3727.6s, rc=0
[2026-04-16 10:54:36] [Wave4] Parsed: combined=106.1213 mini_monaco=106.1213
[2026-04-16 10:54:36] [Wave4] ===== Trial 17 Summary =====
[2026-04-16 10:54:36] GP data points : 17
[2026-04-16 10:54:36] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 10:54:36] Top 5:
[2026-04-16 10:54:36] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 10:54:36] score=1573.07 params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 10:54:36] score=1435.04 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 10:54:36] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 10:54:36] score=141.35 params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 10:54:38] [Wave4] ========== Trial 18/25 ==========
[2026-04-16 10:54:38] [Wave4] GP UCB top-5 proposals:
[2026-04-16 10:54:38] UCB=2.4022 mu=1.7625 σ=0.3199 params={'learning_rate': 0.00014739138140971148, 'steps_per_switch': 5692, 'total_timesteps': 73312}
[2026-04-16 10:54:38] UCB=2.1766 mu=1.8937 σ=0.1414 params={'learning_rate': 0.0003092758146459376, 'steps_per_switch': 5733, 'total_timesteps': 72147}
[2026-04-16 10:54:38] UCB=2.0283 mu=1.8693 σ=0.0795 params={'learning_rate': 0.00038035350750885823, 'steps_per_switch': 5423, 'total_timesteps': 70634}
[2026-04-16 10:54:38] UCB=2.0025 mu=0.0985 σ=0.9520 params={'learning_rate': 0.0019422789362836088, 'steps_per_switch': 9820, 'total_timesteps': 71591}
[2026-04-16 10:54:38] UCB=1.9979 mu=1.7539 σ=0.1220 params={'learning_rate': 0.0002958405723304147, 'steps_per_switch': 5704, 'total_timesteps': 67124}
[2026-04-16 10:54:38] [Wave4] Proposed params: {'learning_rate': 0.00014739138140971148, 'steps_per_switch': 5692, 'total_timesteps': 73312}
[2026-04-16 10:54:40] [Wave4] Launching trial 18: {'learning_rate': 0.00014739138140971148, 'steps_per_switch': 5692, 'total_timesteps': 73312}
[2026-04-16 10:54:40] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 73312 --steps-per-switch 5692 --learning-rate 0.00014739138140971148 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0018
[2026-04-16 11:58:07] [Wave4] Trial 18 finished in 3806.9s, rc=0
[2026-04-16 11:58:07] [Wave4] Parsed: combined=115.6774 mini_monaco=115.6774
[2026-04-16 11:58:07] [Wave4] ===== Trial 18 Summary =====
[2026-04-16 11:58:07] GP data points : 18
[2026-04-16 11:58:07] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 11:58:07] Top 5:
[2026-04-16 11:58:07] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 11:58:07] score=1573.07 params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 11:58:07] score=1435.04 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 11:58:07] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 11:58:07] score=141.35 params={'learning_rate': 0.0010578566658808422, 'steps_per_switch': 4587, 'total_timesteps': 89501}
[2026-04-16 11:58:09] [Wave4] ========== Trial 19/25 ==========
[2026-04-16 11:58:09] [Wave4] GP UCB top-5 proposals:
[2026-04-16 11:58:09] UCB=4.2331 mu=3.1539 σ=0.5396 params={'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 11:58:09] UCB=3.9367 mu=2.3643 σ=0.7862 params={'learning_rate': 0.000973356801504352, 'steps_per_switch': 4731, 'total_timesteps': 64636}
[2026-04-16 11:58:09] UCB=3.8093 mu=2.3419 σ=0.7337 params={'learning_rate': 0.0009048073949917447, 'steps_per_switch': 4137, 'total_timesteps': 65312}
[2026-04-16 11:58:09] UCB=3.7049 mu=2.1579 σ=0.7735 params={'learning_rate': 0.0008513107736880863, 'steps_per_switch': 6388, 'total_timesteps': 62234}
[2026-04-16 11:58:09] UCB=3.6513 mu=2.2696 σ=0.6908 params={'learning_rate': 0.0006744614059620855, 'steps_per_switch': 4569, 'total_timesteps': 60715}
[2026-04-16 11:58:09] [Wave4] Proposed params: {'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 11:58:11] [Wave4] Launching trial 19: {'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 11:58:11] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 64179 --steps-per-switch 4747 --learning-rate 0.0006672844816013197 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0019
[2026-04-16 13:10:23] [Wave4] Trial 19 finished in 4331.6s, rc=0
[2026-04-16 13:10:23] [Wave4] Parsed: combined=230.9827 mini_monaco=230.9827
[2026-04-16 13:10:23] [Wave4] ===== Trial 19 Summary =====
[2026-04-16 13:10:23] GP data points : 19
[2026-04-16 13:10:23] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 13:10:23] Top 5:
[2026-04-16 13:10:23] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 13:10:23] score=1573.07 params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 13:10:23] score=1435.04 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 13:10:23] score=230.98 params={'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 13:10:23] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 13:10:25] [Wave4] ========== Trial 20/25 ==========
[2026-04-16 13:10:25] [Wave4] GP UCB top-5 proposals:
[2026-04-16 13:10:25] UCB=2.2341 mu=1.6242 σ=0.3049 params={'learning_rate': 0.0007104769870597334, 'steps_per_switch': 5822, 'total_timesteps': 73160}
[2026-04-16 13:10:25] UCB=2.2038 mu=1.8331 σ=0.1853 params={'learning_rate': 0.000688892492740156, 'steps_per_switch': 5559, 'total_timesteps': 78911}
[2026-04-16 13:10:25] UCB=2.2000 mu=1.6830 σ=0.2585 params={'learning_rate': 0.0006312216002702526, 'steps_per_switch': 7125, 'total_timesteps': 80150}
[2026-04-16 13:10:25] UCB=2.1950 mu=1.5842 σ=0.3054 params={'learning_rate': 0.0009544613332763077, 'steps_per_switch': 3097, 'total_timesteps': 89206}
[2026-04-16 13:10:25] UCB=2.0924 mu=1.5207 σ=0.2859 params={'learning_rate': 0.0006628882948205277, 'steps_per_switch': 6497, 'total_timesteps': 73747}
[2026-04-16 13:10:25] [Wave4] Proposed params: {'learning_rate': 0.0007104769870597334, 'steps_per_switch': 5822, 'total_timesteps': 73160}
[2026-04-16 13:10:27] [Wave4] Launching trial 20: {'learning_rate': 0.0007104769870597334, 'steps_per_switch': 5822, 'total_timesteps': 73160}
[2026-04-16 13:10:27] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 73160 --steps-per-switch 5822 --learning-rate 0.0007104769870597334 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0020
[2026-04-16 14:10:06] [Wave4] Trial 20 finished in 3578.9s, rc=0
[2026-04-16 14:10:06] [Wave4] Parsed: combined=20.9587 mini_monaco=20.9587
[2026-04-16 14:10:06] [Wave4] ===== Trial 20 Summary =====
[2026-04-16 14:10:06] GP data points : 20
[2026-04-16 14:10:06] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 14:10:06] Top 5:
[2026-04-16 14:10:06] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 14:10:06] score=1573.07 params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 14:10:06] score=1435.04 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 14:10:06] score=230.98 params={'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 14:10:06] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 14:10:08] [Wave4] ✅ Git push complete after trial 20
[2026-04-16 14:10:10] [Wave4] ========== Trial 21/25 ==========
[2026-04-16 14:10:10] [Wave4] GP UCB top-5 proposals:
[2026-04-16 14:10:10] UCB=2.6140 mu=2.2621 σ=0.1760 params={'learning_rate': 0.0006240263317913531, 'steps_per_switch': 6248, 'total_timesteps': 81430}
[2026-04-16 14:10:10] UCB=2.4442 mu=2.0635 σ=0.1903 params={'learning_rate': 0.0008059149094586216, 'steps_per_switch': 5668, 'total_timesteps': 84976}
[2026-04-16 14:10:10] UCB=2.4147 mu=2.0379 σ=0.1884 params={'learning_rate': 0.0005106809671527766, 'steps_per_switch': 7044, 'total_timesteps': 86259}
[2026-04-16 14:10:10] UCB=2.2869 mu=1.1164 σ=0.5852 params={'learning_rate': 0.001932234805776913, 'steps_per_switch': 3270, 'total_timesteps': 81124}
[2026-04-16 14:10:10] UCB=2.1881 mu=1.7486 σ=0.2198 params={'learning_rate': 0.0006026876192105071, 'steps_per_switch': 14736, 'total_timesteps': 86970}
[2026-04-16 14:10:10] [Wave4] Proposed params: {'learning_rate': 0.0006240263317913531, 'steps_per_switch': 6248, 'total_timesteps': 81430}
[2026-04-16 14:10:12] [Wave4] Launching trial 21: {'learning_rate': 0.0006240263317913531, 'steps_per_switch': 6248, 'total_timesteps': 81430}
[2026-04-16 14:10:12] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 81430 --steps-per-switch 6248 --learning-rate 0.0006240263317913531 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0021
[2026-04-16 15:41:17] [Wave4] Trial 21 finished in 5465.6s, rc=0
[2026-04-16 15:41:17] [Wave4] Parsed: combined=86.1012 mini_monaco=86.1012
[2026-04-16 15:41:17] [Wave4] ===== Trial 21 Summary =====
[2026-04-16 15:41:17] GP data points : 21
[2026-04-16 15:41:17] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 15:41:17] Top 5:
[2026-04-16 15:41:17] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 15:41:17] score=1573.07 params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 15:41:17] score=1435.04 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 15:41:17] score=230.98 params={'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 15:41:17] score=176.67 params={'learning_rate': 0.0016223486895735558, 'steps_per_switch': 5524, 'total_timesteps': 79532}
[2026-04-16 15:41:19] [Wave4] ========== Trial 22/25 ==========
[2026-04-16 15:41:19] [Wave4] GP UCB top-5 proposals:
[2026-04-16 15:41:19] UCB=2.9276 mu=1.9840 σ=0.4718 params={'learning_rate': 0.0010680884757571496, 'steps_per_switch': 3269, 'total_timesteps': 81912}
[2026-04-16 15:41:19] UCB=2.8979 mu=1.8081 σ=0.5449 params={'learning_rate': 0.0014475574559088072, 'steps_per_switch': 3057, 'total_timesteps': 77220}
[2026-04-16 15:41:19] UCB=2.7394 mu=1.7758 σ=0.4818 params={'learning_rate': 0.0010706637693188694, 'steps_per_switch': 3430, 'total_timesteps': 79924}
[2026-04-16 15:41:19] UCB=2.7361 mu=1.9188 σ=0.4086 params={'learning_rate': 0.0003137950754813831, 'steps_per_switch': 7355, 'total_timesteps': 63965}
[2026-04-16 15:41:19] UCB=2.6672 mu=1.6715 σ=0.4978 params={'learning_rate': 0.0013764156246110833, 'steps_per_switch': 3205, 'total_timesteps': 83531}
[2026-04-16 15:41:19] [Wave4] Proposed params: {'learning_rate': 0.0010680884757571496, 'steps_per_switch': 3269, 'total_timesteps': 81912}
[2026-04-16 15:41:21] [Wave4] Launching trial 22: {'learning_rate': 0.0010680884757571496, 'steps_per_switch': 3269, 'total_timesteps': 81912}
[2026-04-16 15:41:21] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 81912 --steps-per-switch 3269 --learning-rate 0.0010680884757571496 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0022
[2026-04-16 17:13:22] [Wave4] Trial 22 finished in 5520.7s, rc=0
[2026-04-16 17:13:22] [Wave4] Parsed: combined=193.2341 mini_monaco=193.2341
[2026-04-16 17:13:22] [Wave4] ===== Trial 22 Summary =====
[2026-04-16 17:13:22] GP data points : 22
[2026-04-16 17:13:22] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 17:13:22] Top 5:
[2026-04-16 17:13:22] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 17:13:22] score=1573.07 params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 17:13:22] score=1435.04 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 17:13:22] score=230.98 params={'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 17:13:22] score=193.23 params={'learning_rate': 0.0010680884757571496, 'steps_per_switch': 3269, 'total_timesteps': 81912}
[2026-04-16 17:13:24] [Wave4] ========== Trial 23/25 ==========
[2026-04-16 17:13:24] [Wave4] GP UCB top-5 proposals:
[2026-04-16 17:13:24] UCB=2.5284 mu=2.2838 σ=0.1223 params={'learning_rate': 0.0003194840316570061, 'steps_per_switch': 4507, 'total_timesteps': 66965}
[2026-04-16 17:13:24] UCB=2.4192 mu=1.9135 σ=0.2529 params={'learning_rate': 0.0005202514734965685, 'steps_per_switch': 6358, 'total_timesteps': 64937}
[2026-04-16 17:13:24] UCB=2.3057 mu=2.0781 σ=0.1138 params={'learning_rate': 0.00028748663178259926, 'steps_per_switch': 4629, 'total_timesteps': 64690}
[2026-04-16 17:13:24] UCB=2.2310 mu=1.9780 σ=0.1265 params={'learning_rate': 0.0005977134566763185, 'steps_per_switch': 7805, 'total_timesteps': 89342}
[2026-04-16 17:13:24] UCB=2.1094 mu=1.5506 σ=0.2794 params={'learning_rate': 0.0001403862307566014, 'steps_per_switch': 6141, 'total_timesteps': 64341}
[2026-04-16 17:13:24] [Wave4] Proposed params: {'learning_rate': 0.0003194840316570061, 'steps_per_switch': 4507, 'total_timesteps': 66965}
[2026-04-16 17:13:26] [Wave4] Launching trial 23: {'learning_rate': 0.0003194840316570061, 'steps_per_switch': 4507, 'total_timesteps': 66965}
[2026-04-16 17:13:26] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 66965 --steps-per-switch 4507 --learning-rate 0.0003194840316570061 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0023
[2026-04-16 18:26:11] [Wave4] Trial 23 finished in 4365.0s, rc=0
[2026-04-16 18:26:11] [Wave4] Parsed: combined=150.773 mini_monaco=150.773
[2026-04-16 18:26:11] [Wave4] ===== Trial 23 Summary =====
[2026-04-16 18:26:11] GP data points : 23
[2026-04-16 18:26:11] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 18:26:11] Top 5:
[2026-04-16 18:26:11] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 18:26:11] score=1573.07 params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 18:26:11] score=1435.04 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 18:26:11] score=230.98 params={'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 18:26:11] score=193.23 params={'learning_rate': 0.0010680884757571496, 'steps_per_switch': 3269, 'total_timesteps': 81912}
[2026-04-16 18:26:13] [Wave4] ========== Trial 24/25 ==========
[2026-04-16 18:26:13] [Wave4] GP UCB top-5 proposals:
[2026-04-16 18:26:13] UCB=7.5247 mu=6.2951 σ=0.6148 params={'learning_rate': 0.0003596012644521025, 'steps_per_switch': 8013, 'total_timesteps': 60054}
[2026-04-16 18:26:13] UCB=6.0685 mu=4.6817 σ=0.6934 params={'learning_rate': 0.0001643952288780874, 'steps_per_switch': 9023, 'total_timesteps': 61369}
[2026-04-16 18:26:13] UCB=5.8942 mu=5.3342 σ=0.2800 params={'learning_rate': 0.00036103199241236257, 'steps_per_switch': 6908, 'total_timesteps': 64716}
[2026-04-16 18:26:13] UCB=5.6180 mu=4.5299 σ=0.5441 params={'learning_rate': 0.0006214434613010804, 'steps_per_switch': 7433, 'total_timesteps': 61101}
[2026-04-16 18:26:13] UCB=5.2906 mu=4.8505 σ=0.2200 params={'learning_rate': 0.000363303441736895, 'steps_per_switch': 6643, 'total_timesteps': 65509}
[2026-04-16 18:26:13] [Wave4] Proposed params: {'learning_rate': 0.0003596012644521025, 'steps_per_switch': 8013, 'total_timesteps': 60054}
[2026-04-16 18:26:15] [Wave4] Launching trial 24: {'learning_rate': 0.0003596012644521025, 'steps_per_switch': 8013, 'total_timesteps': 60054}
[2026-04-16 18:26:15] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 60054 --steps-per-switch 8013 --learning-rate 0.0003596012644521025 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0024
[2026-04-16 19:09:39] [Wave4] Trial 24 finished in 2604.0s, rc=0
[2026-04-16 19:09:39] [Wave4] Parsed: combined=37.824 mini_monaco=37.824
[2026-04-16 19:09:39] [Wave4] ===== Trial 24 Summary =====
[2026-04-16 19:09:39] GP data points : 24
[2026-04-16 19:09:39] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 19:09:39] Top 5:
[2026-04-16 19:09:39] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 19:09:39] score=1573.07 params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 19:09:39] score=1435.04 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 19:09:39] score=230.98 params={'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 19:09:39] score=193.23 params={'learning_rate': 0.0010680884757571496, 'steps_per_switch': 3269, 'total_timesteps': 81912}
[2026-04-16 19:09:41] [Wave4] ========== Trial 25/25 ==========
[2026-04-16 19:09:41] [Wave4] GP UCB top-5 proposals:
[2026-04-16 19:09:41] UCB=2.8514 mu=2.5523 σ=0.1495 params={'learning_rate': 0.0003128257557719074, 'steps_per_switch': 6836, 'total_timesteps': 62683}
[2026-04-16 19:09:41] UCB=2.6831 mu=2.3606 σ=0.1612 params={'learning_rate': 0.0004069124594262667, 'steps_per_switch': 6035, 'total_timesteps': 62228}
[2026-04-16 19:09:41] UCB=2.5561 mu=2.1065 σ=0.2248 params={'learning_rate': 0.00011678403675177319, 'steps_per_switch': 6295, 'total_timesteps': 60168}
[2026-04-16 19:09:41] UCB=2.4849 mu=2.1243 σ=0.1803 params={'learning_rate': 0.0002227611562953955, 'steps_per_switch': 6484, 'total_timesteps': 60292}
[2026-04-16 19:09:41] UCB=2.3964 mu=2.0203 σ=0.1880 params={'learning_rate': 0.0004890619502018291, 'steps_per_switch': 6956, 'total_timesteps': 63892}
[2026-04-16 19:09:41] [Wave4] Proposed params: {'learning_rate': 0.0003128257557719074, 'steps_per_switch': 6836, 'total_timesteps': 62683}
[2026-04-16 19:09:43] [Wave4] Launching trial 25: {'learning_rate': 0.0003128257557719074, 'steps_per_switch': 6836, 'total_timesteps': 62683}
[2026-04-16 19:09:43] [Wave4] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 62683 --steps-per-switch 6836 --learning-rate 0.0003128257557719074 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave4-trial-0025
[2026-04-16 20:01:51] [Wave4] Trial 25 finished in 3128.1s, rc=0
[2026-04-16 20:01:51] [Wave4] Parsed: combined=1542.9971 mini_monaco=1542.9971
[2026-04-16 20:01:51] [Wave4] ===== Trial 25 Summary =====
[2026-04-16 20:01:51] GP data points : 25
[2026-04-16 20:01:51] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 20:01:51] Top 5:
[2026-04-16 20:01:51] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 20:01:51] score=1573.07 params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 20:01:51] score=1543.00 params={'learning_rate': 0.0003128257557719074, 'steps_per_switch': 6836, 'total_timesteps': 62683}
[2026-04-16 20:01:51] score=1435.04 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 20:01:51] score=230.98 params={'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 20:01:53] [Wave4] ✅ Git push complete after trial 25
[2026-04-16 20:01:55] [Wave4] ===== All 25 trials complete! =====
[2026-04-16 20:01:55] [Wave4] ===== Trial 25 Summary =====
[2026-04-16 20:01:55] GP data points : 25
[2026-04-16 20:01:55] Wave4 Champion: trial=3 score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 157743}
[2026-04-16 20:01:55] Top 5:
[2026-04-16 20:01:55] score=1943.10 params={'learning_rate': 0.0006852550685205609, 'steps_per_switch': 17499, 'total_timesteps': 90000}
[2026-04-16 20:01:55] score=1573.07 params={'learning_rate': 0.00033896311195395496, 'steps_per_switch': 5448, 'total_timesteps': 69359}
[2026-04-16 20:01:55] score=1543.00 params={'learning_rate': 0.0003128257557719074, 'steps_per_switch': 6836, 'total_timesteps': 62683}
[2026-04-16 20:01:55] score=1435.04 params={'learning_rate': 0.0007252855740444645, 'steps_per_switch': 6851, 'total_timesteps': 89893}
[2026-04-16 20:01:55] score=230.98 params={'learning_rate': 0.0006672844816013197, 'steps_per_switch': 4747, 'total_timesteps': 64179}
[2026-04-16 20:01:56] [Wave4] ✅ Git push complete after trial 25