[2026-04-14 12:44:38] [Wave3] Seed trial 1/2: using hardcoded params. [2026-04-14 12:44:38] [Wave3] Seed trial 2/2: using hardcoded params. [2026-04-14 12:44:38] [Wave3] Only 0 results — using random proposal. [2026-04-14 12:44:38] [Champion] 🏆 NEW BEST! Trial 3: combined=1500.00 (mini_monaco=900.0, warren=600.0) params={'learning_rate': 0.0002, 'steps_per_switch': 8000, 'total_timesteps': 150000} [2026-04-14 12:44:38] [Champion] 🏆 NEW BEST! Trial 1: combined=2000.00 (mini_monaco=1200.0, warren=800.0) params={} [2026-04-14 12:45:00] [Wave3] Seed trial 1/2: using hardcoded params. [2026-04-14 12:45:00] [Wave3] Seed trial 2/2: using hardcoded params. [2026-04-14 12:45:00] [Wave3] Only 0 results — using random proposal. [2026-04-14 12:45:00] [Champion] 🏆 NEW BEST! Trial 3: combined=1500.00 (mini_monaco=900.0, warren=600.0) params={'learning_rate': 0.0002, 'steps_per_switch': 8000, 'total_timesteps': 150000} [2026-04-14 12:45:00] [Champion] 🏆 NEW BEST! Trial 1: combined=2000.00 (mini_monaco=1200.0, warren=800.0) params={} [2026-04-14 12:45:27] [Wave3] Seed trial 1/2: using hardcoded params. [2026-04-14 12:45:27] [Wave3] Seed trial 2/2: using hardcoded params. [2026-04-14 12:45:27] [Wave3] Only 0 results — using random proposal. [2026-04-14 12:45:27] [Champion] 🏆 NEW BEST! Trial 3: combined=1500.00 (mini_monaco=900.0, warren=600.0) params={'learning_rate': 0.0002, 'steps_per_switch': 8000, 'total_timesteps': 150000} [2026-04-14 12:45:27] [Champion] 🏆 NEW BEST! Trial 1: combined=2000.00 (mini_monaco=1200.0, warren=800.0) params={} [2026-04-14 12:45:39] [Wave3] Seed trial 1/2: using hardcoded params. [2026-04-14 12:45:39] [Wave3] Seed trial 2/2: using hardcoded params. [2026-04-14 12:45:39] [Wave3] Only 0 results — using random proposal. [2026-04-14 12:45:39] [Champion] 🏆 NEW BEST! Trial 3: combined=1500.00 (mini_monaco=900.0, warren=600.0) params={'learning_rate': 0.0002, 'steps_per_switch': 8000, 'total_timesteps': 150000} [2026-04-14 12:45:39] [Champion] 🏆 NEW BEST! Trial 1: combined=2000.00 (mini_monaco=1200.0, warren=800.0) params={} [2026-04-14 12:47:25] ================================================================= [2026-04-14 12:47:25] [Wave3] Multi-Track Autoresearch — GP+UCB Generalization Search [2026-04-14 12:47:25] [Wave3] Training tracks : generated_road, generated_track, mountain_track [2026-04-14 12:47:25] [Wave3] Test tracks : mini_monaco, warren (zero-shot) [2026-04-14 12:47:25] [Wave3] Max trials : 25 | kappa=2.0 | push every 5 [2026-04-14 12:47:25] [Wave3] Results file : /home/paulh/projects/donkeycar-rl-autoresearch/agent/outerloop-results/autoresearch_results_phase3.jsonl [2026-04-14 12:47:25] [Wave3] Champion dir : /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave3-champion [2026-04-14 12:47:25] [Wave3] Warm start : /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/champion/model.zip [2026-04-14 12:47:25] ================================================================= [2026-04-14 12:47:25] [Wave3] Loaded 0 existing Phase 3 results. [2026-04-14 12:47:25] [Wave3] No Wave 3 champion yet. [2026-04-14 12:47:25] [Wave3] Starting from trial 1. [2026-04-14 12:47:25] [Wave3] ========== Trial 1/25 ========== [2026-04-14 12:47:25] [Wave3] Seed trial 1/2: using hardcoded params. [2026-04-14 12:47:25] [Wave3] Proposed params: {'learning_rate': 0.000225, 'steps_per_switch': 10000, 'total_timesteps': 150000} [2026-04-14 12:47:27] [Wave3] Launching trial 1: {'learning_rate': 0.000225, 'steps_per_switch': 10000, 'total_timesteps': 150000} [2026-04-14 12:47:27] [Wave3] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 150000 --steps-per-switch 10000 --learning-rate 0.000225 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave3-trial-0001 --warm-start /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/champion/model.zip [2026-04-14 13:28:47] [Wave3] Seed trial 1/2: using hardcoded params. [2026-04-14 13:28:47] [Wave3] Seed trial 2/2: using hardcoded params. [2026-04-14 13:28:47] [Wave3] Only 0 results — using random proposal. [2026-04-14 13:28:47] [Champion] 🏆 NEW BEST! Trial 3: combined=1500.00 (mini_monaco=900.0, warren=600.0) params={'learning_rate': 0.0002, 'steps_per_switch': 8000, 'total_timesteps': 150000} [2026-04-14 13:28:47] [Champion] 🏆 NEW BEST! Trial 1: combined=2000.00 (mini_monaco=1200.0, warren=800.0) params={} [2026-04-14 13:29:08] [Wave3] Seed trial 1/2: using hardcoded params. [2026-04-14 13:29:08] [Wave3] Seed trial 2/2: using hardcoded params. [2026-04-14 13:29:08] [Wave3] Only 0 results — using random proposal. [2026-04-14 13:29:08] [Champion] 🏆 NEW BEST! Trial 3: combined=1500.00 (mini_monaco=900.0, warren=600.0) params={'learning_rate': 0.0002, 'steps_per_switch': 8000, 'total_timesteps': 150000} [2026-04-14 13:29:08] [Champion] 🏆 NEW BEST! Trial 1: combined=2000.00 (mini_monaco=1200.0, warren=800.0) params={} [2026-04-14 13:29:34] [Wave3] Seed trial 1/2: using hardcoded params. [2026-04-14 13:29:34] [Wave3] Seed trial 2/2: using hardcoded params. [2026-04-14 13:29:34] [Wave3] Only 0 results — using random proposal. [2026-04-14 13:29:34] [Champion] 🏆 NEW BEST! Trial 3: combined=1500.00 (mini_monaco=900.0, warren=600.0) params={'learning_rate': 0.0002, 'steps_per_switch': 8000, 'total_timesteps': 150000} [2026-04-14 13:29:34] [Champion] 🏆 NEW BEST! Trial 1: combined=2000.00 (mini_monaco=1200.0, warren=800.0) params={} [2026-04-14 13:36:58] ================================================================= [2026-04-14 13:36:58] [Wave3] Multi-Track Autoresearch — GP+UCB Generalization Search [2026-04-14 13:36:58] [Wave3] Training tracks : generated_road, generated_track, mountain_track [2026-04-14 13:36:58] [Wave3] Test tracks : mini_monaco, warren (zero-shot) [2026-04-14 13:36:58] [Wave3] Max trials : 25 | kappa=2.0 | push every 5 [2026-04-14 13:36:58] [Wave3] Results file : /home/paulh/projects/donkeycar-rl-autoresearch/agent/outerloop-results/autoresearch_results_phase3.jsonl [2026-04-14 13:36:58] [Wave3] Champion dir : /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave3-champion [2026-04-14 13:36:58] [Wave3] Warm start : /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/champion/model.zip [2026-04-14 13:36:58] ================================================================= [2026-04-14 13:36:58] [Wave3] Loaded 0 existing Phase 3 results. [2026-04-14 13:36:58] [Wave3] No Wave 3 champion yet. [2026-04-14 13:36:58] [Wave3] Starting from trial 1. [2026-04-14 13:36:58] [Wave3] ========== Trial 1/25 ========== [2026-04-14 13:36:58] [Wave3] Seed trial 1/2: using hardcoded params. [2026-04-14 13:36:58] [Wave3] Proposed params: {'learning_rate': 0.000225, 'steps_per_switch': 5000, 'total_timesteps': 45000} [2026-04-14 13:37:00] [Wave3] Launching trial 1: {'learning_rate': 0.000225, 'steps_per_switch': 5000, 'total_timesteps': 45000} [2026-04-14 13:37:00] [Wave3] Command: python3 /home/paulh/projects/donkeycar-rl-autoresearch/agent/multitrack_runner.py --total-timesteps 45000 --steps-per-switch 5000 --learning-rate 0.000225 --eval-episodes 3 --save-dir /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/wave3-trial-0001 --warm-start /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/champion/model.zip [2026-04-14 13:47:17] [Wave3] Seed trial 1/2: using hardcoded params. [2026-04-14 13:47:17] [Wave3] Seed trial 2/2: using hardcoded params. [2026-04-14 13:47:17] [Wave3] Only 0 results — using random proposal. [2026-04-14 13:47:17] [Champion] 🏆 NEW BEST! Trial 3: score=1500.00 (mini_monaco=1500.0) params={'learning_rate': 0.0002, 'steps_per_switch': 8000, 'total_timesteps': 150000} [2026-04-14 13:47:17] [Champion] 🏆 NEW BEST! Trial 1: score=2000.00 (mini_monaco=2000.0) params={}