6353 lines
694 KiB
Plaintext
[2026-04-13 00:52:06] ============================================================
[2026-04-13 00:52:06] [AutoResearch] Starting Karpathy-style autoresearch controller
[2026-04-13 00:52:06] [AutoResearch] Max trials: 100
[2026-04-13 00:52:06] [AutoResearch] Runner: /home/paulh/projects/donkeycar-rl-autoresearch/agent/donkeycar_sb3_runner.py
[2026-04-13 00:52:06] [AutoResearch] Results: /home/paulh/projects/donkeycar-rl-autoresearch/agent/outerloop-results/autoresearch_results.jsonl
[2026-04-13 00:52:06] ============================================================
[2026-04-13 00:52:06] [AutoResearch] Loaded 18 existing result(s) from base sweep + history.
[2026-04-13 00:52:06] [AutoResearch] === Trial 0 Summary ===
[2026-04-13 00:52:06] Total runs in history: 18
[2026-04-13 00:52:06] Best so far: mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:06] Top 5 results:
[2026-04-13 00:52:06] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:06] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:06] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:06] mean_reward=80.3866 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:06] mean_reward=78.3455 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:06]
[AutoResearch] ========== Trial 1/100 ==========
[2026-04-13 00:52:06] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:52:06] UCB=8.7366 mu=7.1484 sigma=0.7941 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0031591822946350732}
[2026-04-13 00:52:06] UCB=7.6154 mu=5.7877 sigma=0.9138 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0036742723050532423}
[2026-04-13 00:52:06] UCB=7.1218 mu=5.6920 sigma=0.7149 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0028974485260930445}
[2026-04-13 00:52:06] UCB=6.9354 mu=6.2459 sigma=0.3448 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001985059980937195}
[2026-04-13 00:52:06] UCB=6.6277 mu=5.4057 sigma=0.6110 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0025710701837463484}
[2026-04-13 00:52:06] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0031591822946350732, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:08] [AutoResearch] Launching job: n_steer=7 n_throttle=2 lr=0.003159
[2026-04-13 00:52:17] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 00:52:17] [AutoResearch] mean_reward=73.8366
[2026-04-13 00:52:17] [AutoResearch] === Trial 1 Summary ===
[2026-04-13 00:52:17] Total runs in history: 19
[2026-04-13 00:52:17] Best so far: mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:17] Top 5 results:
[2026-04-13 00:52:17] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:17] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:17] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:17] mean_reward=80.3866 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:17] mean_reward=78.3455 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:19]
[AutoResearch] ========== Trial 2/100 ==========
[2026-04-13 00:52:19] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:52:19] UCB=5.6112 mu=5.0878 sigma=0.2617 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0023408725669147915}
[2026-04-13 00:52:19] UCB=4.9874 mu=4.4092 sigma=0.2891 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0022102250532624844}
[2026-04-13 00:52:19] UCB=3.9952 mu=3.4419 sigma=0.2766 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.00176287499480802}
[2026-04-13 00:52:19] UCB=3.9669 mu=3.2040 sigma=0.3814 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0016860184468568981}
[2026-04-13 00:52:19] UCB=3.9142 mu=3.6874 sigma=0.1134 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0014103284483475934}
[2026-04-13 00:52:19] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0023408725669147915, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:21] [AutoResearch] Launching job: n_steer=6 n_throttle=2 lr=0.002341
[2026-04-13 00:52:30] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 00:52:30] [AutoResearch] mean_reward=57.5366
[2026-04-13 00:52:30] [AutoResearch] === Trial 2 Summary ===
[2026-04-13 00:52:30] Total runs in history: 20
[2026-04-13 00:52:30] Best so far: mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:30] Top 5 results:
[2026-04-13 00:52:30] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:30] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:30] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:30] mean_reward=80.3866 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:30] mean_reward=78.3455 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:32]
[AutoResearch] ========== Trial 3/100 ==========
[2026-04-13 00:52:32] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:52:32] UCB=7.9212 mu=6.5277 sigma=0.6968 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.002908900986021436}
[2026-04-13 00:52:32] UCB=6.7426 mu=5.2492 sigma=0.7467 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.0028206246948325}
[2026-04-13 00:52:32] UCB=6.5376 mu=4.9713 sigma=0.7832 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.0024441281283003047}
[2026-04-13 00:52:32] UCB=6.1941 mu=4.8346 sigma=0.6797 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.0035309415160584188}
[2026-04-13 00:52:32] UCB=6.1547 mu=5.0344 sigma=0.5602 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0013908191204546352}
[2026-04-13 00:52:32] [AutoResearch] Proposed params: {'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.002908900986021436, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:34] [AutoResearch] Launching job: n_steer=9 n_throttle=2 lr=0.002909
[2026-04-13 00:52:42] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 00:52:42] [AutoResearch] mean_reward=64.4771
[2026-04-13 00:52:42] [AutoResearch] === Trial 3 Summary ===
[2026-04-13 00:52:42] Total runs in history: 21
[2026-04-13 00:52:42] Best so far: mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:42] Top 5 results:
[2026-04-13 00:52:42] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:42] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:42] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:42] mean_reward=80.3866 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:42] mean_reward=78.3455 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:44]
[AutoResearch] ========== Trial 4/100 ==========
[2026-04-13 00:52:44] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:52:44] UCB=4.5785 mu=3.8905 sigma=0.3440 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803}
[2026-04-13 00:52:44] UCB=3.9110 mu=3.2788 sigma=0.3161 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.002340933398266476}
[2026-04-13 00:52:44] UCB=3.9031 mu=2.9561 sigma=0.4735 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.001545455598480437}
[2026-04-13 00:52:44] UCB=3.5572 mu=2.5132 sigma=0.5220 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0013558415362026459}
[2026-04-13 00:52:44] UCB=3.3774 mu=2.4958 sigma=0.4408 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0018737024764719805}
[2026-04-13 00:52:44] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:46] [AutoResearch] Launching job: n_steer=8 n_throttle=2 lr=0.001597
[2026-04-13 00:52:55] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 00:52:55] [AutoResearch] mean_reward=88.3092
[2026-04-13 00:52:55] [AutoResearch] === Trial 4 Summary ===
[2026-04-13 00:52:55] Total runs in history: 22
[2026-04-13 00:52:55] Best so far: mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:55] Top 5 results:
[2026-04-13 00:52:55] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:55] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:55] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:55] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:55] mean_reward=80.3866 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:57]
[AutoResearch] ========== Trial 5/100 ==========
[2026-04-13 00:52:57] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:52:57] UCB=3.3471 mu=3.1082 sigma=0.1194 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0016381928160972385}
[2026-04-13 00:52:57] UCB=3.0414 mu=1.7214 sigma=0.6600 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00426621333116507}
[2026-04-13 00:52:57] UCB=3.0086 mu=1.3566 sigma=0.8260 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.004619910176822425}
[2026-04-13 00:52:57] UCB=2.9987 mu=1.7205 sigma=0.6391 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.0042748160163195}
[2026-04-13 00:52:57] UCB=2.7975 mu=1.1672 sigma=0.8151 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.004445617404399141}
[2026-04-13 00:52:57] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0016381928160972385, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:52:59] [AutoResearch] Launching job: n_steer=7 n_throttle=2 lr=0.001638
[2026-04-13 00:53:08] [AutoResearch] Job finished in 8.2s, returncode=0
[2026-04-13 00:53:08] [AutoResearch] mean_reward=44.8118
[2026-04-13 00:53:08] [AutoResearch] === Trial 5 Summary ===
[2026-04-13 00:53:08] Total runs in history: 23
[2026-04-13 00:53:08] Best so far: mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:08] Top 5 results:
[2026-04-13 00:53:08] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:08] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:08] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:08] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:08] mean_reward=80.3866 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:10]
[AutoResearch] ========== Trial 6/100 ==========
[2026-04-13 00:53:10] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:53:10] UCB=10.5093 mu=8.9110 sigma=0.7992 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.004851311454386098}
[2026-04-13 00:53:10] UCB=10.4182 mu=9.3234 sigma=0.5474 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.004366689527801074}
[2026-04-13 00:53:10] UCB=9.4028 mu=7.7158 sigma=0.8435 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.004842752726812958}
[2026-04-13 00:53:10] UCB=9.3026 mu=7.6597 sigma=0.8215 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.004751507901539082}
[2026-04-13 00:53:10] UCB=9.2416 mu=8.2576 sigma=0.4920 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.004081146093840212}
[2026-04-13 00:53:10] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.004851311454386098, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:12] [AutoResearch] Launching job: n_steer=6 n_throttle=2 lr=0.004851
[2026-04-13 00:53:19] [AutoResearch] Job finished in 7.9s, returncode=0
[2026-04-13 00:53:19] [AutoResearch] mean_reward=46.5373
[2026-04-13 00:53:19] [AutoResearch] === Trial 6 Summary ===
[2026-04-13 00:53:19] Total runs in history: 24
[2026-04-13 00:53:19] Best so far: mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:19] Top 5 results:
[2026-04-13 00:53:19] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:19] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:19] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:19] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:19] mean_reward=80.3866 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:21]
[AutoResearch] ========== Trial 7/100 ==========
[2026-04-13 00:53:21] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:53:21] UCB=5.9657 mu=4.5529 sigma=0.7064 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.004453157042702185}
[2026-04-13 00:53:21] UCB=5.8618 mu=4.8987 sigma=0.4815 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.0006658663810253478}
[2026-04-13 00:53:21] UCB=5.7090 mu=4.8640 sigma=0.4225 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.00117475523426127}
[2026-04-13 00:53:21] UCB=5.6143 mu=4.5643 sigma=0.5250 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.00045009308923107505}
[2026-04-13 00:53:21] UCB=5.2904 mu=4.4634 sigma=0.4135 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.004462678717603152}
[2026-04-13 00:53:21] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.004453157042702185, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:23] [AutoResearch] Launching job: n_steer=8 n_throttle=2 lr=0.004453
[2026-04-13 00:53:32] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 00:53:32] [AutoResearch] mean_reward=56.7353
[2026-04-13 00:53:32] [AutoResearch] === Trial 7 Summary ===
[2026-04-13 00:53:32] Total runs in history: 25
[2026-04-13 00:53:32] Best so far: mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:32] Top 5 results:
[2026-04-13 00:53:32] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:32] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:32] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:32] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:32] mean_reward=80.3866 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:34]
[AutoResearch] ========== Trial 8/100 ==========
[2026-04-13 00:53:34] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:53:34] UCB=5.5499 mu=4.6053 sigma=0.4723 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.0011279054427624348}
[2026-04-13 00:53:34] UCB=5.4360 mu=4.2750 sigma=0.5805 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.0006027597763957639}
[2026-04-13 00:53:34] UCB=4.8016 mu=4.2809 sigma=0.2604 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.0010534495862622021}
[2026-04-13 00:53:34] UCB=4.7717 mu=4.0103 sigma=0.3807 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0008346235013081151}
[2026-04-13 00:53:34] UCB=4.5734 mu=3.2637 sigma=0.6548 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.0011741923377706195}
[2026-04-13 00:53:34] [AutoResearch] Proposed params: {'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.0011279054427624348, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:36] [AutoResearch] Launching job: n_steer=9 n_throttle=2 lr=0.001128
[2026-04-13 00:53:45] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 00:53:45] [AutoResearch] mean_reward=61.1893
[2026-04-13 00:53:45] [AutoResearch] === Trial 8 Summary ===
[2026-04-13 00:53:45] Total runs in history: 26
[2026-04-13 00:53:45] Best so far: mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:45] Top 5 results:
[2026-04-13 00:53:45] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:45] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:45] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:45] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:45] mean_reward=80.3866 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:47]
[AutoResearch] ========== Trial 9/100 ==========
[2026-04-13 00:53:47] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:53:47] UCB=3.3903 mu=2.8595 sigma=0.2654 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.004035206090986697}
[2026-04-13 00:53:47] UCB=3.1750 mu=2.6479 sigma=0.2635 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0033604201067401833}
[2026-04-13 00:53:47] UCB=3.1008 mu=2.1107 sigma=0.4950 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0011889769544248898}
[2026-04-13 00:53:47] UCB=2.9988 mu=2.3368 sigma=0.3310 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0010834456712308352}
[2026-04-13 00:53:47] UCB=2.9851 mu=2.0823 sigma=0.4514 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0035813642136645536}
[2026-04-13 00:53:47] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.004035206090986697, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:49] [AutoResearch] Launching job: n_steer=6 n_throttle=2 lr=0.004035
[2026-04-13 00:53:57] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 00:53:57] [AutoResearch] mean_reward=62.7198
[2026-04-13 00:53:57] [AutoResearch] === Trial 9 Summary ===
[2026-04-13 00:53:57] Total runs in history: 27
[2026-04-13 00:53:57] Best so far: mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:57] Top 5 results:
[2026-04-13 00:53:57] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:57] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:57] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:57] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:57] mean_reward=80.3866 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:53:59]
[AutoResearch] ========== Trial 10/100 ==========
[2026-04-13 00:53:59] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:53:59] UCB=2.9382 mu=1.5083 sigma=0.7149 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0012897155274619015}
[2026-04-13 00:53:59] UCB=2.8922 mu=1.5425 sigma=0.6748 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.001793043080358741}
[2026-04-13 00:53:59] UCB=2.8546 mu=1.1612 sigma=0.8467 params={'n_steer': 9, 'n_throttle': 4, 'learning_rate': 0.001644106713651884}
[2026-04-13 00:53:59] UCB=2.7720 mu=2.2112 sigma=0.2804 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.001007475016565743}
[2026-04-13 00:53:59] UCB=2.7340 mu=1.0585 sigma=0.8378 params={'n_steer': 9, 'n_throttle': 4, 'learning_rate': 0.0020023257243619004}
[2026-04-13 00:53:59] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0012897155274619015, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:01] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.001290
[2026-04-13 00:54:10] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 00:54:10] [AutoResearch] mean_reward=53.1231
[2026-04-13 00:54:10] [AutoResearch] === Trial 10 Summary ===
[2026-04-13 00:54:10] Total runs in history: 28
[2026-04-13 00:54:10] Best so far: mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:10] Top 5 results:
[2026-04-13 00:54:10] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:10] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:10] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:10] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:10] mean_reward=80.3866 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:12]
[AutoResearch] ========== Trial 11/100 ==========
[2026-04-13 00:54:12] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:54:12] UCB=2.9789 mu=2.0063 sigma=0.4863 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0010549459569002538}
[2026-04-13 00:54:12] UCB=2.9384 mu=2.1633 sigma=0.3875 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0012553950207137904}
[2026-04-13 00:54:12] UCB=2.9384 mu=2.1944 sigma=0.3720 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0014482807862373649}
[2026-04-13 00:54:12] UCB=2.8686 mu=2.2563 sigma=0.3061 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0014199760241790462}
[2026-04-13 00:54:12] UCB=2.8324 mu=2.0061 sigma=0.4131 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.001348691706661935}
[2026-04-13 00:54:12] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0010549459569002538, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:14] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.001055
[2026-04-13 00:54:22] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 00:54:22] [AutoResearch] mean_reward=61.6252
[2026-04-13 00:54:22] [AutoResearch] === Trial 11 Summary ===
[2026-04-13 00:54:22] Total runs in history: 29
[2026-04-13 00:54:22] Best so far: mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:22] Top 5 results:
[2026-04-13 00:54:22] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:22] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:22] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:22] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:22] mean_reward=80.3866 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:24]
[AutoResearch] ========== Trial 12/100 ==========
[2026-04-13 00:54:24] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:54:24] UCB=2.8855 mu=1.6337 sigma=0.6259 params={'n_steer': 6, 'n_throttle': 4, 'learning_rate': 0.00033027725081315553}
[2026-04-13 00:54:24] UCB=2.7876 mu=1.3124 sigma=0.7376 params={'n_steer': 6, 'n_throttle': 4, 'learning_rate': 0.00022455166593943768}
[2026-04-13 00:54:24] UCB=2.7515 mu=1.1581 sigma=0.7967 params={'n_steer': 6, 'n_throttle': 4, 'learning_rate': 0.00023999588542665236}
[2026-04-13 00:54:24] UCB=2.5371 mu=1.6189 sigma=0.4591 params={'n_steer': 6, 'n_throttle': 4, 'learning_rate': 0.00046212852487548554}
[2026-04-13 00:54:24] UCB=2.5157 mu=0.8922 sigma=0.8118 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.00027644239445836957}
[2026-04-13 00:54:24] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 4, 'learning_rate': 0.00033027725081315553, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:26] [AutoResearch] Launching job: n_steer=6 n_throttle=4 lr=0.000330
[2026-04-13 00:54:35] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 00:54:35] [AutoResearch] mean_reward=60.6853
[2026-04-13 00:54:35] [AutoResearch] === Trial 12 Summary ===
[2026-04-13 00:54:35] Total runs in history: 30
[2026-04-13 00:54:35] Best so far: mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:35] Top 5 results:
[2026-04-13 00:54:35] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:35] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:35] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:35] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:35] mean_reward=80.3866 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:37]
[AutoResearch] ========== Trial 13/100 ==========
[2026-04-13 00:54:37] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:54:37] UCB=2.1372 mu=1.9818 sigma=0.0777 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496}
[2026-04-13 00:54:37] UCB=2.1271 mu=1.7571 sigma=0.1850 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.004064621389415977}
[2026-04-13 00:54:37] UCB=2.1104 mu=1.3691 sigma=0.3706 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0037371331309874307}
[2026-04-13 00:54:37] UCB=2.0497 mu=0.9857 sigma=0.5320 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003716589028221073}
[2026-04-13 00:54:37] UCB=2.0210 mu=0.0295 sigma=0.9958 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.004189234937149967}
[2026-04-13 00:54:37] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:39] [AutoResearch] Launching job: n_steer=8 n_throttle=2 lr=0.001226
[2026-04-13 00:54:48] [AutoResearch] Job finished in 9.2s, returncode=0
[2026-04-13 00:54:48] [AutoResearch] mean_reward=103.9999
[2026-04-13 00:54:48] [AutoResearch] === Trial 13 Summary ===
[2026-04-13 00:54:48] Total runs in history: 31
[2026-04-13 00:54:48] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:48] Top 5 results:
[2026-04-13 00:54:48] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:48] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:48] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:48] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:48] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:50]
[AutoResearch] ========== Trial 14/100 ==========
[2026-04-13 00:54:50] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:54:50] UCB=2.6535 mu=2.4622 sigma=0.0956 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001302845514299492}
[2026-04-13 00:54:50] UCB=2.3601 mu=2.1814 sigma=0.0894 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0007622095085624903}
[2026-04-13 00:54:50] UCB=2.3278 mu=2.0003 sigma=0.1637 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0013190715420722456}
[2026-04-13 00:54:50] UCB=2.2009 mu=1.9125 sigma=0.1442 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0011042463684477683}
[2026-04-13 00:54:50] UCB=2.1214 mu=1.7407 sigma=0.1904 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.003770542708865002}
[2026-04-13 00:54:50] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001302845514299492, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:54:52] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.001303
[2026-04-13 00:55:01] [AutoResearch] Job finished in 8.9s, returncode=0
[2026-04-13 00:55:01] [AutoResearch] mean_reward=64.5947
[2026-04-13 00:55:01] [AutoResearch] === Trial 14 Summary ===
[2026-04-13 00:55:01] Total runs in history: 32
[2026-04-13 00:55:01] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:01] Top 5 results:
[2026-04-13 00:55:01] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:01] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:01] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:01] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:01] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:03]
[AutoResearch] ========== Trial 15/100 ==========
[2026-04-13 00:55:03] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:55:03] UCB=2.6144 mu=2.4904 sigma=0.0620 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0010013894647417003}
[2026-04-13 00:55:03] UCB=2.0328 mu=0.0907 sigma=0.9711 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.003286191653881072}
[2026-04-13 00:55:03] UCB=2.0271 mu=0.0529 sigma=0.9871 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0038897315515869606}
[2026-04-13 00:55:03] UCB=2.0221 mu=0.0621 sigma=0.9800 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0036975111663414663}
[2026-04-13 00:55:03] UCB=2.0054 mu=0.0755 sigma=0.9650 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0033043475166238396}
[2026-04-13 00:55:03] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0010013894647417003, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:05] [AutoResearch] Launching job: n_steer=8 n_throttle=2 lr=0.001001
[2026-04-13 00:55:14] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 00:55:14] [AutoResearch] mean_reward=79.0138
[2026-04-13 00:55:14] [AutoResearch] === Trial 15 Summary ===
[2026-04-13 00:55:14] Total runs in history: 33
[2026-04-13 00:55:14] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:14] Top 5 results:
[2026-04-13 00:55:14] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:14] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:14] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:14] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:14] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:16]
[AutoResearch] ========== Trial 16/100 ==========
[2026-04-13 00:55:16] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:55:16] UCB=1.9745 mu=-0.0257 sigma=1.0001 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0049663157953246115}
[2026-04-13 00:55:16] UCB=1.9742 mu=-0.0255 sigma=0.9999 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.004970172293450301}
[2026-04-13 00:55:16] UCB=1.9714 mu=-0.0280 sigma=0.9997 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.004961925772786133}
[2026-04-13 00:55:16] UCB=1.9684 mu=-0.0293 sigma=0.9989 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0047934832879292745}
[2026-04-13 00:55:16] UCB=1.9680 mu=-0.0237 sigma=0.9958 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.004334335985674997}
[2026-04-13 00:55:16] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0049663157953246115, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:18] [AutoResearch] Launching job: n_steer=7 n_throttle=5 lr=0.004966
[2026-04-13 00:55:26] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 00:55:26] [AutoResearch] mean_reward=55.1989
[2026-04-13 00:55:26] [AutoResearch] === Trial 16 Summary ===
[2026-04-13 00:55:26] Total runs in history: 34
[2026-04-13 00:55:26] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:26] Top 5 results:
[2026-04-13 00:55:26] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:26] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:26] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:26] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:26] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:28]
[AutoResearch] ========== Trial 17/100 ==========
[2026-04-13 00:55:28] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:55:28] UCB=2.0140 mu=1.8337 sigma=0.0901 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.0016398984653946051}
[2026-04-13 00:55:28] UCB=1.9557 mu=1.7887 sigma=0.0835 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.0014989037096377728}
[2026-04-13 00:55:28] UCB=1.9071 mu=1.6847 sigma=0.1112 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.00042659129128150123}
[2026-04-13 00:55:28] UCB=1.8517 mu=1.6313 sigma=0.1102 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0005405816939013512}
[2026-04-13 00:55:28] UCB=1.7892 mu=-0.1865 sigma=0.9878 params={'n_steer': 4, 'n_throttle': 5, 'learning_rate': 0.004927207597456925}
[2026-04-13 00:55:28] [AutoResearch] Proposed params: {'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.0016398984653946051, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:30] [AutoResearch] Launching job: n_steer=9 n_throttle=2 lr=0.001640
[2026-04-13 00:55:39] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 00:55:39] [AutoResearch] mean_reward=60.5687
[2026-04-13 00:55:39] [AutoResearch] === Trial 17 Summary ===
[2026-04-13 00:55:39] Total runs in history: 35
[2026-04-13 00:55:39] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:39] Top 5 results:
[2026-04-13 00:55:39] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:39] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:39] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:39] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:39] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:41]
[AutoResearch] ========== Trial 18/100 ==========
[2026-04-13 00:55:41] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:55:41] UCB=1.8632 mu=1.5331 sigma=0.1651 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.00032508345090800943}
[2026-04-13 00:55:41] UCB=1.8474 mu=1.5361 sigma=0.1556 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0005973704154588059}
[2026-04-13 00:55:41] UCB=1.8195 mu=-0.1690 sigma=0.9943 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.004989105666159698}
[2026-04-13 00:55:41] UCB=1.7396 mu=-0.0881 sigma=0.9138 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.00280403820560342}
[2026-04-13 00:55:41] UCB=1.7032 mu=1.5882 sigma=0.0575 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0011853230687248566}
[2026-04-13 00:55:41] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.00032508345090800943, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:43] [AutoResearch] Launching job: n_steer=6 n_throttle=3 lr=0.000325
[2026-04-13 00:55:52] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 00:55:52] [AutoResearch] mean_reward=82.0927
[2026-04-13 00:55:52] [AutoResearch] === Trial 18 Summary ===
[2026-04-13 00:55:52] Total runs in history: 36
[2026-04-13 00:55:52] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:52] Top 5 results:
[2026-04-13 00:55:52] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:52] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:52] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:52] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:52] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:54]
[AutoResearch] ========== Trial 19/100 ==========
[2026-04-13 00:55:54] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:55:54] UCB=1.6624 mu=-0.3286 sigma=0.9955 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.0045710707333632946}
[2026-04-13 00:55:54] UCB=1.6000 mu=-0.2943 sigma=0.9471 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.0002558633731861893}
[2026-04-13 00:55:54] UCB=1.5957 mu=-0.3068 sigma=0.9513 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00022406340703560237}
[2026-04-13 00:55:54] UCB=1.5680 mu=-0.2542 sigma=0.9111 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031127594405234117}
[2026-04-13 00:55:54] UCB=1.5515 mu=-0.4103 sigma=0.9809 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.000885594243876572}
[2026-04-13 00:55:54] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.0045710707333632946, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:55:56] [AutoResearch] Launching job: n_steer=3 n_throttle=5 lr=0.004571
[2026-04-13 00:56:04] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 00:56:04] [AutoResearch] mean_reward=46.9465
[2026-04-13 00:56:04] [AutoResearch] === Trial 19 Summary ===
[2026-04-13 00:56:04] Total runs in history: 37
[2026-04-13 00:56:04] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:04] Top 5 results:
[2026-04-13 00:56:04] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:04] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:04] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:04] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:04] mean_reward=84.9219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:06]
[AutoResearch] ========== Trial 20/100 ==========
[2026-04-13 00:56:06] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:56:06] UCB=1.6977 mu=-0.2391 sigma=0.9684 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154}
[2026-04-13 00:56:06] UCB=1.6151 mu=-0.3489 sigma=0.9820 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00067971942642946}
[2026-04-13 00:56:06] UCB=1.6138 mu=-0.2055 sigma=0.9097 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.003572364364491491}
[2026-04-13 00:56:06] UCB=1.5986 mu=-0.1910 sigma=0.8948 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.00014397841973062383}
[2026-04-13 00:56:06] UCB=1.5967 mu=-0.3128 sigma=0.9547 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00013040987443290104}
[2026-04-13 00:56:06] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:08] [AutoResearch] Launching job: n_steer=3 n_throttle=5 lr=0.000177
[2026-04-13 00:56:17] [AutoResearch] Job finished in 9.3s, returncode=0
[2026-04-13 00:56:17] [AutoResearch] mean_reward=93.2196
[2026-04-13 00:56:17] [AutoResearch] === Trial 20 Summary ===
[2026-04-13 00:56:17] Total runs in history: 38
[2026-04-13 00:56:17] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:17] Top 5 results:
[2026-04-13 00:56:17] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:17] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:17] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:17] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:17] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:19]
[AutoResearch] ========== Trial 21/100 ==========
[2026-04-13 00:56:19] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:56:19] UCB=2.1860 mu=1.3491 sigma=0.4185 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.0002435599740629827}
[2026-04-13 00:56:19] UCB=2.0024 mu=1.4130 sigma=0.2947 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.0004355136246119339}
[2026-04-13 00:56:19] UCB=1.9942 mu=0.7711 sigma=0.6116 params={'n_steer': 3, 'n_throttle': 4, 'learning_rate': 0.0003350675946722402}
[2026-04-13 00:56:19] UCB=1.9030 mu=0.7618 sigma=0.5706 params={'n_steer': 4, 'n_throttle': 4, 'learning_rate': 0.0002524516679931099}
[2026-04-13 00:56:19] UCB=1.7058 mu=0.6406 sigma=0.5326 params={'n_steer': 4, 'n_throttle': 4, 'learning_rate': 0.00042684550774110954}
[2026-04-13 00:56:19] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.0002435599740629827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:21] [AutoResearch] Launching job: n_steer=3 n_throttle=5 lr=0.000244
[2026-04-13 00:56:30] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 00:56:30] [AutoResearch] mean_reward=84.4936
[2026-04-13 00:56:30] [AutoResearch] === Trial 21 Summary ===
[2026-04-13 00:56:30] Total runs in history: 39
[2026-04-13 00:56:30] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:30] Top 5 results:
[2026-04-13 00:56:30] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:30] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:30] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:30] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:30] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:32]
[AutoResearch] ========== Trial 22/100 ==========
[2026-04-13 00:56:32] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:56:32] UCB=2.4040 mu=1.1687 sigma=0.6176 params={'n_steer': 3, 'n_throttle': 4, 'learning_rate': 7.325410951797715e-05}
[2026-04-13 00:56:32] UCB=1.9897 mu=1.2049 sigma=0.3924 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.0001688385519170211}
[2026-04-13 00:56:32] UCB=1.7335 mu=0.6778 sigma=0.5279 params={'n_steer': 4, 'n_throttle': 5, 'learning_rate': 0.0001377630902333695}
[2026-04-13 00:56:32] UCB=1.5198 mu=-0.1580 sigma=0.8389 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0007106208469080401}
[2026-04-13 00:56:32] UCB=1.4557 mu=-0.2625 sigma=0.8591 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.002756253051962962}
[2026-04-13 00:56:32] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 4, 'learning_rate': 7.325410951797715e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:34] [AutoResearch] Launching job: n_steer=3 n_throttle=4 lr=0.000073
[2026-04-13 00:56:43] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 00:56:43] [AutoResearch] mean_reward=56.4207
[2026-04-13 00:56:43] [AutoResearch] === Trial 22 Summary ===
[2026-04-13 00:56:43] Total runs in history: 40
[2026-04-13 00:56:43] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:43] Top 5 results:
[2026-04-13 00:56:43] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:43] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:43] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:43] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:43] mean_reward=87.9600 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:45]
[AutoResearch] ========== Trial 23/100 ==========
[2026-04-13 00:56:45] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:56:45] UCB=1.5724 mu=-0.2479 sigma=0.9102 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485}
[2026-04-13 00:56:45] UCB=1.4851 mu=-0.2874 sigma=0.8863 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.003116908682207946}
[2026-04-13 00:56:45] UCB=1.4772 mu=-0.2902 sigma=0.8837 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0037166051204700636}
[2026-04-13 00:56:45] UCB=1.4731 mu=-0.1346 sigma=0.8039 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0005935261858649259}
[2026-04-13 00:56:45] UCB=1.4535 mu=-0.3182 sigma=0.8858 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.004619708815723241}
[2026-04-13 00:56:45] [AutoResearch] Proposed params: {'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:47] [AutoResearch] Launching job: n_steer=9 n_throttle=5 lr=0.003101
[2026-04-13 00:56:57] [AutoResearch] Job finished in 9.4s, returncode=0
[2026-04-13 00:56:57] [AutoResearch] mean_reward=103.5239
[2026-04-13 00:56:57] [AutoResearch] === Trial 23 Summary ===
[2026-04-13 00:56:57] Total runs in history: 41
[2026-04-13 00:56:57] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:57] Top 5 results:
[2026-04-13 00:56:57] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:57] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:57] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:57] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:57] mean_reward=88.3092 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00159651348358803, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:56:59]
[AutoResearch] ========== Trial 24/100 ==========
[2026-04-13 00:56:59] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:56:59] UCB=2.4438 mu=1.2313 sigma=0.6063 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0019796944610240333}
[2026-04-13 00:56:59] UCB=2.4280 mu=1.4872 sigma=0.4704 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0033225095158634964}
[2026-04-13 00:56:59] UCB=2.4082 mu=1.3713 sigma=0.5184 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.003512585344835523}
[2026-04-13 00:56:59] UCB=2.4022 mu=1.6022 sigma=0.4000 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0036716005531689452}
[2026-04-13 00:56:59] UCB=2.3816 mu=1.7776 sigma=0.3020 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.003169022267786561}
[2026-04-13 00:56:59] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0019796944610240333, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:01] [AutoResearch] Launching job: n_steer=8 n_throttle=5 lr=0.001980
[2026-04-13 00:57:10] [AutoResearch] Job finished in 9.2s, returncode=0
[2026-04-13 00:57:10] [AutoResearch] mean_reward=91.1118
[2026-04-13 00:57:10] [AutoResearch] === Trial 24 Summary ===
[2026-04-13 00:57:10] Total runs in history: 42
[2026-04-13 00:57:10] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:10] Top 5 results:
[2026-04-13 00:57:10] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:10] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:10] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:10] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:10] mean_reward=91.1118 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0019796944610240333, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:12]
[AutoResearch] ========== Trial 25/100 ==========
[2026-04-13 00:57:12] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:57:12] UCB=2.2454 mu=1.5784 sigma=0.3335 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.002889685346896813}
[2026-04-13 00:57:12] UCB=2.2212 mu=1.1759 sigma=0.5227 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0037166264355508885}
[2026-04-13 00:57:12] UCB=2.1976 mu=1.2162 sigma=0.4907 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0035008920621202584}
[2026-04-13 00:57:12] UCB=2.1826 mu=1.4563 sigma=0.3631 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0027885297450610668}
[2026-04-13 00:57:12] UCB=1.9918 mu=1.6761 sigma=0.1578 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.002380279821840149}
[2026-04-13 00:57:12] [AutoResearch] Proposed params: {'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.002889685346896813, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:14] [AutoResearch] Launching job: n_steer=9 n_throttle=5 lr=0.002890
[2026-04-13 00:57:22] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 00:57:22] [AutoResearch] mean_reward=66.2469
[2026-04-13 00:57:22] [AutoResearch] === Trial 25 Summary ===
[2026-04-13 00:57:22] Total runs in history: 43
[2026-04-13 00:57:22] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:22] Top 5 results:
[2026-04-13 00:57:22] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:22] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:22] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:22] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:22] mean_reward=91.1118 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0019796944610240333, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:24]
[AutoResearch] ========== Trial 26/100 ==========
[2026-04-13 00:57:24] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:57:24] UCB=7.1657 mu=5.7753 sigma=0.6952 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.004892287974701984}
[2026-04-13 00:57:24] UCB=6.3122 mu=4.6129 sigma=0.8496 params={'n_steer': 9, 'n_throttle': 4, 'learning_rate': 0.0044375201757866885}
[2026-04-13 00:57:24] UCB=6.1792 mu=5.0737 sigma=0.5527 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0028863064677772865}
[2026-04-13 00:57:24] UCB=5.8843 mu=5.4728 sigma=0.2057 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.003427662504165744}
[2026-04-13 00:57:24] UCB=5.7454 mu=4.6356 sigma=0.5549 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.003303349285142094}
[2026-04-13 00:57:24] [AutoResearch] Proposed params: {'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.004892287974701984, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:27] [AutoResearch] Launching job: n_steer=9 n_throttle=5 lr=0.004892
[2026-04-13 00:57:36] [AutoResearch] Job finished in 9.5s, returncode=0
[2026-04-13 00:57:36] [AutoResearch] mean_reward=92.981
[2026-04-13 00:57:36] [AutoResearch] === Trial 26 Summary ===
[2026-04-13 00:57:36] Total runs in history: 44
[2026-04-13 00:57:36] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:36] Top 5 results:
[2026-04-13 00:57:36] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:36] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:36] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:36] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:36] mean_reward=92.9810 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.004892287974701984, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:38]
[AutoResearch] ========== Trial 27/100 ==========
[2026-04-13 00:57:38] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:57:38] UCB=7.3266 mu=6.0536 sigma=0.6365 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.003242537541023145}
[2026-04-13 00:57:38] UCB=6.9409 mu=5.9973 sigma=0.4718 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0024903186410206676}
[2026-04-13 00:57:38] UCB=6.8413 mu=5.6850 sigma=0.5781 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.002734467523884946}
[2026-04-13 00:57:38] UCB=6.7823 mu=5.5652 sigma=0.6085 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0036905926231828303}
[2026-04-13 00:57:38] UCB=6.6285 mu=5.9260 sigma=0.3513 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.003155576688162544}
[2026-04-13 00:57:38] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.003242537541023145, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:40] [AutoResearch] Launching job: n_steer=7 n_throttle=5 lr=0.003243
[2026-04-13 00:57:48] [AutoResearch] Job finished in 8.1s, returncode=0
[2026-04-13 00:57:48] [AutoResearch] mean_reward=55.613
[2026-04-13 00:57:48] [AutoResearch] === Trial 27 Summary ===
[2026-04-13 00:57:48] Total runs in history: 45
[2026-04-13 00:57:48] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:48] Top 5 results:
[2026-04-13 00:57:48] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:48] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:48] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:48] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:48] mean_reward=92.9810 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.004892287974701984, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:50]
[AutoResearch] ========== Trial 28/100 ==========
[2026-04-13 00:57:50] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:57:50] UCB=6.6390 mu=5.5170 sigma=0.5610 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.00070655144076326}
[2026-04-13 00:57:50] UCB=5.9512 mu=4.7760 sigma=0.5876 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0006257106665899486}
[2026-04-13 00:57:50] UCB=5.3062 mu=4.0409 sigma=0.6326 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0002493010019427645}
[2026-04-13 00:57:50] UCB=5.1055 mu=3.6040 sigma=0.7507 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.00022912963739756914}
[2026-04-13 00:57:50] UCB=4.9223 mu=4.1396 sigma=0.3913 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0013268128209221724}
[2026-04-13 00:57:50] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.00070655144076326, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:57:52] [AutoResearch] Launching job: n_steer=7 n_throttle=5 lr=0.000707
[2026-04-13 00:58:00] [AutoResearch] Job finished in 8.1s, returncode=0
[2026-04-13 00:58:00] [AutoResearch] mean_reward=33.9312
[2026-04-13 00:58:00] [AutoResearch] === Trial 28 Summary ===
[2026-04-13 00:58:00] Total runs in history: 46
[2026-04-13 00:58:00] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:00] Top 5 results:
[2026-04-13 00:58:00] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:00] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:00] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:00] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:00] mean_reward=92.9810 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.004892287974701984, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:02]
[AutoResearch] ========== Trial 29/100 ==========
[2026-04-13 00:58:02] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:58:02] UCB=4.9586 mu=3.8115 sigma=0.5736 params={'n_steer': 9, 'n_throttle': 4, 'learning_rate': 0.0039011945535334163}
[2026-04-13 00:58:02] UCB=4.9435 mu=4.5922 sigma=0.1757 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.004280662970642927}
[2026-04-13 00:58:02] UCB=3.7171 mu=3.3383 sigma=0.1894 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.004536936161070795}
[2026-04-13 00:58:02] UCB=3.6725 mu=1.9754 sigma=0.8485 params={'n_steer': 9, 'n_throttle': 4, 'learning_rate': 0.004111421186319758}
[2026-04-13 00:58:02] UCB=3.3985 mu=1.9384 sigma=0.7301 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.003575550222340208}
[2026-04-13 00:58:02] [AutoResearch] Proposed params: {'n_steer': 9, 'n_throttle': 4, 'learning_rate': 0.0039011945535334163, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:04] [AutoResearch] Launching job: n_steer=9 n_throttle=4 lr=0.003901
[2026-04-13 00:58:13] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 00:58:13] [AutoResearch] mean_reward=61.9858
[2026-04-13 00:58:13] [AutoResearch] === Trial 29 Summary ===
[2026-04-13 00:58:13] Total runs in history: 47
[2026-04-13 00:58:13] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:13] Top 5 results:
[2026-04-13 00:58:13] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:13] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:13] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:13] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:13] mean_reward=92.9810 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.004892287974701984, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:15]
[AutoResearch] ========== Trial 30/100 ==========
[2026-04-13 00:58:15] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:58:15] UCB=3.5878 mu=3.0149 sigma=0.2864 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.003796325289340756}
[2026-04-13 00:58:15] UCB=3.4647 mu=2.7989 sigma=0.3329 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0040326228435428125}
[2026-04-13 00:58:15] UCB=3.4645 mu=3.0981 sigma=0.1832 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.003137759730987224}
[2026-04-13 00:58:15] UCB=3.2179 mu=2.7491 sigma=0.2344 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.003215470457925968}
[2026-04-13 00:58:15] UCB=3.1263 mu=2.5519 sigma=0.2872 params={'n_steer': 9, 'n_throttle': 4, 'learning_rate': 0.0038470093726776454}
[2026-04-13 00:58:15] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.003796325289340756, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:17] [AutoResearch] Launching job: n_steer=8 n_throttle=5 lr=0.003796
[2026-04-13 00:58:25] [AutoResearch] Job finished in 8.0s, returncode=0
[2026-04-13 00:58:25] [AutoResearch] mean_reward=53.6243
[2026-04-13 00:58:25] [AutoResearch] === Trial 30 Summary ===
[2026-04-13 00:58:25] Total runs in history: 48
[2026-04-13 00:58:25] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:25] Top 5 results:
[2026-04-13 00:58:25] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:25] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:25] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:25] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:25] mean_reward=92.9810 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.004892287974701984, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:27]
[AutoResearch] ========== Trial 31/100 ==========
[2026-04-13 00:58:27] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:58:27] UCB=3.1688 mu=2.5086 sigma=0.3301 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0023882854375356236}
[2026-04-13 00:58:27] UCB=3.1629 mu=2.5020 sigma=0.3305 params={'n_steer': 6, 'n_throttle': 5, 'learning_rate': 0.002387030304333966}
[2026-04-13 00:58:27] UCB=3.1145 mu=2.7212 sigma=0.1967 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0016510550230621078}
[2026-04-13 00:58:27] UCB=3.1016 mu=2.0100 sigma=0.5458 params={'n_steer': 5, 'n_throttle': 5, 'learning_rate': 0.00484692975257849}
[2026-04-13 00:58:27] UCB=2.8432 mu=1.8176 sigma=0.5128 params={'n_steer': 6, 'n_throttle': 5, 'learning_rate': 0.0019853355543382888}
[2026-04-13 00:58:27] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0023882854375356236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:29] [AutoResearch] Launching job: n_steer=7 n_throttle=5 lr=0.002388
[2026-04-13 00:58:38] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 00:58:38] [AutoResearch] mean_reward=76.825
[2026-04-13 00:58:38] [AutoResearch] === Trial 31 Summary ===
[2026-04-13 00:58:38] Total runs in history: 49
[2026-04-13 00:58:38] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:38] Top 5 results:
[2026-04-13 00:58:38] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:38] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:38] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:38] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:38] mean_reward=92.9810 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.004892287974701984, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:40]
[AutoResearch] ========== Trial 32/100 ==========
[2026-04-13 00:58:40] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:58:40] UCB=4.5040 mu=3.3165 sigma=0.5937 params={'n_steer': 5, 'n_throttle': 5, 'learning_rate': 0.004832108910719866}
[2026-04-13 00:58:40] UCB=3.5052 mu=1.8707 sigma=0.8173 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.004811057168584232}
[2026-04-13 00:58:40] UCB=3.0596 mu=1.3445 sigma=0.8575 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.004822747984638482}
[2026-04-13 00:58:40] UCB=2.9521 mu=1.2492 sigma=0.8515 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.004675188045160062}
[2026-04-13 00:58:40] UCB=2.6683 mu=2.4118 sigma=0.1283 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.00402338860870788}
[2026-04-13 00:58:40] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 5, 'learning_rate': 0.004832108910719866, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:42] [AutoResearch] Launching job: n_steer=5 n_throttle=5 lr=0.004832
[2026-04-13 00:58:50] [AutoResearch] Job finished in 8.1s, returncode=0
[2026-04-13 00:58:50] [AutoResearch] mean_reward=48.8252
[2026-04-13 00:58:50] [AutoResearch] === Trial 32 Summary ===
[2026-04-13 00:58:50] Total runs in history: 50
[2026-04-13 00:58:50] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:50] Top 5 results:
[2026-04-13 00:58:50] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:50] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:50] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:50] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:50] mean_reward=92.9810 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.004892287974701984, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:52]
[AutoResearch] ========== Trial 33/100 ==========
[2026-04-13 00:58:52] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:58:52] UCB=1.7883 mu=1.4157 sigma=0.1863 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0004421780214786007}
[2026-04-13 00:58:52] UCB=1.6332 mu=1.2485 sigma=0.1924 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005458520619100773}
[2026-04-13 00:58:52] UCB=1.6085 mu=1.1422 sigma=0.2332 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0034888341624946294}
[2026-04-13 00:58:52] UCB=1.5555 mu=1.2740 sigma=0.1407 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00018472412696700732}
[2026-04-13 00:58:52] UCB=1.5479 mu=1.4642 sigma=0.0418 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0014554864226165437}
[2026-04-13 00:58:52] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0004421780214786007, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:58:54] [AutoResearch] Launching job: n_steer=7 n_throttle=3 lr=0.000442
[2026-04-13 00:59:03] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 00:59:03] [AutoResearch] mean_reward=93.49
[2026-04-13 00:59:03] [AutoResearch] === Trial 33 Summary ===
[2026-04-13 00:59:03] Total runs in history: 51
[2026-04-13 00:59:03] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:03] Top 5 results:
[2026-04-13 00:59:03] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:03] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:03] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:03] mean_reward=93.4900 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0004421780214786007, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:03] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:05]
[AutoResearch] ========== Trial 34/100 ==========
[2026-04-13 00:59:05] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:59:05] UCB=3.0621 mu=2.7095 sigma=0.1763 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.004204249470693179}
[2026-04-13 00:59:05] UCB=1.8627 mu=1.4553 sigma=0.2037 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.00036472579252732506}
[2026-04-13 00:59:05] UCB=1.7895 mu=1.5765 sigma=0.1065 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0036985161718080333}
[2026-04-13 00:59:05] UCB=1.7789 mu=1.4144 sigma=0.1822 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0003285374011541316}
[2026-04-13 00:59:05] UCB=1.7574 mu=1.3672 sigma=0.1951 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0007312182936846756}
[2026-04-13 00:59:05] [AutoResearch] Proposed params: {'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.004204249470693179, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:07] [AutoResearch] Launching job: n_steer=9 n_throttle=5 lr=0.004204
[2026-04-13 00:59:15] [AutoResearch] Job finished in 8.0s, returncode=0
[2026-04-13 00:59:15] [AutoResearch] mean_reward=47.1207
[2026-04-13 00:59:15] [AutoResearch] === Trial 34 Summary ===
[2026-04-13 00:59:15] Total runs in history: 52
[2026-04-13 00:59:15] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:15] Top 5 results:
[2026-04-13 00:59:15] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:15] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:15] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:15] mean_reward=93.4900 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0004421780214786007, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:15] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:17]
[AutoResearch] ========== Trial 35/100 ==========
[2026-04-13 00:59:17] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:59:17] UCB=2.9235 mu=2.2865 sigma=0.3185 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.002784221187522108}
[2026-04-13 00:59:17] UCB=2.8672 mu=2.6093 sigma=0.1290 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0029109451287695296}
[2026-04-13 00:59:17] UCB=2.6791 mu=1.9896 sigma=0.3448 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.002601411851310607}
[2026-04-13 00:59:17] UCB=2.5635 mu=2.3469 sigma=0.1083 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0023179844816168427}
[2026-04-13 00:59:17] UCB=2.3529 mu=1.9009 sigma=0.2260 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0022243011947462314}
[2026-04-13 00:59:17] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.002784221187522108, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:19] [AutoResearch] Launching job: n_steer=8 n_throttle=5 lr=0.002784
[2026-04-13 00:59:27] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 00:59:27] [AutoResearch] mean_reward=51.6919
[2026-04-13 00:59:27] [AutoResearch] === Trial 35 Summary ===
[2026-04-13 00:59:27] Total runs in history: 53
[2026-04-13 00:59:27] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:27] Top 5 results:
[2026-04-13 00:59:27] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:27] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:27] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:27] mean_reward=93.4900 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0004421780214786007, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:27] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:29]
[AutoResearch] ========== Trial 36/100 ==========
[2026-04-13 00:59:29] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:59:29] UCB=2.3114 mu=1.1126 sigma=0.5994 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.004917818884535984}
[2026-04-13 00:59:29] UCB=2.2276 mu=1.9254 sigma=0.1511 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001333312491132739}
[2026-04-13 00:59:29] UCB=2.2024 mu=1.0496 sigma=0.5764 params={'n_steer': 5, 'n_throttle': 5, 'learning_rate': 0.0034239137817057386}
[2026-04-13 00:59:29] UCB=2.1809 mu=1.3343 sigma=0.4233 params={'n_steer': 6, 'n_throttle': 5, 'learning_rate': 0.00298246217937707}
[2026-04-13 00:59:29] UCB=2.1737 mu=1.7445 sigma=0.2146 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0011612712919320746}
[2026-04-13 00:59:29] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.004917818884535984, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:31] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.004918
[2026-04-13 00:59:39] [AutoResearch] Job finished in 8.1s, returncode=0
[2026-04-13 00:59:39] [AutoResearch] mean_reward=55.0527
[2026-04-13 00:59:39] [AutoResearch] === Trial 36 Summary ===
[2026-04-13 00:59:39] Total runs in history: 54
[2026-04-13 00:59:39] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:39] Top 5 results:
[2026-04-13 00:59:39] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:39] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:39] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:39] mean_reward=93.4900 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0004421780214786007, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:39] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:41]
[AutoResearch] ========== Trial 37/100 ==========
[2026-04-13 00:59:41] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:59:41] UCB=2.2742 mu=1.0527 sigma=0.6107 params={'n_steer': 5, 'n_throttle': 5, 'learning_rate': 0.003103506151983674}
[2026-04-13 00:59:41] UCB=2.2724 mu=1.1272 sigma=0.5726 params={'n_steer': 5, 'n_throttle': 5, 'learning_rate': 0.0034776157423037283}
[2026-04-13 00:59:41] UCB=2.0847 mu=0.9820 sigma=0.5514 params={'n_steer': 5, 'n_throttle': 5, 'learning_rate': 0.002582735173196978}
[2026-04-13 00:59:41] UCB=2.0473 mu=1.0908 sigma=0.4782 params={'n_steer': 5, 'n_throttle': 5, 'learning_rate': 0.002541965017080456}
[2026-04-13 00:59:41] UCB=2.0151 mu=1.7621 sigma=0.1265 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0011043094917768544}
[2026-04-13 00:59:41] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 5, 'learning_rate': 0.003103506151983674, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:43] [AutoResearch] Launching job: n_steer=5 n_throttle=5 lr=0.003104
[2026-04-13 00:59:52] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 00:59:52] [AutoResearch] mean_reward=63.5424
[2026-04-13 00:59:52] [AutoResearch] === Trial 37 Summary ===
[2026-04-13 00:59:52] Total runs in history: 55
[2026-04-13 00:59:52] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:52] Top 5 results:
[2026-04-13 00:59:52] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:52] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:52] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:52] mean_reward=93.4900 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0004421780214786007, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:52] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:54]
[AutoResearch] ========== Trial 38/100 ==========
[2026-04-13 00:59:54] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 00:59:54] UCB=2.1610 mu=1.7988 sigma=0.1811 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0012546378999387384}
[2026-04-13 00:59:54] UCB=2.1175 mu=1.9016 sigma=0.1080 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.001547211005018497}
[2026-04-13 00:59:54] UCB=1.9489 mu=1.7455 sigma=0.1017 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0016205129597316114}
[2026-04-13 00:59:54] UCB=1.8661 mu=1.3967 sigma=0.2347 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0016776035023467306}
[2026-04-13 00:59:54] UCB=1.7635 mu=1.3716 sigma=0.1959 params={'n_steer': 6, 'n_throttle': 4, 'learning_rate': 0.0007438267354674167}
[2026-04-13 00:59:54] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0012546378999387384, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 00:59:56] [AutoResearch] Launching job: n_steer=8 n_throttle=5 lr=0.001255
[2026-04-13 01:00:04] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:00:04] [AutoResearch] mean_reward=51.6969
[2026-04-13 01:00:04] [AutoResearch] === Trial 38 Summary ===
[2026-04-13 01:00:04] Total runs in history: 56
[2026-04-13 01:00:04] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:04] Top 5 results:
[2026-04-13 01:00:04] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:04] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:04] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:04] mean_reward=93.4900 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0004421780214786007, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:04] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:06]
[AutoResearch] ========== Trial 39/100 ==========
[2026-04-13 01:00:06] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:00:06] UCB=2.4785 mu=2.1016 sigma=0.1885 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0017282835655091705}
[2026-04-13 01:00:06] UCB=1.9567 mu=1.6968 sigma=0.1300 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0016785833914654913}
[2026-04-13 01:00:06] UCB=1.9033 mu=1.4557 sigma=0.2238 params={'n_steer': 6, 'n_throttle': 5, 'learning_rate': 0.001446626886274234}
[2026-04-13 01:00:06] UCB=1.8881 mu=1.3018 sigma=0.2932 params={'n_steer': 6, 'n_throttle': 5, 'learning_rate': 0.0017206509983783897}
[2026-04-13 01:00:06] UCB=1.8052 mu=1.4879 sigma=0.1587 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.004917649877892504}
[2026-04-13 01:00:06] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0017282835655091705, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:08] [AutoResearch] Launching job: n_steer=7 n_throttle=5 lr=0.001728
[2026-04-13 01:00:17] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:00:17] [AutoResearch] mean_reward=64.1342
[2026-04-13 01:00:17] [AutoResearch] === Trial 39 Summary ===
[2026-04-13 01:00:17] Total runs in history: 57
[2026-04-13 01:00:17] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:17] Top 5 results:
[2026-04-13 01:00:17] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:17] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:17] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:17] mean_reward=93.4900 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0004421780214786007, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:17] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:19]
[AutoResearch] ========== Trial 40/100 ==========
[2026-04-13 01:00:19] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:00:19] UCB=1.7335 mu=1.4558 sigma=0.1388 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.00481119662551233}
[2026-04-13 01:00:19] UCB=1.4495 mu=1.1771 sigma=0.1362 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0006912338431165132}
[2026-04-13 01:00:19] UCB=1.4200 mu=1.2048 sigma=0.1076 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0005478886450648676}
[2026-04-13 01:00:19] UCB=1.4173 mu=1.3280 sigma=0.0447 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.001192219905841403}
[2026-04-13 01:00:19] UCB=1.3630 mu=0.7763 sigma=0.2933 params={'n_steer': 4, 'n_throttle': 5, 'learning_rate': 0.00011096549329484543}
[2026-04-13 01:00:19] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.00481119662551233, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:21] [AutoResearch] Launching job: n_steer=8 n_throttle=5 lr=0.004811
[2026-04-13 01:00:29] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:00:29] [AutoResearch] mean_reward=70.4464
[2026-04-13 01:00:29] [AutoResearch] === Trial 40 Summary ===
[2026-04-13 01:00:29] Total runs in history: 58
[2026-04-13 01:00:29] Best so far: mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:29] Top 5 results:
[2026-04-13 01:00:29] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:29] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:29] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:29] mean_reward=93.4900 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0004421780214786007, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:29] mean_reward=93.2196 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00017653233829510154, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:31]
[AutoResearch] ========== Trial 41/100 ==========
[2026-04-13 01:00:31] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:00:31] UCB=1.6130 mu=1.2635 sigma=0.1747 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166}
[2026-04-13 01:00:31] UCB=1.5265 mu=1.2324 sigma=0.1470 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005854607961430371}
[2026-04-13 01:00:31] UCB=1.4742 mu=1.3586 sigma=0.0578 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.001058401079187825}
[2026-04-13 01:00:31] UCB=1.4576 mu=1.3659 sigma=0.0458 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005950202924972575}
[2026-04-13 01:00:31] UCB=1.3862 mu=1.1610 sigma=0.1126 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.00033318036325833013}
[2026-04-13 01:00:31] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:33] [AutoResearch] Launching job: n_steer=7 n_throttle=3 lr=0.000452
[2026-04-13 01:00:43] [AutoResearch] Job finished in 9.6s, returncode=0
[2026-04-13 01:00:43] [AutoResearch] mean_reward=104.4376
[2026-04-13 01:00:43] [AutoResearch] === Trial 41 Summary ===
[2026-04-13 01:00:43] Total runs in history: 59
[2026-04-13 01:00:43] Best so far: mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:43] Top 5 results:
[2026-04-13 01:00:43] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:43] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:43] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:43] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:43] mean_reward=93.4900 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0004421780214786007, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:45]
[AutoResearch] ========== Trial 42/100 ==========
[2026-04-13 01:00:45] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:00:45] UCB=1.7582 mu=1.3895 sigma=0.1844 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0006295287653892741}
[2026-04-13 01:00:45] UCB=1.6624 mu=0.8521 sigma=0.4051 params={'n_steer': 4, 'n_throttle': 5, 'learning_rate': 9.176407724461084e-05}
[2026-04-13 01:00:45] UCB=1.5472 mu=1.1412 sigma=0.2030 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.00029748145893806685}
[2026-04-13 01:00:45] UCB=1.4220 mu=1.0292 sigma=0.1964 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0008335800335562422}
[2026-04-13 01:00:45] UCB=1.3964 mu=0.7717 sigma=0.3124 params={'n_steer': 9, 'n_throttle': 4, 'learning_rate': 0.0048476049106356}
[2026-04-13 01:00:45] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0006295287653892741, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:47] [AutoResearch] Launching job: n_steer=7 n_throttle=3 lr=0.000630
[2026-04-13 01:00:56] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:00:56] [AutoResearch] mean_reward=70.9973
[2026-04-13 01:00:56] [AutoResearch] === Trial 42 Summary ===
[2026-04-13 01:00:56] Total runs in history: 60
[2026-04-13 01:00:56] Best so far: mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:56] Top 5 results:
[2026-04-13 01:00:56] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:56] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:56] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:56] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:56] mean_reward=93.4900 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0004421780214786007, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:00:58]
[AutoResearch] ========== Trial 43/100 ==========
[2026-04-13 01:00:58] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:00:58] UCB=1.3193 mu=1.1360 sigma=0.0916 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0007159788938482198}
[2026-04-13 01:00:58] UCB=1.3050 mu=1.1244 sigma=0.0903 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045682057613451334}
[2026-04-13 01:00:58] UCB=1.2890 mu=0.9199 sigma=0.1845 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0007030551454791813}
[2026-04-13 01:00:58] UCB=1.2709 mu=1.1929 sigma=0.0390 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0006005453602517772}
[2026-04-13 01:00:58] UCB=1.2118 mu=0.8668 sigma=0.1725 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0009341202323981211}
[2026-04-13 01:00:58] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0007159788938482198, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:00] [AutoResearch] Launching job: n_steer=7 n_throttle=3 lr=0.000716
[2026-04-13 01:01:08] [AutoResearch] Job finished in 8.9s, returncode=0
[2026-04-13 01:01:08] [AutoResearch] mean_reward=76.9652
[2026-04-13 01:01:08] [AutoResearch] === Trial 43 Summary ===
[2026-04-13 01:01:08] Total runs in history: 61
[2026-04-13 01:01:08] Best so far: mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:08] Top 5 results:
[2026-04-13 01:01:08] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:08] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:08] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:08] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:08] mean_reward=93.4900 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0004421780214786007, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:10]
[AutoResearch] ========== Trial 44/100 ==========
[2026-04-13 01:01:11] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:01:11] UCB=1.5995 mu=0.5155 sigma=0.5420 params={'n_steer': 4, 'n_throttle': 5, 'learning_rate': 0.00011290227851633631}
[2026-04-13 01:01:11] UCB=1.5736 mu=0.4049 sigma=0.5844 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036195928187379594}
[2026-04-13 01:01:11] UCB=1.5638 mu=0.5520 sigma=0.5059 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037243294416534696}
[2026-04-13 01:01:11] UCB=1.5006 mu=0.1707 sigma=0.6650 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003717002717764468}
[2026-04-13 01:01:11] UCB=1.4669 mu=0.4719 sigma=0.4975 params={'n_steer': 4, 'n_throttle': 5, 'learning_rate': 0.00016829814424457922}
[2026-04-13 01:01:11] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 5, 'learning_rate': 0.00011290227851633631, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:13] [AutoResearch] Launching job: n_steer=4 n_throttle=5 lr=0.000113
[2026-04-13 01:01:21] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:01:21] [AutoResearch] mean_reward=56.7355
[2026-04-13 01:01:21] [AutoResearch] === Trial 44 Summary ===
[2026-04-13 01:01:21] Total runs in history: 62
[2026-04-13 01:01:21] Best so far: mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:21] Top 5 results:
[2026-04-13 01:01:21] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:21] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:21] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:21] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:21] mean_reward=93.4900 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0004421780214786007, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:23]
[AutoResearch] ========== Trial 45/100 ==========
[2026-04-13 01:01:23] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:01:23] UCB=1.6249 mu=0.5735 sigma=0.5257 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086}
[2026-04-13 01:01:23] UCB=1.5889 mu=1.5124 sigma=0.0383 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012067008657401645}
[2026-04-13 01:01:23] UCB=1.5641 mu=1.1911 sigma=0.1865 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 9.684164973679993e-05}
[2026-04-13 01:01:23] UCB=1.5500 mu=0.5704 sigma=0.4898 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0038494361423192305}
[2026-04-13 01:01:23] UCB=1.5301 mu=1.3060 sigma=0.1121 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005043277816389657}
[2026-04-13 01:01:23] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:25] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003537
[2026-04-13 01:01:34] [AutoResearch] Job finished in 9.1s, returncode=0
[2026-04-13 01:01:34] [AutoResearch] mean_reward=106.2747
[2026-04-13 01:01:34] [AutoResearch] === Trial 45 Summary ===
[2026-04-13 01:01:34] Total runs in history: 63
[2026-04-13 01:01:34] Best so far: mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:34] Top 5 results:
[2026-04-13 01:01:34] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:34] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:34] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:34] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:34] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:36]
[AutoResearch] ========== Trial 46/100 ==========
[2026-04-13 01:01:36] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:01:36] UCB=2.6148 mu=2.2008 sigma=0.2070 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003844280434415957}
[2026-04-13 01:01:36] UCB=2.5851 mu=2.0477 sigma=0.2687 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.003953238060976906}
[2026-04-13 01:01:36] UCB=2.4550 mu=1.8287 sigma=0.3131 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0036644017424491207}
[2026-04-13 01:01:36] UCB=2.4106 mu=1.9238 sigma=0.2434 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.004029972076381933}
[2026-04-13 01:01:36] UCB=2.4035 mu=1.9144 sigma=0.2446 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0038811905759377073}
[2026-04-13 01:01:36] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003844280434415957, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:38] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003844
[2026-04-13 01:01:47] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:01:47] [AutoResearch] mean_reward=79.9905
[2026-04-13 01:01:47] [AutoResearch] === Trial 46 Summary ===
[2026-04-13 01:01:47] Total runs in history: 64
[2026-04-13 01:01:47] Best so far: mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:47] Top 5 results:
[2026-04-13 01:01:47] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:47] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:47] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:47] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:47] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:49]
[AutoResearch] ========== Trial 47/100 ==========
[2026-04-13 01:01:49] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:01:49] UCB=3.0297 mu=2.5806 sigma=0.2245 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0028291407121084407}
[2026-04-13 01:01:49] UCB=2.9375 mu=1.9087 sigma=0.5144 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0030278845563952352}
[2026-04-13 01:01:49] UCB=2.7578 mu=2.2941 sigma=0.2318 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0030479604464013423}
[2026-04-13 01:01:49] UCB=2.6960 mu=1.5127 sigma=0.5917 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.003332524578826928}
[2026-04-13 01:01:49] UCB=2.6809 mu=1.6907 sigma=0.4951 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0034532169135711747}
[2026-04-13 01:01:49] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0028291407121084407, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:01:51] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.002829
[2026-04-13 01:02:01] [AutoResearch] Job finished in 9.6s, returncode=0
[2026-04-13 01:02:01] [AutoResearch] mean_reward=68.3088
[2026-04-13 01:02:01] [AutoResearch] === Trial 47 Summary ===
[2026-04-13 01:02:01] Total runs in history: 65
[2026-04-13 01:02:01] Best so far: mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:01] Top 5 results:
[2026-04-13 01:02:01] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:01] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:01] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:01] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:01] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:03]
[AutoResearch] ========== Trial 48/100 ==========
[2026-04-13 01:02:03] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:02:03] UCB=3.1353 mu=2.1547 sigma=0.4903 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0032076500023948576}
[2026-04-13 01:02:03] UCB=3.0294 mu=1.7130 sigma=0.6582 params={'n_steer': 6, 'n_throttle': 4, 'learning_rate': 0.003907835192628509}
[2026-04-13 01:02:03] UCB=2.9204 mu=1.4273 sigma=0.7466 params={'n_steer': 6, 'n_throttle': 4, 'learning_rate': 0.003519085669838268}
[2026-04-13 01:02:03] UCB=2.8825 mu=1.7686 sigma=0.5569 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0033921286149784075}
[2026-04-13 01:02:03] UCB=2.6463 mu=1.6704 sigma=0.4880 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.003321355090899492}
[2026-04-13 01:02:03] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0032076500023948576, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:05] [AutoResearch] Launching job: n_steer=7 n_throttle=3 lr=0.003208
[2026-04-13 01:02:13] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:02:13] [AutoResearch] mean_reward=67.6738
[2026-04-13 01:02:13] [AutoResearch] === Trial 48 Summary ===
[2026-04-13 01:02:13] Total runs in history: 66
[2026-04-13 01:02:13] Best so far: mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:13] Top 5 results:
[2026-04-13 01:02:13] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:13] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:13] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:13] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:13] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:15]
[AutoResearch] ========== Trial 49/100 ==========
[2026-04-13 01:02:15] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:02:15] UCB=2.0311 mu=1.7192 sigma=0.1559 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.0035826935334325777}
[2026-04-13 01:02:15] UCB=2.0229 mu=1.6814 sigma=0.1708 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036870422967687157}
[2026-04-13 01:02:15] UCB=1.7291 mu=1.4852 sigma=0.1219 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.003557464402673415}
[2026-04-13 01:02:15] UCB=1.6406 mu=1.0372 sigma=0.3017 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.003713624821817738}
[2026-04-13 01:02:15] UCB=1.5777 mu=1.2395 sigma=0.1691 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005236542537341072}
[2026-04-13 01:02:15] [AutoResearch] Proposed params: {'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.0035826935334325777, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:17] [AutoResearch] Launching job: n_steer=9 n_throttle=3 lr=0.003583
[2026-04-13 01:02:26] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:02:26] [AutoResearch] mean_reward=76.6932
[2026-04-13 01:02:26] [AutoResearch] === Trial 49 Summary ===
[2026-04-13 01:02:26] Total runs in history: 67
[2026-04-13 01:02:26] Best so far: mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:26] Top 5 results:
[2026-04-13 01:02:26] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:26] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:26] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:26] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:26] mean_reward=97.7536 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:28]
[AutoResearch] ========== Trial 50/100 ==========
[2026-04-13 01:02:28] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:02:28] UCB=1.8598 mu=1.0155 sigma=0.4222 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467}
[2026-04-13 01:02:28] UCB=1.8420 mu=0.9708 sigma=0.4356 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033686534240031023}
[2026-04-13 01:02:28] UCB=1.8306 mu=1.4817 sigma=0.1744 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0037061421348747515}
[2026-04-13 01:02:28] UCB=1.8202 mu=1.3731 sigma=0.2235 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.004982273167395865}
[2026-04-13 01:02:28] UCB=1.7419 mu=1.3599 sigma=0.1910 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0031800914080309457}
[2026-04-13 01:02:28] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:30] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.003357
[2026-04-13 01:02:39] [AutoResearch] Job finished in 9.4s, returncode=0
[2026-04-13 01:02:39] [AutoResearch] mean_reward=105.4572
[2026-04-13 01:02:39] [AutoResearch] === Trial 50 Summary ===
[2026-04-13 01:02:39] Total runs in history: 68
[2026-04-13 01:02:39] Best so far: mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:39] Top 5 results:
[2026-04-13 01:02:39] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:39] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:39] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:39] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:39] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:41]
[AutoResearch] ========== Trial 51/100 ==========
[2026-04-13 01:02:41] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:02:41] UCB=2.8469 mu=2.2887 sigma=0.2791 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.003503516421514994}
[2026-04-13 01:02:41] UCB=2.6024 mu=2.2457 sigma=0.1783 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0033562551248861}
[2026-04-13 01:02:41] UCB=2.5427 mu=2.1734 sigma=0.1847 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0038945356334680347}
[2026-04-13 01:02:41] UCB=2.4488 mu=2.2000 sigma=0.1244 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.003141952549916427}
[2026-04-13 01:02:41] UCB=2.2721 mu=1.8866 sigma=0.1927 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.003418544100470281}
[2026-04-13 01:02:41] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.003503516421514994, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:43] [AutoResearch] Launching job: n_steer=7 n_throttle=4 lr=0.003504
[2026-04-13 01:02:52] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:02:52] [AutoResearch] mean_reward=61.6686
[2026-04-13 01:02:52] [AutoResearch] === Trial 51 Summary ===
[2026-04-13 01:02:52] Total runs in history: 69
[2026-04-13 01:02:52] Best so far: mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:52] Top 5 results:
[2026-04-13 01:02:52] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:52] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:52] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:52] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:52] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:54]
[AutoResearch] ========== Trial 52/100 ==========
[2026-04-13 01:02:54] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:02:54] UCB=2.0036 mu=1.6638 sigma=0.1699 params={'n_steer': 9, 'n_throttle': 4, 'learning_rate': 0.003136254777440734}
[2026-04-13 01:02:54] UCB=1.9031 mu=1.8117 sigma=0.0457 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003487642300268097}
[2026-04-13 01:02:54] UCB=1.8956 mu=1.7645 sigma=0.0656 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003283719702274022}
[2026-04-13 01:02:54] UCB=1.8323 mu=1.5945 sigma=0.1189 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0034095553108700032}
[2026-04-13 01:02:54] UCB=1.7755 mu=1.3700 sigma=0.2027 params={'n_steer': 9, 'n_throttle': 4, 'learning_rate': 0.003094797273522734}
[2026-04-13 01:02:54] [AutoResearch] Proposed params: {'n_steer': 9, 'n_throttle': 4, 'learning_rate': 0.003136254777440734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:02:56] [AutoResearch] Launching job: n_steer=9 n_throttle=4 lr=0.003136
[2026-04-13 01:03:04] [AutoResearch] Job finished in 8.0s, returncode=0
[2026-04-13 01:03:04] [AutoResearch] mean_reward=39.8701
[2026-04-13 01:03:04] [AutoResearch] === Trial 52 Summary ===
[2026-04-13 01:03:04] Total runs in history: 70
[2026-04-13 01:03:04] Best so far: mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:04] Top 5 results:
[2026-04-13 01:03:04] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:04] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:04] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:04] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:04] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:06]
[AutoResearch] ========== Trial 53/100 ==========
[2026-04-13 01:03:06] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:03:06] UCB=2.3894 mu=2.1031 sigma=0.1431 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0037791857980495444}
[2026-04-13 01:03:06] UCB=2.1083 mu=1.9663 sigma=0.0710 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0036257643355356185}
[2026-04-13 01:03:06] UCB=1.8660 mu=1.5402 sigma=0.1629 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0029978250406575173}
[2026-04-13 01:03:06] UCB=1.8328 mu=1.7426 sigma=0.0451 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003441303404564538}
[2026-04-13 01:03:06] UCB=1.7434 mu=1.4940 sigma=0.1247 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003602405132832305}
[2026-04-13 01:03:06] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0037791857980495444, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:08] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.003779
[2026-04-13 01:03:16] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:03:16] [AutoResearch] mean_reward=53.9561
[2026-04-13 01:03:16] [AutoResearch] === Trial 53 Summary ===
[2026-04-13 01:03:16] Total runs in history: 71
[2026-04-13 01:03:16] Best so far: mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:16] Top 5 results:
[2026-04-13 01:03:16] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:16] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:16] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:16] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:16] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:18]
[AutoResearch] ========== Trial 54/100 ==========
[2026-04-13 01:03:18] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:03:18] UCB=5.4921 mu=4.9185 sigma=0.2868 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.002091169453066519}
[2026-04-13 01:03:18] UCB=5.1617 mu=4.8210 sigma=0.1703 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0023742220940778315}
[2026-04-13 01:03:18] UCB=5.0092 mu=4.4156 sigma=0.2968 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.001984656314923199}
[2026-04-13 01:03:18] UCB=4.8825 mu=4.3546 sigma=0.2639 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0024304870908219965}
[2026-04-13 01:03:18] UCB=4.6763 mu=4.2447 sigma=0.2158 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.002459282996517355}
[2026-04-13 01:03:18] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.002091169453066519, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:20] [AutoResearch] Launching job: n_steer=7 n_throttle=4 lr=0.002091
[2026-04-13 01:03:29] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:03:29] [AutoResearch] mean_reward=66.9087
[2026-04-13 01:03:29] [AutoResearch] === Trial 54 Summary ===
[2026-04-13 01:03:29] Total runs in history: 72
[2026-04-13 01:03:29] Best so far: mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:29] Top 5 results:
[2026-04-13 01:03:29] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:29] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:29] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:29] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:29] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:31]
[AutoResearch] ========== Trial 55/100 ==========
[2026-04-13 01:03:31] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:03:31] UCB=2.7843 mu=2.6027 sigma=0.0908 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002602359210830878}
[2026-04-13 01:03:31] UCB=2.6461 mu=2.3439 sigma=0.1511 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0029884863308299718}
[2026-04-13 01:03:31] UCB=2.5984 mu=2.3608 sigma=0.1188 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0028026819437670806}
[2026-04-13 01:03:31] UCB=2.4660 mu=2.2362 sigma=0.1149 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002503379176524994}
[2026-04-13 01:03:31] UCB=2.2774 mu=1.9190 sigma=0.1792 params={'n_steer': 9, 'n_throttle': 4, 'learning_rate': 0.00475857734336384}
[2026-04-13 01:03:31] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002602359210830878, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:33] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.002602
[2026-04-13 01:03:42] [AutoResearch] Job finished in 9.5s, returncode=0
[2026-04-13 01:03:42] [AutoResearch] mean_reward=101.9303
[2026-04-13 01:03:42] [AutoResearch] === Trial 55 Summary ===
[2026-04-13 01:03:42] Total runs in history: 73
[2026-04-13 01:03:42] Best so far: mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:42] Top 5 results:
[2026-04-13 01:03:42] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:42] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:42] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:42] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:42] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:44]
[AutoResearch] ========== Trial 56/100 ==========
[2026-04-13 01:03:44] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:03:44] UCB=2.6169 mu=2.4768 sigma=0.0700 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.003021189222395069}
[2026-04-13 01:03:44] UCB=2.5127 mu=2.3775 sigma=0.0676 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0026110506703627768}
[2026-04-13 01:03:44] UCB=2.2265 mu=2.0045 sigma=0.1110 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.002439155938689591}
[2026-04-13 01:03:44] UCB=1.9627 mu=1.6870 sigma=0.1378 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0027005643690790376}
[2026-04-13 01:03:44] UCB=1.9020 mu=1.7649 sigma=0.0685 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0027929753351139605}
[2026-04-13 01:03:44] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.003021189222395069, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:46] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.003021
[2026-04-13 01:03:55] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:03:55] [AutoResearch] mean_reward=58.7155
[2026-04-13 01:03:55] [AutoResearch] === Trial 56 Summary ===
[2026-04-13 01:03:55] Total runs in history: 74
[2026-04-13 01:03:55] Best so far: mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:55] Top 5 results:
[2026-04-13 01:03:55] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:55] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:55] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:55] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:55] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:57]
[AutoResearch] ========== Trial 57/100 ==========
[2026-04-13 01:03:57] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:03:57] UCB=2.6746 mu=2.2939 sigma=0.1903 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.004960121466140565}
[2026-04-13 01:03:57] UCB=1.7692 mu=1.4337 sigma=0.1677 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0034893159424251013}
[2026-04-13 01:03:57] UCB=1.7564 mu=1.5204 sigma=0.1180 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003395858746194034}
[2026-04-13 01:03:57] UCB=1.6935 mu=1.4156 sigma=0.1390 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002571761008693607}
[2026-04-13 01:03:57] UCB=1.6910 mu=1.5961 sigma=0.0474 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0034426111314956598}
[2026-04-13 01:03:57] [AutoResearch] Proposed params: {'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.004960121466140565, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:03:59] [AutoResearch] Launching job: n_steer=9 n_throttle=5 lr=0.004960
[2026-04-13 01:04:08] [AutoResearch] Job finished in 8.9s, returncode=0
[2026-04-13 01:04:08] [AutoResearch] mean_reward=70.3424
[2026-04-13 01:04:08] [AutoResearch] === Trial 57 Summary ===
[2026-04-13 01:04:08] Total runs in history: 75
[2026-04-13 01:04:08] Best so far: mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:08] Top 5 results:
[2026-04-13 01:04:08] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:08] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:08] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:08] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:08] mean_reward=103.5239 params={'n_steer': 9, 'n_throttle': 5, 'learning_rate': 0.0031013569868078485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:10]
[AutoResearch] ========== Trial 58/100 ==========
[2026-04-13 01:04:10] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:04:10] UCB=1.8567 mu=1.5632 sigma=0.1467 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773}
[2026-04-13 01:04:10] UCB=1.8357 mu=1.4635 sigma=0.1861 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.0035538615086420167}
[2026-04-13 01:04:10] UCB=1.8036 mu=1.5465 sigma=0.1286 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0034332347747791014}
[2026-04-13 01:04:10] UCB=1.7984 mu=1.4089 sigma=0.1948 params={'n_steer': 9, 'n_throttle': 4, 'learning_rate': 0.004877454938306927}
[2026-04-13 01:04:10] UCB=1.6583 mu=1.3264 sigma=0.1660 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0036425663695114395}
[2026-04-13 01:04:10] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:12] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.002078
[2026-04-13 01:04:21] [AutoResearch] Job finished in 9.6s, returncode=0
[2026-04-13 01:04:21] [AutoResearch] mean_reward=114.5598
[2026-04-13 01:04:21] [AutoResearch] === Trial 58 Summary ===
[2026-04-13 01:04:21] Total runs in history: 76
[2026-04-13 01:04:21] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:21] Top 5 results:
[2026-04-13 01:04:21] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:21] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:21] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:21] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:21] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:23]
[AutoResearch] ========== Trial 59/100 ==========
[2026-04-13 01:04:23] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:04:23] UCB=2.1303 mu=1.8207 sigma=0.1548 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0022104563389860883}
[2026-04-13 01:04:23] UCB=1.9377 mu=1.6827 sigma=0.1275 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003560303043080492}
[2026-04-13 01:04:23] UCB=1.9019 mu=1.6725 sigma=0.1147 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035325793902322507}
[2026-04-13 01:04:23] UCB=1.8126 mu=1.7242 sigma=0.0442 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002517185297488741}
[2026-04-13 01:04:23] UCB=1.6106 mu=1.2566 sigma=0.1770 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0017350695491934467}
[2026-04-13 01:04:23] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0022104563389860883, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:25] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.002210
[2026-04-13 01:04:34] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:04:34] [AutoResearch] mean_reward=59.787
[2026-04-13 01:04:34] [AutoResearch] === Trial 59 Summary ===
[2026-04-13 01:04:34] Total runs in history: 77
[2026-04-13 01:04:34] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:34] Top 5 results:
[2026-04-13 01:04:34] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:34] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:34] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:34] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:34] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:36]
[AutoResearch] ========== Trial 60/100 ==========
[2026-04-13 01:04:36] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:04:36] UCB=1.7627 mu=1.4372 sigma=0.1627 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003385032789495021}
[2026-04-13 01:04:36] UCB=1.6595 mu=1.3248 sigma=0.1673 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0019231990642406007}
[2026-04-13 01:04:36] UCB=1.6548 mu=1.3200 sigma=0.1674 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.003790052451193266}
[2026-04-13 01:04:36] UCB=1.5804 mu=1.2272 sigma=0.1766 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003227530884344602}
[2026-04-13 01:04:36] UCB=1.5379 mu=1.3356 sigma=0.1011 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002259727453879119}
[2026-04-13 01:04:36] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003385032789495021, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:38] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003385
[2026-04-13 01:04:46] [AutoResearch] Job finished in 8.0s, returncode=0
[2026-04-13 01:04:46] [AutoResearch] mean_reward=42.9815
[2026-04-13 01:04:46] [AutoResearch] === Trial 60 Summary ===
[2026-04-13 01:04:46] Total runs in history: 78
[2026-04-13 01:04:46] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:46] Top 5 results:
[2026-04-13 01:04:46] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:46] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:46] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:46] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:46] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:48]
[AutoResearch] ========== Trial 61/100 ==========
[2026-04-13 01:04:48] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:04:48] UCB=2.1279 mu=1.6515 sigma=0.2382 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.004787041320589642}
[2026-04-13 01:04:48] UCB=1.8676 mu=1.4351 sigma=0.2163 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.004701709517909715}
[2026-04-13 01:04:48] UCB=1.8432 mu=1.5163 sigma=0.1634 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.004149073105119007}
[2026-04-13 01:04:48] UCB=1.6988 mu=1.2390 sigma=0.2299 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.004744611261119761}
[2026-04-13 01:04:48] UCB=1.6827 mu=1.4290 sigma=0.1269 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.0038868753942082244}
[2026-04-13 01:04:48] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.004787041320589642, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:50] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.004787
[2026-04-13 01:04:58] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:04:58] [AutoResearch] mean_reward=65.045
[2026-04-13 01:04:58] [AutoResearch] === Trial 61 Summary ===
[2026-04-13 01:04:58] Total runs in history: 79
[2026-04-13 01:04:58] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:58] Top 5 results:
[2026-04-13 01:04:58] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:58] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:58] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:58] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:04:58] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:00]
[AutoResearch] ========== Trial 62/100 ==========
[2026-04-13 01:05:00] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:05:00] UCB=1.5743 mu=1.2569 sigma=0.1587 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0021856615273897605}
[2026-04-13 01:05:00] UCB=1.4372 mu=1.1499 sigma=0.1437 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.003792279156073999}
[2026-04-13 01:05:00] UCB=1.3980 mu=1.0393 sigma=0.1794 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.0036913757635166303}
[2026-04-13 01:05:00] UCB=1.3798 mu=1.0517 sigma=0.1641 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0022112083169295686}
[2026-04-13 01:05:00] UCB=1.3117 mu=1.2119 sigma=0.0499 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.002578160577901026}
[2026-04-13 01:05:00] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0021856615273897605, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:02] [AutoResearch] Launching job: n_steer=7 n_throttle=4 lr=0.002186
[2026-04-13 01:05:11] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:05:11] [AutoResearch] mean_reward=58.5231
[2026-04-13 01:05:11] [AutoResearch] === Trial 62 Summary ===
[2026-04-13 01:05:11] Total runs in history: 80
[2026-04-13 01:05:11] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:11] Top 5 results:
[2026-04-13 01:05:11] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:11] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:11] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:11] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:11] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:13]
[AutoResearch] ========== Trial 63/100 ==========
[2026-04-13 01:05:13] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:05:13] UCB=1.7901 mu=1.3156 sigma=0.2372 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.004930587667512456}
[2026-04-13 01:05:13] UCB=1.5102 mu=1.1315 sigma=0.1893 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.003720865491572539}
[2026-04-13 01:05:13] UCB=1.4927 mu=1.2466 sigma=0.1231 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002490144187604425}
[2026-04-13 01:05:13] UCB=1.4400 mu=1.2868 sigma=0.0766 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0019269838169304325}
[2026-04-13 01:05:13] UCB=1.3917 mu=1.1059 sigma=0.1429 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.00019760731914357475}
[2026-04-13 01:05:13] [AutoResearch] Proposed params: {'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.004930587667512456, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:15] [AutoResearch] Launching job: n_steer=9 n_throttle=3 lr=0.004931
[2026-04-13 01:05:24] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:05:24] [AutoResearch] mean_reward=78.6669
[2026-04-13 01:05:24] [AutoResearch] === Trial 63 Summary ===
[2026-04-13 01:05:24] Total runs in history: 81
[2026-04-13 01:05:24] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:24] Top 5 results:
[2026-04-13 01:05:24] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:24] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:24] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:24] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:24] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:26]
[AutoResearch] ========== Trial 64/100 ==========
[2026-04-13 01:05:26] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:05:26] UCB=1.5819 mu=1.1806 sigma=0.2006 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.004064294663065903}
[2026-04-13 01:05:26] UCB=1.5700 mu=1.1574 sigma=0.2063 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.004148802663890989}
[2026-04-13 01:05:26] UCB=1.5562 mu=1.2899 sigma=0.1331 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0018491226594447737}
[2026-04-13 01:05:26] UCB=1.5428 mu=1.0712 sigma=0.2358 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.003943671910694899}
[2026-04-13 01:05:26] UCB=1.5066 mu=1.1739 sigma=0.1664 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0019804265088818845}
[2026-04-13 01:05:26] [AutoResearch] Proposed params: {'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.004064294663065903, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:28] [AutoResearch] Launching job: n_steer=9 n_throttle=3 lr=0.004064
[2026-04-13 01:05:36] [AutoResearch] Job finished in 8.1s, returncode=0
[2026-04-13 01:05:36] [AutoResearch] mean_reward=39.001
[2026-04-13 01:05:36] [AutoResearch] === Trial 64 Summary ===
[2026-04-13 01:05:36] Total runs in history: 82
[2026-04-13 01:05:36] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:36] Top 5 results:
[2026-04-13 01:05:36] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:36] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:36] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:36] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:36] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:38]
[AutoResearch] ========== Trial 65/100 ==========
[2026-04-13 01:05:38] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:05:38] UCB=2.8231 mu=2.4250 sigma=0.1990 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.003923871637044892}
[2026-04-13 01:05:38] UCB=2.8161 mu=2.1770 sigma=0.3196 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.004453907360537908}
[2026-04-13 01:05:38] UCB=2.6501 mu=1.7653 sigma=0.4424 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.00453120038692142}
[2026-04-13 01:05:38] UCB=2.4845 mu=2.0625 sigma=0.2110 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.004064600267163309}
[2026-04-13 01:05:38] UCB=2.4047 mu=2.0052 sigma=0.1997 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.003960646727604702}
[2026-04-13 01:05:38] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.003923871637044892, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:40] [AutoResearch] Launching job: n_steer=7 n_throttle=3 lr=0.003924
[2026-04-13 01:05:49] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:05:49] [AutoResearch] mean_reward=99.8089
[2026-04-13 01:05:49] [AutoResearch] === Trial 65 Summary ===
[2026-04-13 01:05:49] Total runs in history: 83
[2026-04-13 01:05:49] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:49] Top 5 results:
[2026-04-13 01:05:49] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:49] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:49] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:49] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:49] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:51]
[AutoResearch] ========== Trial 66/100 ==========
[2026-04-13 01:05:51] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:05:51] UCB=2.3738 mu=2.0157 sigma=0.1790 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.004159920154799998}
[2026-04-13 01:05:51] UCB=2.1982 mu=1.8497 sigma=0.1742 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.003722002422435926}
[2026-04-13 01:05:51] UCB=2.0690 mu=1.6541 sigma=0.2075 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0037213477372882225}
[2026-04-13 01:05:51] UCB=1.9289 mu=1.5763 sigma=0.1763 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0036069747705951873}
[2026-04-13 01:05:51] UCB=1.9150 mu=1.6711 sigma=0.1219 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.004348602413613732}
[2026-04-13 01:05:51] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.004159920154799998, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:05:53] [AutoResearch] Launching job: n_steer=7 n_throttle=3 lr=0.004160
[2026-04-13 01:06:01] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:06:01] [AutoResearch] mean_reward=62.1699
[2026-04-13 01:06:01] [AutoResearch] === Trial 66 Summary ===
[2026-04-13 01:06:01] Total runs in history: 84
[2026-04-13 01:06:01] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:01] Top 5 results:
[2026-04-13 01:06:01] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:01] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:01] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:01] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:01] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:03]
[AutoResearch] ========== Trial 67/100 ==========
[2026-04-13 01:06:03] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:06:03] UCB=2.4845 mu=2.1375 sigma=0.1735 params={'n_steer': 9, 'n_throttle': 4, 'learning_rate': 0.004768922625549707}
[2026-04-13 01:06:03] UCB=1.8316 mu=1.4908 sigma=0.1704 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002102618797847224}
[2026-04-13 01:06:03] UCB=1.7889 mu=1.4338 sigma=0.1775 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.003651668545736029}
[2026-04-13 01:06:03] UCB=1.7868 mu=1.4383 sigma=0.1742 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020305607126875954}
[2026-04-13 01:06:03] UCB=1.5796 mu=1.3612 sigma=0.1092 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0025010401732854662}
[2026-04-13 01:06:03] [AutoResearch] Proposed params: {'n_steer': 9, 'n_throttle': 4, 'learning_rate': 0.004768922625549707, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:05] [AutoResearch] Launching job: n_steer=9 n_throttle=4 lr=0.004769
[2026-04-13 01:06:13] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:06:13] [AutoResearch] mean_reward=49.7339
[2026-04-13 01:06:13] [AutoResearch] === Trial 67 Summary ===
[2026-04-13 01:06:13] Total runs in history: 85
[2026-04-13 01:06:13] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:13] Top 5 results:
[2026-04-13 01:06:13] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:13] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:13] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:13] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:13] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:15]
[AutoResearch] ========== Trial 68/100 ==========
[2026-04-13 01:06:15] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:06:15] UCB=1.8279 mu=1.4831 sigma=0.1724 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0021441054792919454}
[2026-04-13 01:06:15] UCB=1.6779 mu=1.5381 sigma=0.0699 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0022630193815941792}
[2026-04-13 01:06:15] UCB=1.6722 mu=1.3677 sigma=0.1522 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0018916578033984718}
[2026-04-13 01:06:15] UCB=1.6426 mu=1.2649 sigma=0.1888 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0034753308084023095}
[2026-04-13 01:06:15] UCB=1.3442 mu=1.1487 sigma=0.0977 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0013053033612060543}
[2026-04-13 01:06:15] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0021441054792919454, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:17] [AutoResearch] Launching job: n_steer=8 n_throttle=5 lr=0.002144
[2026-04-13 01:06:26] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:06:26] [AutoResearch] mean_reward=72.0575
[2026-04-13 01:06:26] [AutoResearch] === Trial 68 Summary ===
[2026-04-13 01:06:26] Total runs in history: 86
[2026-04-13 01:06:26] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:26] Top 5 results:
[2026-04-13 01:06:26] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:26] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:26] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:26] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:26] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:28] [AutoResearch] ========== Trial 69/100 ==========
[2026-04-13 01:06:28] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:06:28] UCB=1.9577 mu=1.6358 sigma=0.1610 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.001941517120125575}
[2026-04-13 01:06:28] UCB=1.4772 mu=1.1750 sigma=0.1511 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0018578485850394286}
[2026-04-13 01:06:28] UCB=1.4426 mu=1.1328 sigma=0.1549 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0035239490635924774}
[2026-04-13 01:06:28] UCB=1.4413 mu=1.0452 sigma=0.1981 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0036342662642959655}
[2026-04-13 01:06:28] UCB=1.3373 mu=1.0434 sigma=0.1469 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.0029010762882469504}
[2026-04-13 01:06:28] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.001941517120125575, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:30] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.001942
[2026-04-13 01:06:39] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:06:39] [AutoResearch] mean_reward=70.9996
[2026-04-13 01:06:39] [AutoResearch] === Trial 69 Summary ===
[2026-04-13 01:06:39] Total runs in history: 87
[2026-04-13 01:06:39] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:39] Top 5 results:
[2026-04-13 01:06:39] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:39] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:39] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:39] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:39] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:41] [AutoResearch] ========== Trial 70/100 ==========
[2026-04-13 01:06:41] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:06:41] UCB=1.3844 mu=1.0722 sigma=0.1561 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0022822154073749816}
[2026-04-13 01:06:41] UCB=1.3433 mu=0.9570 sigma=0.1932 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.002470997479557751}
[2026-04-13 01:06:41] UCB=1.3234 mu=1.1054 sigma=0.1090 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0010815897302815733}
[2026-04-13 01:06:41] UCB=1.2684 mu=0.9257 sigma=0.1714 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.002572880361015175}
[2026-04-13 01:06:41] UCB=1.2022 mu=0.8520 sigma=0.1751 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0017673937581157142}
[2026-04-13 01:06:41] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0022822154073749816, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:43] [AutoResearch] Launching job: n_steer=8 n_throttle=5 lr=0.002282
[2026-04-13 01:06:51] [AutoResearch] Job finished in 7.9s, returncode=0
[2026-04-13 01:06:51] [AutoResearch] mean_reward=39.6091
[2026-04-13 01:06:51] [AutoResearch] === Trial 70 Summary ===
[2026-04-13 01:06:51] Total runs in history: 88
[2026-04-13 01:06:51] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:51] Top 5 results:
[2026-04-13 01:06:51] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:51] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:51] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:51] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:51] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:53] [AutoResearch] ========== Trial 71/100 ==========
[2026-04-13 01:06:53] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:06:53] UCB=1.6815 mu=1.3559 sigma=0.1628 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0038677663610547307}
[2026-04-13 01:06:53] UCB=1.4820 mu=1.2699 sigma=0.1061 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002133788012269342}
[2026-04-13 01:06:53] UCB=1.4407 mu=1.0930 sigma=0.1738 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.002077522029265502}
[2026-04-13 01:06:53] UCB=1.4355 mu=1.3229 sigma=0.0563 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0021095107539381886}
[2026-04-13 01:06:53] UCB=1.4144 mu=1.1384 sigma=0.1380 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.003570226570529891}
[2026-04-13 01:06:53] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0038677663610547307, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:06:55] [AutoResearch] Launching job: n_steer=7 n_throttle=3 lr=0.003868
[2026-04-13 01:07:03] [AutoResearch] Job finished in 8.1s, returncode=0
[2026-04-13 01:07:03] [AutoResearch] mean_reward=42.0696
[2026-04-13 01:07:03] [AutoResearch] === Trial 71 Summary ===
[2026-04-13 01:07:03] Total runs in history: 89
[2026-04-13 01:07:03] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:03] Top 5 results:
[2026-04-13 01:07:03] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:03] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:03] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:03] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:03] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:05] [AutoResearch] ========== Trial 72/100 ==========
[2026-04-13 01:07:05] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:07:05] UCB=1.3214 mu=1.0164 sigma=0.1525 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.00017532156584553645}
[2026-04-13 01:07:05] UCB=1.2832 mu=1.1746 sigma=0.0543 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002386810905546507}
[2026-04-13 01:07:05] UCB=1.2504 mu=0.9272 sigma=0.1616 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0011132553559260845}
[2026-04-13 01:07:05] UCB=1.2005 mu=0.8813 sigma=0.1596 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0008109384582655324}
[2026-04-13 01:07:05] UCB=1.1827 mu=1.1034 sigma=0.0397 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00028591296525435364}
[2026-04-13 01:07:05] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.00017532156584553645, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:07] [AutoResearch] Launching job: n_steer=6 n_throttle=3 lr=0.000175
[2026-04-13 01:07:15] [AutoResearch] Job finished in 8.2s, returncode=0
[2026-04-13 01:07:15] [AutoResearch] mean_reward=48.6265
[2026-04-13 01:07:15] [AutoResearch] === Trial 72 Summary ===
[2026-04-13 01:07:15] Total runs in history: 90
[2026-04-13 01:07:15] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:15] Top 5 results:
[2026-04-13 01:07:15] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:15] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:15] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:15] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:15] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:17] [AutoResearch] ========== Trial 73/100 ==========
[2026-04-13 01:07:17] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:07:17] UCB=1.5766 mu=1.3290 sigma=0.1238 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002341169336851555}
[2026-04-13 01:07:17] UCB=1.4827 mu=1.3570 sigma=0.0629 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002217701172725509}
[2026-04-13 01:07:17] UCB=1.4400 mu=1.1389 sigma=0.1506 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0005288053739159532}
[2026-04-13 01:07:17] UCB=1.3275 mu=1.1833 sigma=0.0721 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0023286115107221473}
[2026-04-13 01:07:17] UCB=1.2935 mu=0.8902 sigma=0.2016 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0003269367454435055}
[2026-04-13 01:07:17] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002341169336851555, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:19] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.002341
[2026-04-13 01:07:28] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:07:28] [AutoResearch] mean_reward=68.5359
[2026-04-13 01:07:28] [AutoResearch] === Trial 73 Summary ===
[2026-04-13 01:07:28] Total runs in history: 91
[2026-04-13 01:07:28] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:28] Top 5 results:
[2026-04-13 01:07:28] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:28] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:28] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:28] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:28] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:30] [AutoResearch] ========== Trial 74/100 ==========
[2026-04-13 01:07:30] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:07:30] UCB=1.3660 mu=1.0018 sigma=0.1821 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.00013118450437270035}
[2026-04-13 01:07:30] UCB=1.2592 mu=0.9240 sigma=0.1676 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0005908326484223881}
[2026-04-13 01:07:30] UCB=1.1911 mu=1.0274 sigma=0.0819 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.00026981748197182757}
[2026-04-13 01:07:30] UCB=1.1763 mu=0.7784 sigma=0.1990 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0006889612770841941}
[2026-04-13 01:07:30] UCB=1.1725 mu=0.6759 sigma=0.2483 params={'n_steer': 4, 'n_throttle': 4, 'learning_rate': 0.00023334717896002257}
[2026-04-13 01:07:30] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.00013118450437270035, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:32] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.000131
[2026-04-13 01:07:41] [AutoResearch] Job finished in 8.1s, returncode=0
[2026-04-13 01:07:41] [AutoResearch] mean_reward=51.9963
[2026-04-13 01:07:41] [AutoResearch] === Trial 74 Summary ===
[2026-04-13 01:07:41] Total runs in history: 92
[2026-04-13 01:07:41] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:41] Top 5 results:
[2026-04-13 01:07:41] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:41] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:41] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:41] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:41] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:43] [AutoResearch] ========== Trial 75/100 ==========
[2026-04-13 01:07:43] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:07:43] UCB=1.6588 mu=1.2882 sigma=0.1853 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.00046117001666045514}
[2026-04-13 01:07:43] UCB=1.5870 mu=1.1905 sigma=0.1982 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0005040372864607814}
[2026-04-13 01:07:43] UCB=1.5701 mu=1.2283 sigma=0.1709 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.00030771275380513807}
[2026-04-13 01:07:43] UCB=1.4998 mu=1.1746 sigma=0.1626 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002117408613900014}
[2026-04-13 01:07:43] UCB=1.4899 mu=1.2462 sigma=0.1218 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.00027287889384595645}
[2026-04-13 01:07:43] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.00046117001666045514, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:45] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.000461
[2026-04-13 01:07:53] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:07:53] [AutoResearch] mean_reward=56.1605
[2026-04-13 01:07:53] [AutoResearch] === Trial 75 Summary ===
[2026-04-13 01:07:53] Total runs in history: 93
[2026-04-13 01:07:53] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:53] Top 5 results:
[2026-04-13 01:07:53] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:53] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:53] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:53] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:53] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:55] [AutoResearch] ========== Trial 76/100 ==========
[2026-04-13 01:07:55] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:07:55] UCB=1.4214 mu=1.3384 sigma=0.0415 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.001347654327627616}
[2026-04-13 01:07:55] UCB=1.3225 mu=1.2357 sigma=0.0434 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0014249730012658279}
[2026-04-13 01:07:55] UCB=1.2768 mu=1.0004 sigma=0.1382 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012528857120195475}
[2026-04-13 01:07:55] UCB=1.1969 mu=0.8763 sigma=0.1603 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0025219248336333005}
[2026-04-13 01:07:55] UCB=1.1925 mu=1.0750 sigma=0.0587 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002059596645381912}
[2026-04-13 01:07:55] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.001347654327627616, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:07:57] [AutoResearch] Launching job: n_steer=8 n_throttle=2 lr=0.001348
[2026-04-13 01:08:06] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:08:06] [AutoResearch] mean_reward=81.6865
[2026-04-13 01:08:06] [AutoResearch] === Trial 76 Summary ===
[2026-04-13 01:08:06] Total runs in history: 94
[2026-04-13 01:08:06] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:06] Top 5 results:
[2026-04-13 01:08:06] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:06] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:06] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:06] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:06] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:08] [AutoResearch] ========== Trial 77/100 ==========
[2026-04-13 01:08:08] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:08:08] UCB=1.4578 mu=1.1769 sigma=0.1404 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0017917653607137495}
[2026-04-13 01:08:08] UCB=1.3159 mu=0.9814 sigma=0.1672 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.002102833121140987}
[2026-04-13 01:08:08] UCB=1.2889 mu=1.0023 sigma=0.1433 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0009580862420739903}
[2026-04-13 01:08:08] UCB=1.2791 mu=0.9342 sigma=0.1724 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.002221904731396675}
[2026-04-13 01:08:08] UCB=1.2584 mu=1.0242 sigma=0.1171 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0024983608109220394}
[2026-04-13 01:08:08] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0017917653607137495, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:10] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.001792
[2026-04-13 01:08:19] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:08:19] [AutoResearch] mean_reward=61.0567
[2026-04-13 01:08:19] [AutoResearch] === Trial 77 Summary ===
[2026-04-13 01:08:19] Total runs in history: 95
[2026-04-13 01:08:19] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:19] Top 5 results:
[2026-04-13 01:08:19] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:19] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:19] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:19] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:19] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:21] [AutoResearch] ========== Trial 78/100 ==========
[2026-04-13 01:08:21] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:08:21] UCB=1.3446 mu=1.1068 sigma=0.1189 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0023780235745902933}
[2026-04-13 01:08:21] UCB=1.3087 mu=1.0042 sigma=0.1522 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0025258216322202012}
[2026-04-13 01:08:21] UCB=1.1681 mu=0.8774 sigma=0.1453 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002733459266081019}
[2026-04-13 01:08:21] UCB=1.1126 mu=0.7712 sigma=0.1707 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.002257683133868499}
[2026-04-13 01:08:21] UCB=1.0971 mu=0.8188 sigma=0.1391 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0034158263786659266}
[2026-04-13 01:08:21] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0023780235745902933, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:23] [AutoResearch] Launching job: n_steer=7 n_throttle=4 lr=0.002378
[2026-04-13 01:08:31] [AutoResearch] Job finished in 8.0s, returncode=0
[2026-04-13 01:08:31] [AutoResearch] mean_reward=38.546
[2026-04-13 01:08:31] [AutoResearch] === Trial 78 Summary ===
[2026-04-13 01:08:31] Total runs in history: 96
[2026-04-13 01:08:31] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:31] Top 5 results:
[2026-04-13 01:08:31] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:31] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:31] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:31] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:31] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:33] [AutoResearch] ========== Trial 79/100 ==========
[2026-04-13 01:08:33] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:08:33] UCB=1.6215 mu=1.3294 sigma=0.1461 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002028760331744987}
[2026-04-13 01:08:33] UCB=1.5806 mu=0.3378 sigma=0.6214 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.004998289424338085}
[2026-04-13 01:08:33] UCB=1.4475 mu=1.1825 sigma=0.1325 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005062885881127107}
[2026-04-13 01:08:33] UCB=1.4376 mu=1.1007 sigma=0.1685 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0005609744130017323}
[2026-04-13 01:08:33] UCB=1.4294 mu=1.0627 sigma=0.1833 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0004649220698835301}
[2026-04-13 01:08:33] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002028760331744987, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:35] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.002029
[2026-04-13 01:08:43] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:08:43] [AutoResearch] mean_reward=69.4453
[2026-04-13 01:08:43] [AutoResearch] === Trial 79 Summary ===
[2026-04-13 01:08:43] Total runs in history: 97
[2026-04-13 01:08:43] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:43] Top 5 results:
[2026-04-13 01:08:43] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:43] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:43] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:43] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:43] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:45] [AutoResearch] ========== Trial 80/100 ==========
[2026-04-13 01:08:45] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:08:45] UCB=1.6606 mu=1.3717 sigma=0.1445 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0016067312277644138}
[2026-04-13 01:08:45] UCB=1.6509 mu=1.4214 sigma=0.1147 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0016703305469950702}
[2026-04-13 01:08:45] UCB=1.4507 mu=1.1467 sigma=0.1520 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0012979064260646488}
[2026-04-13 01:08:45] UCB=1.4179 mu=1.2213 sigma=0.0983 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0016635199081980558}
[2026-04-13 01:08:45] UCB=1.4073 mu=1.0428 sigma=0.1822 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0003019160616348487}
[2026-04-13 01:08:45] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0016067312277644138, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:47] [AutoResearch] Launching job: n_steer=7 n_throttle=4 lr=0.001607
[2026-04-13 01:08:55] [AutoResearch] Job finished in 8.2s, returncode=0
[2026-04-13 01:08:55] [AutoResearch] mean_reward=50.2438
[2026-04-13 01:08:55] [AutoResearch] === Trial 80 Summary ===
[2026-04-13 01:08:55] Total runs in history: 98
[2026-04-13 01:08:55] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:55] Top 5 results:
[2026-04-13 01:08:55] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:55] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:55] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:55] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:55] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:57] [AutoResearch] ========== Trial 81/100 ==========
[2026-04-13 01:08:57] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:08:57] UCB=1.2158 mu=1.1355 sigma=0.0401 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005893003500149926}
[2026-04-13 01:08:57] UCB=1.2012 mu=0.8107 sigma=0.1952 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.983700793219779e-05}
[2026-04-13 01:08:57] UCB=1.1753 mu=0.8678 sigma=0.1538 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002428500468202209}
[2026-04-13 01:08:57] UCB=1.1692 mu=0.7386 sigma=0.2153 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.0001260609890009103}
[2026-04-13 01:08:57] UCB=1.1148 mu=0.8273 sigma=0.1437 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002218936026390214}
[2026-04-13 01:08:57] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005893003500149926, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:08:59] [AutoResearch] Launching job: n_steer=7 n_throttle=3 lr=0.000589
[2026-04-13 01:09:08] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:09:08] [AutoResearch] mean_reward=75.7692
[2026-04-13 01:09:08] [AutoResearch] === Trial 81 Summary ===
[2026-04-13 01:09:08] Total runs in history: 99
[2026-04-13 01:09:08] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:08] Top 5 results:
[2026-04-13 01:09:08] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:08] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:08] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:08] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:08] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:10]
[AutoResearch] ========== Trial 82/100 ==========
[2026-04-13 01:09:10] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:09:10] UCB=1.2987 mu=0.9741 sigma=0.1623 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002001997141004719}
[2026-04-13 01:09:10] UCB=1.1998 mu=0.9142 sigma=0.1428 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.001186821589727907}
[2026-04-13 01:09:10] UCB=1.1643 mu=0.9181 sigma=0.1231 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.00196209978200202}
[2026-04-13 01:09:10] UCB=1.1538 mu=-0.2714 sigma=0.7126 params={'n_steer': 4, 'n_throttle': 4, 'learning_rate': 0.004965370651401135}
[2026-04-13 01:09:10] UCB=1.1366 mu=0.6394 sigma=0.2486 params={'n_steer': 4, 'n_throttle': 4, 'learning_rate': 5.963217036030606e-05}
[2026-04-13 01:09:10] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002001997141004719, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:12] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.002002
[2026-04-13 01:09:21] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:09:21] [AutoResearch] mean_reward=65.9396
[2026-04-13 01:09:21] [AutoResearch] === Trial 82 Summary ===
[2026-04-13 01:09:21] Total runs in history: 100
[2026-04-13 01:09:21] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:21] Top 5 results:
[2026-04-13 01:09:21] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:21] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:21] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:21] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:21] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:23]
[AutoResearch] ========== Trial 83/100 ==========
[2026-04-13 01:09:23] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:09:23] UCB=1.4071 mu=1.2497 sigma=0.0787 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 5.336725017475624e-05}
[2026-04-13 01:09:23] UCB=1.3390 mu=1.1631 sigma=0.0880 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.001193830106390006}
[2026-04-13 01:09:23] UCB=1.3137 mu=1.0688 sigma=0.1225 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0011118360561132383}
[2026-04-13 01:09:23] UCB=1.2002 mu=0.8542 sigma=0.1730 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0008786550152382135}
[2026-04-13 01:09:23] UCB=1.1995 mu=0.8922 sigma=0.1536 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0008107696947442385}
[2026-04-13 01:09:23] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 5, 'learning_rate': 5.336725017475624e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:25] [AutoResearch] Launching job: n_steer=3 n_throttle=5 lr=0.000053
[2026-04-13 01:09:33] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:09:33] [AutoResearch] mean_reward=54.5207
[2026-04-13 01:09:33] [AutoResearch] === Trial 83 Summary ===
[2026-04-13 01:09:33] Total runs in history: 101
[2026-04-13 01:09:33] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:33] Top 5 results:
[2026-04-13 01:09:33] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:33] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:33] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:33] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:33] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:35]
[AutoResearch] ========== Trial 84/100 ==========
[2026-04-13 01:09:35] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:09:35] UCB=5.3906 mu=4.7587 sigma=0.3159 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.0010190081139636001}
[2026-04-13 01:09:35] UCB=4.6531 mu=3.6801 sigma=0.4865 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.0014157582283546982}
[2026-04-13 01:09:35] UCB=4.2436 mu=3.6231 sigma=0.3103 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.0009771170803992335}
[2026-04-13 01:09:35] UCB=3.7720 mu=2.7046 sigma=0.5337 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.001891485956812575}
[2026-04-13 01:09:35] UCB=3.4614 mu=2.6373 sigma=0.4121 params={'n_steer': 4, 'n_throttle': 5, 'learning_rate': 0.0015985230919204777}
[2026-04-13 01:09:35] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.0010190081139636001, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:37] [AutoResearch] Launching job: n_steer=3 n_throttle=5 lr=0.001019
[2026-04-13 01:09:46] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:09:46] [AutoResearch] mean_reward=60.3526
[2026-04-13 01:09:46] [AutoResearch] === Trial 84 Summary ===
[2026-04-13 01:09:46] Total runs in history: 102
[2026-04-13 01:09:46] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:46] Top 5 results:
[2026-04-13 01:09:46] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:46] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:46] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:46] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:46] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:48]
[AutoResearch] ========== Trial 85/100 ==========
[2026-04-13 01:09:48] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:09:48] UCB=1.2515 mu=0.9691 sigma=0.1412 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0008346519566613488}
[2026-04-13 01:09:48] UCB=1.2435 mu=0.7422 sigma=0.2506 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 7.186271283250567e-05}
[2026-04-13 01:09:48] UCB=1.1825 mu=0.8652 sigma=0.1586 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0038008113107174672}
[2026-04-13 01:09:48] UCB=1.1489 mu=0.8862 sigma=0.1314 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.003773630646287029}
[2026-04-13 01:09:48] UCB=1.0591 mu=0.7890 sigma=0.1351 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0032512487116587102}
[2026-04-13 01:09:48] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0008346519566613488, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:50] [AutoResearch] Launching job: n_steer=8 n_throttle=2 lr=0.000835
[2026-04-13 01:09:58] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:09:58] [AutoResearch] mean_reward=63.5974
[2026-04-13 01:09:58] [AutoResearch] === Trial 85 Summary ===
[2026-04-13 01:09:58] Total runs in history: 103
[2026-04-13 01:09:58] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:58] Top 5 results:
[2026-04-13 01:09:58] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:58] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:58] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:58] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:09:58] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:00]
[AutoResearch] ========== Trial 86/100 ==========
[2026-04-13 01:10:00] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:10:00] UCB=1.2294 mu=1.0832 sigma=0.0731 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012330311718626993}
[2026-04-13 01:10:00] UCB=1.2252 mu=-0.5373 sigma=0.8813 params={'n_steer': 4, 'n_throttle': 4, 'learning_rate': 0.004972949792562931}
[2026-04-13 01:10:00] UCB=1.1680 mu=0.9651 sigma=0.1015 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0011551857897432129}
[2026-04-13 01:10:00] UCB=1.1514 mu=1.0795 sigma=0.0359 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005370734894558025}
[2026-04-13 01:10:00] UCB=1.1423 mu=0.8480 sigma=0.1471 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0031720152540253292}
[2026-04-13 01:10:00] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012330311718626993, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:02] [AutoResearch] Launching job: n_steer=8 n_throttle=2 lr=0.001233
[2026-04-13 01:10:11] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:10:11] [AutoResearch] mean_reward=63.9004
[2026-04-13 01:10:11] [AutoResearch] === Trial 86 Summary ===
[2026-04-13 01:10:11] Total runs in history: 104
[2026-04-13 01:10:11] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:11] Top 5 results:
[2026-04-13 01:10:11] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:11] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:11] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:11] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:11] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:13]
[AutoResearch] ========== Trial 87/100 ==========
[2026-04-13 01:10:13] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:10:13] UCB=1.1567 mu=1.0800 sigma=0.0384 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00048208107033808066}
[2026-04-13 01:10:13] UCB=1.1456 mu=0.8438 sigma=0.1509 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.003631525110926688}
[2026-04-13 01:10:13] UCB=1.1343 mu=0.7311 sigma=0.2016 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.00010442019610316827}
[2026-04-13 01:10:13] UCB=1.1219 mu=0.8290 sigma=0.1465 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0009493499508042529}
[2026-04-13 01:10:13] UCB=1.1060 mu=0.6254 sigma=0.2403 params={'n_steer': 4, 'n_throttle': 4, 'learning_rate': 5.257460836089584e-05}
[2026-04-13 01:10:13] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00048208107033808066, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:15] [AutoResearch] Launching job: n_steer=7 n_throttle=3 lr=0.000482
[2026-04-13 01:10:24] [AutoResearch] Job finished in 8.9s, returncode=0
[2026-04-13 01:10:24] [AutoResearch] mean_reward=100.167
[2026-04-13 01:10:24] [AutoResearch] === Trial 87 Summary ===
[2026-04-13 01:10:24] Total runs in history: 105
[2026-04-13 01:10:24] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:24] Top 5 results:
[2026-04-13 01:10:24] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:24] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:24] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:24] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:24] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:26]
[AutoResearch] ========== Trial 88/100 ==========
[2026-04-13 01:10:26] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:10:26] UCB=1.3792 mu=-0.3378 sigma=0.8585 params={'n_steer': 3, 'n_throttle': 4, 'learning_rate': 0.004992343168718288}
[2026-04-13 01:10:26] UCB=1.2550 mu=1.0658 sigma=0.0946 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.0006745838642859029}
[2026-04-13 01:10:26] UCB=1.2196 mu=0.9178 sigma=0.1509 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00041290544985278976}
[2026-04-13 01:10:26] UCB=1.1446 mu=0.7990 sigma=0.1728 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0006922921062978035}
[2026-04-13 01:10:26] UCB=1.1003 mu=0.9672 sigma=0.0666 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0026186347140936565}
[2026-04-13 01:10:26] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 4, 'learning_rate': 0.004992343168718288, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:28] [AutoResearch] Launching job: n_steer=3 n_throttle=4 lr=0.004992
[2026-04-13 01:10:36] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:10:36] [AutoResearch] mean_reward=47.6472
[2026-04-13 01:10:36] [AutoResearch] === Trial 88 Summary ===
[2026-04-13 01:10:36] Total runs in history: 106
[2026-04-13 01:10:36] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:36] Top 5 results:
[2026-04-13 01:10:36] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:36] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:36] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:36] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:36] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:38]
[AutoResearch] ========== Trial 89/100 ==========
[2026-04-13 01:10:38] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:10:38] UCB=1.3491 mu=1.1258 sigma=0.1116 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.0004602054739918405}
[2026-04-13 01:10:38] UCB=1.1814 mu=0.9007 sigma=0.1404 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0006109623401257269}
[2026-04-13 01:10:38] UCB=1.1621 mu=0.8172 sigma=0.1725 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.002111726399501471}
[2026-04-13 01:10:38] UCB=1.1417 mu=0.8709 sigma=0.1354 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001071329409929369}
[2026-04-13 01:10:38] UCB=1.1011 mu=0.7567 sigma=0.1722 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0037386364813561963}
[2026-04-13 01:10:38] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.0004602054739918405, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:40] [AutoResearch] Launching job: n_steer=3 n_throttle=5 lr=0.000460
[2026-04-13 01:10:49] [AutoResearch] Job finished in 9.1s, returncode=0
[2026-04-13 01:10:49] [AutoResearch] mean_reward=81.8086
[2026-04-13 01:10:49] [AutoResearch] === Trial 89 Summary ===
[2026-04-13 01:10:49] Total runs in history: 107
[2026-04-13 01:10:49] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:49] Top 5 results:
[2026-04-13 01:10:49] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:49] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:49] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:49] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:49] mean_reward=103.9999 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012261414232850496, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:51]
[AutoResearch] ========== Trial 90/100 ==========
[2026-04-13 01:10:51] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:10:51] UCB=1.4435 mu=0.9886 sigma=0.2275 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05}
[2026-04-13 01:10:51] UCB=1.2784 mu=0.9679 sigma=0.1552 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0022322376609316364}
[2026-04-13 01:10:51] UCB=1.2392 mu=1.0132 sigma=0.1130 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0003044420490197477}
[2026-04-13 01:10:51] UCB=1.2348 mu=1.0908 sigma=0.0720 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00046807669724663087}
[2026-04-13 01:10:51] UCB=1.1840 mu=0.8548 sigma=0.1646 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00034824842199368723}
[2026-04-13 01:10:51] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:10:54] [AutoResearch] Launching job: n_steer=5 n_throttle=4 lr=0.000089
[2026-04-13 01:11:03] [AutoResearch] Job finished in 9.6s, returncode=0
[2026-04-13 01:11:03] [AutoResearch] mean_reward=105.5329
[2026-04-13 01:11:03] [AutoResearch] === Trial 90 Summary ===
[2026-04-13 01:11:03] Total runs in history: 108
[2026-04-13 01:11:03] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:03] Top 5 results:
[2026-04-13 01:11:03] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:03] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:03] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:03] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:03] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:05]
[AutoResearch] ========== Trial 91/100 ==========
[2026-04-13 01:11:05] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:11:05] UCB=2.0770 mu=1.7687 sigma=0.1542 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.00011198018382533822}
[2026-04-13 01:11:05] UCB=1.8709 mu=1.5816 sigma=0.1447 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.00027551828644932323}
[2026-04-13 01:11:05] UCB=1.6610 mu=1.2937 sigma=0.1836 params={'n_steer': 4, 'n_throttle': 4, 'learning_rate': 0.0004170895205361135}
[2026-04-13 01:11:05] UCB=1.2101 mu=0.8014 sigma=0.2044 params={'n_steer': 4, 'n_throttle': 4, 'learning_rate': 0.0004445379470750576}
[2026-04-13 01:11:05] UCB=1.2029 mu=0.7655 sigma=0.2187 params={'n_steer': 4, 'n_throttle': 4, 'learning_rate': 0.0005032580756027205}
[2026-04-13 01:11:05] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.00011198018382533822, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:07] [AutoResearch] Launching job: n_steer=5 n_throttle=3 lr=0.000112
[2026-04-13 01:11:17] [AutoResearch] Job finished in 9.4s, returncode=0
[2026-04-13 01:11:17] [AutoResearch] mean_reward=93.477
[2026-04-13 01:11:17] [AutoResearch] === Trial 91 Summary ===
[2026-04-13 01:11:17] Total runs in history: 109
[2026-04-13 01:11:17] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:17] Top 5 results:
[2026-04-13 01:11:17] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:17] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:17] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:17] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:17] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:19]
[AutoResearch] ========== Trial 92/100 ==========
[2026-04-13 01:11:19] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:11:19] UCB=1.8614 mu=1.5991 sigma=0.1311 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.0002536050540973701}
[2026-04-13 01:11:19] UCB=1.1758 mu=0.8559 sigma=0.1600 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002308340951440408}
[2026-04-13 01:11:19] UCB=1.1734 mu=0.8548 sigma=0.1593 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.00044471562042723106}
[2026-04-13 01:11:19] UCB=1.1490 mu=0.7683 sigma=0.1904 params={'n_steer': 4, 'n_throttle': 4, 'learning_rate': 0.00041431428686904603}
[2026-04-13 01:11:19] UCB=1.0685 mu=0.8407 sigma=0.1139 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0006750902507197291}
[2026-04-13 01:11:19] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.0002536050540973701, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:21] [AutoResearch] Launching job: n_steer=5 n_throttle=4 lr=0.000254
[2026-04-13 01:11:29] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:11:29] [AutoResearch] mean_reward=67.7234
[2026-04-13 01:11:29] [AutoResearch] === Trial 92 Summary ===
[2026-04-13 01:11:29] Total runs in history: 110
[2026-04-13 01:11:29] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:29] Top 5 results:
[2026-04-13 01:11:29] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:29] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:29] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:29] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:29] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:31]
[AutoResearch] ========== Trial 93/100 ==========
[2026-04-13 01:11:31] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:11:31] UCB=1.7354 mu=1.4439 sigma=0.1457 params={'n_steer': 4, 'n_throttle': 4, 'learning_rate': 0.00013753277508119969}
[2026-04-13 01:11:31] UCB=1.3878 mu=1.1832 sigma=0.1023 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.00014509388722004827}
[2026-04-13 01:11:31] UCB=1.2719 mu=0.9431 sigma=0.1644 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0005864186638369855}
[2026-04-13 01:11:31] UCB=1.1488 mu=0.8527 sigma=0.1480 params={'n_steer': 6, 'n_throttle': 4, 'learning_rate': 0.00020831824314291537}
[2026-04-13 01:11:31] UCB=1.0961 mu=0.9749 sigma=0.0606 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0007179180203524914}
[2026-04-13 01:11:31] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 4, 'learning_rate': 0.00013753277508119969, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:33] [AutoResearch] Launching job: n_steer=4 n_throttle=4 lr=0.000138
[2026-04-13 01:11:42] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:11:42] [AutoResearch] mean_reward=70.2254
[2026-04-13 01:11:42] [AutoResearch] === Trial 93 Summary ===
[2026-04-13 01:11:42] Total runs in history: 111
[2026-04-13 01:11:42] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:42] Top 5 results:
[2026-04-13 01:11:42] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:42] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:42] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:42] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:42] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:44]
[AutoResearch] ========== Trial 94/100 ==========
[2026-04-13 01:11:44] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:11:44] UCB=1.7223 mu=1.4388 sigma=0.1418 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.00015453290965161844}
[2026-04-13 01:11:44] UCB=1.6381 mu=1.3523 sigma=0.1429 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 6.693352800203282e-05}
[2026-04-13 01:11:44] UCB=1.4221 mu=1.0687 sigma=0.1767 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010449536947293048}
[2026-04-13 01:11:44] UCB=1.4061 mu=1.0438 sigma=0.1811 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0011069062379367204}
[2026-04-13 01:11:44] UCB=1.3167 mu=0.9692 sigma=0.1737 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0019022982870572412}
[2026-04-13 01:11:44] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.00015453290965161844, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:46] [AutoResearch] Launching job: n_steer=5 n_throttle=4 lr=0.000155
[2026-04-13 01:11:55] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:11:55] [AutoResearch] mean_reward=59.2725
[2026-04-13 01:11:55] [AutoResearch] === Trial 94 Summary ===
[2026-04-13 01:11:55] Total runs in history: 112
[2026-04-13 01:11:55] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:55] Top 5 results:
[2026-04-13 01:11:55] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:55] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:55] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:55] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:55] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:57]
[AutoResearch] ========== Trial 95/100 ==========
[2026-04-13 01:11:57] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:11:57] UCB=1.3073 mu=1.0405 sigma=0.1334 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0019380284912298895}
[2026-04-13 01:11:57] UCB=1.3033 mu=0.9791 sigma=0.1621 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0007238521441296611}
[2026-04-13 01:11:57] UCB=1.2630 mu=0.9024 sigma=0.1803 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012833324272171528}
[2026-04-13 01:11:57] UCB=1.2347 mu=1.0504 sigma=0.0921 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002005535418223178}
[2026-04-13 01:11:57] UCB=1.2015 mu=0.8081 sigma=0.1967 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.00018092858054988116}
[2026-04-13 01:11:57] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0019380284912298895, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:11:59] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.001938
[2026-04-13 01:12:08] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:12:08] [AutoResearch] mean_reward=72.2123
[2026-04-13 01:12:08] [AutoResearch] === Trial 95 Summary ===
[2026-04-13 01:12:08] Total runs in history: 113
[2026-04-13 01:12:08] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:08] Top 5 results:
[2026-04-13 01:12:08] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:08] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:08] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:08] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:08] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:10]
[AutoResearch] ========== Trial 96/100 ==========
[2026-04-13 01:12:10] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:12:10] UCB=1.5963 mu=1.2724 sigma=0.1620 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00017259113073183038}
[2026-04-13 01:12:10] UCB=1.2499 mu=1.0013 sigma=0.1243 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002412581968174107}
[2026-04-13 01:12:10] UCB=1.2284 mu=1.0097 sigma=0.1093 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.00014768465066268227}
[2026-04-13 01:12:10] UCB=1.1874 mu=0.9783 sigma=0.1046 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0008036631096827625}
[2026-04-13 01:12:10] UCB=1.1440 mu=0.8503 sigma=0.1468 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0007605653731244607}
[2026-04-13 01:12:10] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00017259113073183038, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:12] [AutoResearch] Launching job: n_steer=7 n_throttle=3 lr=0.000173
[2026-04-13 01:12:21] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:12:21] [AutoResearch] mean_reward=79.6415
[2026-04-13 01:12:21] [AutoResearch] === Trial 96 Summary ===
[2026-04-13 01:12:21] Total runs in history: 114
[2026-04-13 01:12:21] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:21] Top 5 results:
[2026-04-13 01:12:21] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:21] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:21] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:21] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:21] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:23]
[AutoResearch] ========== Trial 97/100 ==========
[2026-04-13 01:12:23] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:12:23] UCB=1.5034 mu=1.1888 sigma=0.1573 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0008233713843841076}
[2026-04-13 01:12:23] UCB=1.2638 mu=0.9794 sigma=0.1422 params={'n_steer': 6, 'n_throttle': 4, 'learning_rate': 0.00013240839691821177}
[2026-04-13 01:12:23] UCB=1.2242 mu=0.9063 sigma=0.1590 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0006804510035824424}
[2026-04-13 01:12:23] UCB=1.1931 mu=1.0079 sigma=0.0926 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.00194380592649283}
[2026-04-13 01:12:23] UCB=1.1798 mu=1.0505 sigma=0.0646 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0006964002849883243}
[2026-04-13 01:12:23] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0008233713843841076, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:25] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.000823
[2026-04-13 01:12:33] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:12:33] [AutoResearch] mean_reward=89.6491
[2026-04-13 01:12:33] [AutoResearch] === Trial 97 Summary ===
[2026-04-13 01:12:33] Total runs in history: 115
[2026-04-13 01:12:33] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:33] Top 5 results:
[2026-04-13 01:12:33] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:33] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:33] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:33] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:33] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:35]
[AutoResearch] ========== Trial 98/100 ==========
[2026-04-13 01:12:35] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:12:35] UCB=1.5337 mu=1.3866 sigma=0.0735 params={'n_steer': 6, 'n_throttle': 4, 'learning_rate': 5.101696856963226e-05}
[2026-04-13 01:12:35] UCB=1.3443 mu=1.0300 sigma=0.1571 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0005915489870004928}
[2026-04-13 01:12:35] UCB=1.3420 mu=1.0193 sigma=0.1614 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0011505680865918906}
[2026-04-13 01:12:35] UCB=1.2544 mu=0.8872 sigma=0.1836 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.001216174633593528}
[2026-04-13 01:12:35] UCB=1.1673 mu=1.0735 sigma=0.0469 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0006001395193231157}
[2026-04-13 01:12:35] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 4, 'learning_rate': 5.101696856963226e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:37] [AutoResearch] Launching job: n_steer=6 n_throttle=4 lr=0.000051
[2026-04-13 01:12:46] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:12:46] [AutoResearch] mean_reward=77.685
[2026-04-13 01:12:46] [AutoResearch] === Trial 98 Summary ===
[2026-04-13 01:12:46] Total runs in history: 116
[2026-04-13 01:12:46] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:46] Top 5 results:
[2026-04-13 01:12:46] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:46] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:46] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:46] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:46] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:48]
[AutoResearch] ========== Trial 99/100 ==========
[2026-04-13 01:12:49] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:12:49] UCB=1.4919 mu=1.1488 sigma=0.1715 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0007663303008625854}
[2026-04-13 01:12:49] UCB=1.3947 mu=1.0572 sigma=0.1687 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.000913835605765118}
[2026-04-13 01:12:49] UCB=1.3590 mu=0.9982 sigma=0.1804 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0010176327472226293}
[2026-04-13 01:12:49] UCB=1.2690 mu=0.9219 sigma=0.1736 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012546322569282975}
[2026-04-13 01:12:49] UCB=1.1336 mu=0.9021 sigma=0.1158 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.00016758665102179138}
[2026-04-13 01:12:49] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0007663303008625854, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:51] [AutoResearch] Launching job: n_steer=5 n_throttle=3 lr=0.000766
[2026-04-13 01:12:59] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:12:59] [AutoResearch] mean_reward=56.452
[2026-04-13 01:12:59] [AutoResearch] === Trial 99 Summary ===
[2026-04-13 01:12:59] Total runs in history: 117
[2026-04-13 01:12:59] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:59] Top 5 results:
[2026-04-13 01:12:59] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:59] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:59] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:59] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:12:59] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:01]
[AutoResearch] ========== Trial 100/100 ==========
[2026-04-13 01:13:01] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:13:01] UCB=1.4148 mu=1.0676 sigma=0.1736 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0008136466756522354}
[2026-04-13 01:13:01] UCB=1.3537 mu=1.0027 sigma=0.1755 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0007707032592846195}
[2026-04-13 01:13:01] UCB=1.1273 mu=0.7874 sigma=0.1699 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0017056189036625962}
[2026-04-13 01:13:01] UCB=1.1056 mu=0.8225 sigma=0.1415 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0007575664272591687}
[2026-04-13 01:13:01] UCB=1.0309 mu=0.8933 sigma=0.0688 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0028874375773115203}
[2026-04-13 01:13:01] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0008136466756522354, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:03] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.000814
[2026-04-13 01:13:11] [AutoResearch] Job finished in 7.9s, returncode=0
[2026-04-13 01:13:11] [AutoResearch] mean_reward=40.5081
[2026-04-13 01:13:11] [AutoResearch] === Trial 100 Summary ===
[2026-04-13 01:13:11] Total runs in history: 118
[2026-04-13 01:13:11] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:11] Top 5 results:
[2026-04-13 01:13:11] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:11] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:11] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:11] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:11] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:13] [AutoResearch] All trials complete!
[2026-04-13 01:13:13] [AutoResearch] === Trial 100 Summary ===
[2026-04-13 01:13:13] Total runs in history: 118
[2026-04-13 01:13:13] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:13] Top 5 results:
[2026-04-13 01:13:13] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:13] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:13] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:13] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:13] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:24] ============================================================
[2026-04-13 01:13:24] [AutoResearch] Starting Karpathy-style autoresearch controller
[2026-04-13 01:13:24] [AutoResearch] Max trials: 200
[2026-04-13 01:13:24] [AutoResearch] Runner: /home/paulh/projects/donkeycar-rl-autoresearch/agent/donkeycar_sb3_runner.py
[2026-04-13 01:13:24] [AutoResearch] Results: /home/paulh/projects/donkeycar-rl-autoresearch/agent/outerloop-results/autoresearch_results.jsonl
[2026-04-13 01:13:24] ============================================================
[2026-04-13 01:13:24] [AutoResearch] Loaded 118 existing result(s) from base sweep + history.
[2026-04-13 01:13:24] [AutoResearch] === Trial 0 Summary ===
[2026-04-13 01:13:24] Total runs in history: 118
[2026-04-13 01:13:24] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:24] Top 5 results:
[2026-04-13 01:13:24] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:24] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:24] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:24] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:24] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:24]
[AutoResearch] ========== Trial 1/200 ==========
[2026-04-13 01:13:24] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:13:24] UCB=1.1837 mu=1.0255 sigma=0.0791 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00030688692092544655}
[2026-04-13 01:13:24] UCB=1.1799 mu=0.8235 sigma=0.1782 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.003584059690587128}
[2026-04-13 01:13:24] UCB=1.0989 mu=0.8853 sigma=0.1068 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0036680681282239692}
[2026-04-13 01:13:24] UCB=1.0944 mu=0.7565 sigma=0.1689 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0023708301469378096}
[2026-04-13 01:13:24] UCB=1.0873 mu=0.8230 sigma=0.1321 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0017823350780525212}
[2026-04-13 01:13:24] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00030688692092544655, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:26] [AutoResearch] Launching job: n_steer=7 n_throttle=3 lr=0.000307
[2026-04-13 01:13:35] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:13:35] [AutoResearch] mean_reward=64.8119
[2026-04-13 01:13:35] [AutoResearch] === Trial 1 Summary ===
[2026-04-13 01:13:35] Total runs in history: 119
[2026-04-13 01:13:35] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:35] Top 5 results:
[2026-04-13 01:13:35] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:35] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:35] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:35] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:35] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:37]
[AutoResearch] ========== Trial 2/200 ==========
[2026-04-13 01:13:37] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:13:37] UCB=1.2371 mu=0.9371 sigma=0.1500 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.003589159194684833}
[2026-04-13 01:13:37] UCB=1.1660 mu=0.9799 sigma=0.0930 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002115618686941948}
[2026-04-13 01:13:37] UCB=1.1305 mu=0.7746 sigma=0.1780 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001108464726735241}
[2026-04-13 01:13:37] UCB=1.1203 mu=0.7754 sigma=0.1724 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0009117787458105356}
[2026-04-13 01:13:37] UCB=1.0994 mu=0.7528 sigma=0.1733 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.002103911523542329}
[2026-04-13 01:13:37] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.003589159194684833, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:39] [AutoResearch] Launching job: n_steer=8 n_throttle=2 lr=0.003589
[2026-04-13 01:13:48] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:13:48] [AutoResearch] mean_reward=76.7165
[2026-04-13 01:13:48] [AutoResearch] === Trial 2 Summary ===
[2026-04-13 01:13:48] Total runs in history: 120
[2026-04-13 01:13:48] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:48] Top 5 results:
[2026-04-13 01:13:48] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:48] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:48] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:48] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:48] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:50]
[AutoResearch] ========== Trial 3/200 ==========
[2026-04-13 01:13:50] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:13:50] UCB=1.1870 mu=0.9524 sigma=0.1173 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005195574461063374}
[2026-04-13 01:13:50] UCB=1.1441 mu=0.9943 sigma=0.0749 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0006705072528193966}
[2026-04-13 01:13:50] UCB=1.1431 mu=0.9093 sigma=0.1169 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0006396403674041309}
[2026-04-13 01:13:50] UCB=1.1218 mu=0.7721 sigma=0.1749 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0008165372044114939}
[2026-04-13 01:13:50] UCB=1.1049 mu=0.7804 sigma=0.1622 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0035935154013706956}
[2026-04-13 01:13:50] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005195574461063374, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:13:52] [AutoResearch] Launching job: n_steer=7 n_throttle=3 lr=0.000520
[2026-04-13 01:14:00] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:14:00] [AutoResearch] mean_reward=78.3084
[2026-04-13 01:14:00] [AutoResearch] === Trial 3 Summary ===
[2026-04-13 01:14:00] Total runs in history: 121
[2026-04-13 01:14:00] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:00] Top 5 results:
[2026-04-13 01:14:00] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:00] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:00] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:00] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:00] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:02]
[AutoResearch] ========== Trial 4/200 ==========
[2026-04-13 01:14:02] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:14:02] UCB=1.4773 mu=1.3293 sigma=0.0740 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 9.684034278734162e-05}
[2026-04-13 01:14:02] UCB=1.1504 mu=1.0349 sigma=0.0577 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.00011871854952420045}
[2026-04-13 01:14:02] UCB=1.1482 mu=0.9719 sigma=0.0882 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0021125983919883778}
[2026-04-13 01:14:02] UCB=1.1309 mu=0.9933 sigma=0.0688 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0006005519527125052}
[2026-04-13 01:14:02] UCB=1.1280 mu=0.9632 sigma=0.0824 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002523556444910515}
[2026-04-13 01:14:02] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 4, 'learning_rate': 9.684034278734162e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:04] [AutoResearch] Launching job: n_steer=5 n_throttle=4 lr=0.000097
[2026-04-13 01:14:13] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:14:13] [AutoResearch] mean_reward=92.6769
[2026-04-13 01:14:13] [AutoResearch] === Trial 4 Summary ===
[2026-04-13 01:14:13] Total runs in history: 122
[2026-04-13 01:14:13] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:13] Top 5 results:
[2026-04-13 01:14:13] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:13] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:13] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:13] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:13] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:15]
[AutoResearch] ========== Trial 5/200 ==========
[2026-04-13 01:14:15] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:14:15] UCB=1.0823 mu=0.7708 sigma=0.1557 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010496856479407747}
[2026-04-13 01:14:15] UCB=1.0762 mu=0.7171 sigma=0.1796 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0009854511161931873}
[2026-04-13 01:14:15] UCB=1.0377 mu=0.7782 sigma=0.1297 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0006213466996247071}
[2026-04-13 01:14:15] UCB=1.0285 mu=0.6934 sigma=0.1675 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 5.915605841331967e-05}
[2026-04-13 01:14:15] UCB=1.0098 mu=0.7135 sigma=0.1482 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.0002327839810583217}
[2026-04-13 01:14:15] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010496856479407747, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:17] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.001050
[2026-04-13 01:14:26] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:14:26] [AutoResearch] mean_reward=90.5086
[2026-04-13 01:14:26] [AutoResearch] === Trial 5 Summary ===
[2026-04-13 01:14:26] Total runs in history: 123
[2026-04-13 01:14:26] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:26] Top 5 results:
[2026-04-13 01:14:26] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:26] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:26] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:26] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:26] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:28]
[AutoResearch] ========== Trial 6/200 ==========
[2026-04-13 01:14:28] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:14:28] UCB=1.8500 mu=1.6571 sigma=0.0964 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 6.872427398026235e-05}
[2026-04-13 01:14:28] UCB=1.2777 mu=1.0458 sigma=0.1159 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0001706350451155647}
[2026-04-13 01:14:28] UCB=1.1633 mu=0.8584 sigma=0.1525 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0010376904845925343}
[2026-04-13 01:14:28] UCB=1.1570 mu=0.9763 sigma=0.0904 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.00014451415502566363}
[2026-04-13 01:14:28] UCB=1.0953 mu=0.7327 sigma=0.1813 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0013072251906760337}
[2026-04-13 01:14:28] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 4, 'learning_rate': 6.872427398026235e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:30] [AutoResearch] Launching job: n_steer=5 n_throttle=4 lr=0.000069
[2026-04-13 01:14:39] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:14:39] [AutoResearch] mean_reward=65.5647
[2026-04-13 01:14:39] [AutoResearch] === Trial 6 Summary ===
[2026-04-13 01:14:39] Total runs in history: 124
[2026-04-13 01:14:39] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:39] Top 5 results:
[2026-04-13 01:14:39] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:39] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:39] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:39] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:39] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:41]
[AutoResearch] ========== Trial 7/200 ==========
[2026-04-13 01:14:41] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:14:41] UCB=1.2764 mu=0.9883 sigma=0.1441 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.00012295561575993287}
[2026-04-13 01:14:41] UCB=1.2010 mu=0.9603 sigma=0.1203 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0004480496725632844}
[2026-04-13 01:14:41] UCB=1.1626 mu=0.8173 sigma=0.1727 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020735435597937584}
[2026-04-13 01:14:41] UCB=1.1142 mu=0.8259 sigma=0.1441 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.003806687426864916}
[2026-04-13 01:14:41] UCB=1.0252 mu=0.7086 sigma=0.1583 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0008522496566784977}
[2026-04-13 01:14:41] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.00012295561575993287, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:43] [AutoResearch] Launching job: n_steer=5 n_throttle=4 lr=0.000123
[2026-04-13 01:14:51] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:14:51] [AutoResearch] mean_reward=52.3437
[2026-04-13 01:14:51] [AutoResearch] === Trial 7 Summary ===
[2026-04-13 01:14:51] Total runs in history: 125
[2026-04-13 01:14:51] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:51] Top 5 results:
[2026-04-13 01:14:51] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:51] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:51] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:51] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:51] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:53]
[AutoResearch] ========== Trial 8/200 ==========
[2026-04-13 01:14:53] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:14:53] UCB=1.4582 mu=1.2162 sigma=0.1210 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 8.568477884043701e-05}
[2026-04-13 01:14:53] UCB=1.1709 mu=0.8690 sigma=0.1509 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 8.1321058627201e-05}
[2026-04-13 01:14:53] UCB=1.1292 mu=0.8036 sigma=0.1628 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.003593116108323929}
[2026-04-13 01:14:53] UCB=1.0970 mu=0.7850 sigma=0.1560 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002464657814683018}
[2026-04-13 01:14:53] UCB=1.0829 mu=0.7221 sigma=0.1804 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.003921477000966561}
[2026-04-13 01:14:53] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 3, 'learning_rate': 8.568477884043701e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:14:55] [AutoResearch] Launching job: n_steer=5 n_throttle=3 lr=0.000086
[2026-04-13 01:15:04] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:15:04] [AutoResearch] mean_reward=65.0679
[2026-04-13 01:15:04] [AutoResearch] === Trial 8 Summary ===
[2026-04-13 01:15:04] Total runs in history: 126
[2026-04-13 01:15:04] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:04] Top 5 results:
[2026-04-13 01:15:04] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:04] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:04] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:04] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:04] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:06]
[AutoResearch] ========== Trial 9/200 ==========
[2026-04-13 01:15:06] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:15:06] UCB=1.1856 mu=0.9349 sigma=0.1254 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00047172019197039587}
[2026-04-13 01:15:06] UCB=1.1714 mu=0.8228 sigma=0.1743 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0037381453552823678}
[2026-04-13 01:15:06] UCB=1.1073 mu=1.0093 sigma=0.0490 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005599905417831684}
[2026-04-13 01:15:06] UCB=1.0507 mu=0.7222 sigma=0.1643 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0024542111135629416}
[2026-04-13 01:15:06] UCB=1.0203 mu=0.7041 sigma=0.1581 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.00018481077169035025}
[2026-04-13 01:15:06] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00047172019197039587, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:08] [AutoResearch] Launching job: n_steer=7 n_throttle=3 lr=0.000472
[2026-04-13 01:15:16] [AutoResearch] Job finished in 8.1s, returncode=0
[2026-04-13 01:15:16] [AutoResearch] mean_reward=51.0534
[2026-04-13 01:15:16] [AutoResearch] === Trial 9 Summary ===
[2026-04-13 01:15:16] Total runs in history: 127
[2026-04-13 01:15:16] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:16] Top 5 results:
[2026-04-13 01:15:16] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:16] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:16] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:16] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:16] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:18]
[AutoResearch] ========== Trial 10/200 ==========
[2026-04-13 01:15:18] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:15:18] UCB=1.0664 mu=0.9008 sigma=0.0828 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0030277879006223116}
[2026-04-13 01:15:18] UCB=1.0663 mu=0.7187 sigma=0.1738 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.003567470482356219}
[2026-04-13 01:15:18] UCB=1.0511 mu=0.7557 sigma=0.1477 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0025382771458285743}
[2026-04-13 01:15:18] UCB=0.9989 mu=0.6642 sigma=0.1674 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0010338624251166376}
[2026-04-13 01:15:18] UCB=0.9612 mu=0.6575 sigma=0.1519 params={'n_steer': 6, 'n_throttle': 4, 'learning_rate': 9.902538291625305e-05}
[2026-04-13 01:15:18] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0030277879006223116, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:20] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.003028
[2026-04-13 01:15:29] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:15:29] [AutoResearch] mean_reward=83.6333
[2026-04-13 01:15:29] [AutoResearch] === Trial 10 Summary ===
[2026-04-13 01:15:29] Total runs in history: 128
[2026-04-13 01:15:29] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:29] Top 5 results:
[2026-04-13 01:15:29] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:29] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:29] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:29] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:29] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:31]
[AutoResearch] ========== Trial 11/200 ==========
[2026-04-13 01:15:31] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:15:31] UCB=1.1775 mu=0.8424 sigma=0.1676 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010494911856969611}
[2026-04-13 01:15:31] UCB=1.1049 mu=0.8468 sigma=0.1290 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00038887974765684525}
[2026-04-13 01:15:31] UCB=1.0713 mu=0.9146 sigma=0.0784 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0035753425015795978}
[2026-04-13 01:15:31] UCB=1.0264 mu=0.6630 sigma=0.1817 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0040408350430505}
[2026-04-13 01:15:31] UCB=1.0247 mu=0.6942 sigma=0.1652 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0037372313886840296}
[2026-04-13 01:15:31] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010494911856969611, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:33] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.001049
[2026-04-13 01:15:42] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:15:42] [AutoResearch] mean_reward=78.6034
[2026-04-13 01:15:42] [AutoResearch] === Trial 11 Summary ===
[2026-04-13 01:15:42] Total runs in history: 129
[2026-04-13 01:15:42] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:42] Top 5 results:
[2026-04-13 01:15:42] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:42] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:42] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:42] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:42] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:44]
[AutoResearch] ========== Trial 12/200 ==========
[2026-04-13 01:15:44] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:15:44] UCB=1.2005 mu=0.9556 sigma=0.1225 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.003732805608559358}
[2026-04-13 01:15:44] UCB=1.1786 mu=0.8278 sigma=0.1754 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010170189545727969}
[2026-04-13 01:15:44] UCB=1.0969 mu=0.7488 sigma=0.1740 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0009275899324666605}
[2026-04-13 01:15:44] UCB=1.0532 mu=0.7631 sigma=0.1450 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0033044280314105962}
[2026-04-13 01:15:44] UCB=1.0285 mu=0.7395 sigma=0.1445 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0034265170243920435}
[2026-04-13 01:15:44] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.003732805608559358, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:46] [AutoResearch] Launching job: n_steer=7 n_throttle=2 lr=0.003733
[2026-04-13 01:15:54] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:15:54] [AutoResearch] mean_reward=68.4295
[2026-04-13 01:15:54] [AutoResearch] === Trial 12 Summary ===
[2026-04-13 01:15:54] Total runs in history: 130
[2026-04-13 01:15:54] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:54] Top 5 results:
[2026-04-13 01:15:54] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:54] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:54] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:54] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:54] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:56]
[AutoResearch] ========== Trial 13/200 ==========
[2026-04-13 01:15:56] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:15:56] UCB=1.2530 mu=0.7690 sigma=0.2420 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.004970649502370244}
[2026-04-13 01:15:56] UCB=1.1944 mu=0.8653 sigma=0.1646 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0019862181132288452}
[2026-04-13 01:15:56] UCB=1.1015 mu=0.7613 sigma=0.1701 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.00020140221672122768}
[2026-04-13 01:15:56] UCB=1.0637 mu=0.7441 sigma=0.1598 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.00022260416696503574}
[2026-04-13 01:15:56] UCB=0.9749 mu=0.6703 sigma=0.1523 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.000946943139397055}
[2026-04-13 01:15:56] [AutoResearch] Proposed params: {'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.004970649502370244, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:15:58] [AutoResearch] Launching job: n_steer=9 n_throttle=2 lr=0.004971
[2026-04-13 01:16:06] [AutoResearch] Job finished in 7.9s, returncode=0
[2026-04-13 01:16:06] [AutoResearch] mean_reward=34.1511
[2026-04-13 01:16:06] [AutoResearch] === Trial 13 Summary ===
[2026-04-13 01:16:06] Total runs in history: 131
[2026-04-13 01:16:06] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:06] Top 5 results:
[2026-04-13 01:16:06] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:06] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:06] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:06] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:06] mean_reward=104.4376 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00045173785418973166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:08]
[AutoResearch] ========== Trial 14/200 ==========
[2026-04-13 01:16:08] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:16:08] UCB=1.2641 mu=-0.3969 sigma=0.8305 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236}
[2026-04-13 01:16:08] UCB=1.1759 mu=0.8590 sigma=0.1585 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00010539295519755278}
[2026-04-13 01:16:08] UCB=1.0302 mu=0.7477 sigma=0.1413 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.00015235550163579764}
[2026-04-13 01:16:08] UCB=0.9887 mu=0.7220 sigma=0.1334 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002866269604287722}
[2026-04-13 01:16:08] UCB=0.9805 mu=0.8945 sigma=0.0430 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0014277243582635284}
[2026-04-13 01:16:08] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:10] [AutoResearch] Launching job: n_steer=3 n_throttle=2 lr=0.004942
[2026-04-13 01:16:20] [AutoResearch] Job finished in 9.6s, returncode=0
[2026-04-13 01:16:20] [AutoResearch] mean_reward=106.8657
[2026-04-13 01:16:20] [AutoResearch] === Trial 14 Summary ===
[2026-04-13 01:16:20] Total runs in history: 132
[2026-04-13 01:16:20] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:20] Top 5 results:
[2026-04-13 01:16:20] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:20] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:20] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:20] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:20] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:22]
[AutoResearch] ========== Trial 15/200 ==========
[2026-04-13 01:16:22] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:16:22] UCB=1.9769 mu=0.7972 sigma=0.5898 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004962313696906092}
[2026-04-13 01:16:22] UCB=1.6826 mu=0.4920 sigma=0.5953 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004870158896665235}
[2026-04-13 01:16:22] UCB=1.4388 mu=0.7618 sigma=0.3385 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004612828434196954}
[2026-04-13 01:16:22] UCB=1.1678 mu=0.9077 sigma=0.1300 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0023289897300567373}
[2026-04-13 01:16:22] UCB=1.1133 mu=0.7675 sigma=0.1729 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020845909145895875}
[2026-04-13 01:16:22] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004962313696906092, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:24] [AutoResearch] Launching job: n_steer=3 n_throttle=3 lr=0.004962
[2026-04-13 01:16:33] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:16:33] [AutoResearch] mean_reward=84.4645
[2026-04-13 01:16:33] [AutoResearch] === Trial 15 Summary ===
[2026-04-13 01:16:33] Total runs in history: 133
[2026-04-13 01:16:33] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:33] Top 5 results:
[2026-04-13 01:16:33] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:33] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:33] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:33] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:33] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:35]
[AutoResearch] ========== Trial 16/200 ==========
[2026-04-13 01:16:35] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:16:35] UCB=2.2221 mu=1.7605 sigma=0.2308 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004936630447830499}
[2026-04-13 01:16:35] UCB=1.9704 mu=1.2566 sigma=0.3569 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004987931436331709}
[2026-04-13 01:16:35] UCB=1.8627 mu=1.3029 sigma=0.2799 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004728931333743095}
[2026-04-13 01:16:35] UCB=1.8559 mu=1.2393 sigma=0.3083 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004715214242290354}
[2026-04-13 01:16:35] UCB=1.8138 mu=1.2337 sigma=0.2901 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004663918619289079}
[2026-04-13 01:16:35] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004936630447830499, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:37] [AutoResearch] Launching job: n_steer=3 n_throttle=2 lr=0.004937
[2026-04-13 01:16:45] [AutoResearch] Job finished in 8.2s, returncode=0
[2026-04-13 01:16:45] [AutoResearch] mean_reward=51.6651
[2026-04-13 01:16:45] [AutoResearch] === Trial 16 Summary ===
[2026-04-13 01:16:45] Total runs in history: 134
[2026-04-13 01:16:45] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:45] Top 5 results:
[2026-04-13 01:16:45] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:45] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:45] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:45] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:45] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:47]
[AutoResearch] ========== Trial 17/200 ==========
[2026-04-13 01:16:47] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:16:47] UCB=1.2283 mu=0.4556 sigma=0.3863 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004876148233640193}
[2026-04-13 01:16:47] UCB=1.1600 mu=0.4097 sigma=0.3752 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004855685478035734}
[2026-04-13 01:16:47] UCB=1.0066 mu=0.7136 sigma=0.1465 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.003497495482779589}
[2026-04-13 01:16:47] UCB=0.9828 mu=0.7672 sigma=0.1078 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00026066599346656876}
[2026-04-13 01:16:47] UCB=0.9475 mu=0.7889 sigma=0.0793 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0018641082059065468}
[2026-04-13 01:16:47] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004876148233640193, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:49] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.004876
[2026-04-13 01:16:58] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:16:58] [AutoResearch] mean_reward=80.3169
[2026-04-13 01:16:58] [AutoResearch] === Trial 17 Summary ===
[2026-04-13 01:16:58] Total runs in history: 135
[2026-04-13 01:16:58] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:58] Top 5 results:
[2026-04-13 01:16:58] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:58] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:58] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:58] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:16:58] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:00]
[AutoResearch] ========== Trial 18/200 ==========
[2026-04-13 01:17:00] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:17:00] UCB=1.5375 mu=1.3249 sigma=0.1063 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004997848098034423}
[2026-04-13 01:17:00] UCB=1.2766 mu=0.9894 sigma=0.1436 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004874248521400943}
[2026-04-13 01:17:00] UCB=1.2687 mu=0.7192 sigma=0.2748 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.00484512788832915}
[2026-04-13 01:17:00] UCB=1.1848 mu=1.0494 sigma=0.0677 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004962653162707711}
[2026-04-13 01:17:00] UCB=1.0866 mu=0.8035 sigma=0.1416 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 7.701972708909523e-05}
[2026-04-13 01:17:00] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004997848098034423, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:02] [AutoResearch] Launching job: n_steer=3 n_throttle=3 lr=0.004998
[2026-04-13 01:17:12] [AutoResearch] Job finished in 9.3s, returncode=0
[2026-04-13 01:17:12] [AutoResearch] mean_reward=90.9515
[2026-04-13 01:17:12] [AutoResearch] === Trial 18 Summary ===
[2026-04-13 01:17:12] Total runs in history: 136
[2026-04-13 01:17:12] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:12] Top 5 results:
[2026-04-13 01:17:12] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:12] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:12] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:12] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:12] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:14]
[AutoResearch] ========== Trial 19/200 ==========
[2026-04-13 01:17:14] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:17:14] UCB=1.2396 mu=0.7948 sigma=0.2224 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004828810146372665}
[2026-04-13 01:17:14] UCB=1.1280 mu=0.8822 sigma=0.1229 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.001981766982389298}
[2026-04-13 01:17:14] UCB=1.0490 mu=0.7104 sigma=0.1693 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.00016893940524035075}
[2026-04-13 01:17:14] UCB=0.9519 mu=0.7923 sigma=0.0798 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002646148255963399}
[2026-04-13 01:17:14] UCB=0.9353 mu=0.8638 sigma=0.0357 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0013154605826687345}
[2026-04-13 01:17:14] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004828810146372665, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:16] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.004829
[2026-04-13 01:17:24] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:17:24] [AutoResearch] mean_reward=73.6553
[2026-04-13 01:17:24] [AutoResearch] === Trial 19 Summary ===
[2026-04-13 01:17:24] Total runs in history: 137
[2026-04-13 01:17:24] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:24] Top 5 results:
[2026-04-13 01:17:24] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:24] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:24] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:24] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:24] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:26]
[AutoResearch] ========== Trial 20/200 ==========
[2026-04-13 01:17:26] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:17:26] UCB=1.0068 mu=0.8462 sigma=0.0803 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0026605785625446647}
[2026-04-13 01:17:26] UCB=0.9953 mu=0.6802 sigma=0.1576 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.00043100093346606444}
[2026-04-13 01:17:26] UCB=0.9433 mu=0.7440 sigma=0.0996 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002531913231148961}
[2026-04-13 01:17:26] UCB=0.9164 mu=0.6387 sigma=0.1389 params={'n_steer': 4, 'n_throttle': 4, 'learning_rate': 0.00014051974176180622}
[2026-04-13 01:17:26] UCB=0.9074 mu=0.7697 sigma=0.0689 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0015691594208171265}
[2026-04-13 01:17:26] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0026605785625446647, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:28] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.002661
[2026-04-13 01:17:37] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:17:37] [AutoResearch] mean_reward=55.4628
[2026-04-13 01:17:37] [AutoResearch] === Trial 20 Summary ===
[2026-04-13 01:17:37] Total runs in history: 138
[2026-04-13 01:17:37] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:37] Top 5 results:
[2026-04-13 01:17:37] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:37] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:37] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:37] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:37] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:39]
[AutoResearch] ========== Trial 21/200 ==========
[2026-04-13 01:17:39] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:17:39] UCB=1.5988 mu=1.1202 sigma=0.2393 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004911175230960728}
[2026-04-13 01:17:39] UCB=1.3600 mu=0.9653 sigma=0.1974 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004900355122332494}
[2026-04-13 01:17:39] UCB=1.1487 mu=0.6024 sigma=0.2732 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004825153918170842}
[2026-04-13 01:17:39] UCB=1.0423 mu=0.7006 sigma=0.1709 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010367390647949173}
[2026-04-13 01:17:39] UCB=0.9986 mu=0.6472 sigma=0.1757 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0009570812318744727}
[2026-04-13 01:17:39] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004911175230960728, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:41] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.004911
[2026-04-13 01:17:49] [AutoResearch] Job finished in 7.9s, returncode=0
[2026-04-13 01:17:49] [AutoResearch] mean_reward=48.0429
[2026-04-13 01:17:49] [AutoResearch] === Trial 21 Summary ===
[2026-04-13 01:17:49] Total runs in history: 139
[2026-04-13 01:17:49] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:49] Top 5 results:
[2026-04-13 01:17:49] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:49] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:49] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:49] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:49] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:51]
[AutoResearch] ========== Trial 22/200 ==========
[2026-04-13 01:17:51] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:17:51] UCB=1.2236 mu=0.8818 sigma=0.1709 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004981279658413488}
[2026-04-13 01:17:51] UCB=0.9795 mu=0.0862 sigma=0.4466 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.004956773135281383}
[2026-04-13 01:17:51] UCB=0.9766 mu=0.6241 sigma=0.1762 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.002854527973776278}
[2026-04-13 01:17:51] UCB=0.9578 mu=0.6411 sigma=0.1584 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0032496125465852708}
[2026-04-13 01:17:51] UCB=0.9496 mu=0.5660 sigma=0.1918 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0034779739858346237}
[2026-04-13 01:17:51] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004981279658413488, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:17:53] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.004981
[2026-04-13 01:18:01] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:18:01] [AutoResearch] mean_reward=51.388
[2026-04-13 01:18:01] [AutoResearch] === Trial 22 Summary ===
[2026-04-13 01:18:01] Total runs in history: 140
[2026-04-13 01:18:01] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:01] Top 5 results:
[2026-04-13 01:18:01] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:01] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:01] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:01] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:01] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:04]
[AutoResearch] ========== Trial 23/200 ==========
[2026-04-13 01:18:04] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:18:04] UCB=3.9208 mu=3.3727 sigma=0.2740 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0039998399644586005}
[2026-04-13 01:18:04] UCB=3.8710 mu=3.2127 sigma=0.3292 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.003875211233591401}
[2026-04-13 01:18:04] UCB=3.4662 mu=2.8113 sigma=0.3274 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.003614252722332397}
[2026-04-13 01:18:04] UCB=3.2671 mu=2.4774 sigma=0.3948 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0035295423264421004}
[2026-04-13 01:18:04] UCB=3.2391 mu=2.8896 sigma=0.1748 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0043334012038109375}
[2026-04-13 01:18:04] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0039998399644586005, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:06] [AutoResearch] Launching job: n_steer=5 n_throttle=3 lr=0.004000
[2026-04-13 01:18:14] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:18:14] [AutoResearch] mean_reward=71.4069
[2026-04-13 01:18:14] [AutoResearch] === Trial 23 Summary ===
[2026-04-13 01:18:14] Total runs in history: 141
[2026-04-13 01:18:14] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:14] Top 5 results:
[2026-04-13 01:18:14] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:14] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:14] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:14] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:14] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:16]
[AutoResearch] ========== Trial 24/200 ==========
[2026-04-13 01:18:16] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:18:16] UCB=3.3580 mu=2.8869 sigma=0.2355 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004371390474272943}
[2026-04-13 01:18:16] UCB=2.6376 mu=2.2086 sigma=0.2145 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004046379222654533}
[2026-04-13 01:18:16] UCB=2.2151 mu=1.7254 sigma=0.2448 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004116042907085728}
[2026-04-13 01:18:16] UCB=2.1471 mu=1.6171 sigma=0.2650 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003899824680578241}
[2026-04-13 01:18:16] UCB=2.0741 mu=1.4952 sigma=0.2895 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004324512330628218}
[2026-04-13 01:18:16] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004371390474272943, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:18] [AutoResearch] Launching job: n_steer=3 n_throttle=3 lr=0.004371
[2026-04-13 01:18:27] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:18:27] [AutoResearch] mean_reward=52.0188
[2026-04-13 01:18:27] [AutoResearch] === Trial 24 Summary ===
[2026-04-13 01:18:27] Total runs in history: 142
[2026-04-13 01:18:27] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:27] Top 5 results:
[2026-04-13 01:18:27] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:27] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:27] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:27] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:27] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:29]
[AutoResearch] ========== Trial 25/200 ==========
[2026-04-13 01:18:29] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:18:29] UCB=1.0589 mu=0.7427 sigma=0.1581 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010459005205559967}
[2026-04-13 01:18:29] UCB=0.9907 mu=0.6478 sigma=0.1715 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.003005078281129521}
[2026-04-13 01:18:29] UCB=0.9758 mu=0.4372 sigma=0.2693 params={'n_steer': 4, 'n_throttle': 4, 'learning_rate': 0.004475866985672839}
[2026-04-13 01:18:29] UCB=0.9334 mu=0.5914 sigma=0.1710 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0006987414254614035}
[2026-04-13 01:18:29] UCB=0.9305 mu=0.5793 sigma=0.1756 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010283716078246636}
[2026-04-13 01:18:29] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010459005205559967, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:31] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.001046
[2026-04-13 01:18:41] [AutoResearch] Job finished in 9.5s, returncode=0
[2026-04-13 01:18:41] [AutoResearch] mean_reward=96.1376
[2026-04-13 01:18:41] [AutoResearch] === Trial 25 Summary ===
[2026-04-13 01:18:41] Total runs in history: 143
[2026-04-13 01:18:41] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:41] Top 5 results:
[2026-04-13 01:18:41] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:41] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:41] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:41] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:41] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:43]
[AutoResearch] ========== Trial 26/200 ==========
[2026-04-13 01:18:43] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:18:43] UCB=1.4350 mu=1.1047 sigma=0.1651 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.0049953727522973265}
[2026-04-13 01:18:43] UCB=1.0816 mu=0.5883 sigma=0.2467 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0044436823814281295}
[2026-04-13 01:18:43] UCB=1.0489 mu=0.7516 sigma=0.1487 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0008921662409648054}
[2026-04-13 01:18:43] UCB=0.9958 mu=0.6594 sigma=0.1682 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004655801345187677}
[2026-04-13 01:18:43] UCB=0.9316 mu=0.6982 sigma=0.1167 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0031588697131345535}
[2026-04-13 01:18:43] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.0049953727522973265, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:45] [AutoResearch] Launching job: n_steer=3 n_throttle=2 lr=0.004995
[2026-04-13 01:18:53] [AutoResearch] Job finished in 8.2s, returncode=0
[2026-04-13 01:18:53] [AutoResearch] mean_reward=42.7665
[2026-04-13 01:18:53] [AutoResearch] === Trial 26 Summary ===
[2026-04-13 01:18:53] Total runs in history: 144
[2026-04-13 01:18:53] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:53] Top 5 results:
[2026-04-13 01:18:53] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:53] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:53] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:53] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:53] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:55]
[AutoResearch] ========== Trial 27/200 ==========
[2026-04-13 01:18:55] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:18:55] UCB=3.4484 mu=2.7304 sigma=0.3590 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0037785432389428816}
[2026-04-13 01:18:55] UCB=3.2464 mu=2.6965 sigma=0.2750 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003932133649875772}
[2026-04-13 01:18:55] UCB=3.1669 mu=2.5394 sigma=0.3137 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0038467965624580953}
[2026-04-13 01:18:55] UCB=3.1312 mu=2.6554 sigma=0.2379 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0041471945422519516}
[2026-04-13 01:18:55] UCB=3.0831 mu=2.6962 sigma=0.1934 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004454127370269698}
[2026-04-13 01:18:55] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0037785432389428816, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:18:57] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.003779
[2026-04-13 01:19:06] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:19:06] [AutoResearch] mean_reward=64.9309
[2026-04-13 01:19:06] [AutoResearch] === Trial 27 Summary ===
[2026-04-13 01:19:06] Total runs in history: 145
[2026-04-13 01:19:06] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:06] Top 5 results:
[2026-04-13 01:19:06] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:06] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:06] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:06] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:06] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:08]
[AutoResearch] ========== Trial 28/200 ==========
[2026-04-13 01:19:08] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:19:08] UCB=1.6280 mu=1.2756 sigma=0.1762 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004680716875763725}
[2026-04-13 01:19:08] UCB=1.4031 mu=1.0598 sigma=0.1716 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004364443812977204}
[2026-04-13 01:19:08] UCB=1.3083 mu=1.1105 sigma=0.0989 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0047050183474512805}
[2026-04-13 01:19:08] UCB=1.0549 mu=0.7489 sigma=0.1530 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004531997930233946}
[2026-04-13 01:19:08] UCB=0.9987 mu=0.6390 sigma=0.1799 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0012195212348836396}
[2026-04-13 01:19:08] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004680716875763725, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:10] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.004681
[2026-04-13 01:19:18] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:19:18] [AutoResearch] mean_reward=62.9633
[2026-04-13 01:19:18] [AutoResearch] === Trial 28 Summary ===
[2026-04-13 01:19:18] Total runs in history: 146
[2026-04-13 01:19:18] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:18] Top 5 results:
[2026-04-13 01:19:18] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:18] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:18] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:18] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:18] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:20]
[AutoResearch] ========== Trial 29/200 ==========
[2026-04-13 01:19:20] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:19:20] UCB=1.6262 mu=1.2796 sigma=0.1733 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004587322834454481}
[2026-04-13 01:19:20] UCB=1.3867 mu=1.0558 sigma=0.1654 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.0049387487835517285}
[2026-04-13 01:19:20] UCB=1.3561 mu=1.0607 sigma=0.1477 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.0046335399170885465}
[2026-04-13 01:19:20] UCB=1.0629 mu=0.7184 sigma=0.1723 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0030130982948122944}
[2026-04-13 01:19:20] UCB=1.0089 mu=0.8334 sigma=0.0877 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004398987183488154}
[2026-04-13 01:19:20] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004587322834454481, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:22] [AutoResearch] Launching job: n_steer=3 n_throttle=3 lr=0.004587
[2026-04-13 01:19:30] [AutoResearch] Job finished in 7.9s, returncode=0
[2026-04-13 01:19:30] [AutoResearch] mean_reward=36.4379
[2026-04-13 01:19:30] [AutoResearch] === Trial 29 Summary ===
[2026-04-13 01:19:30] Total runs in history: 147
[2026-04-13 01:19:30] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:30] Top 5 results:
[2026-04-13 01:19:30] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:30] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:30] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:30] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:30] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:32]
[AutoResearch] ========== Trial 30/200 ==========
[2026-04-13 01:19:32] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:19:32] UCB=1.5634 mu=1.1871 sigma=0.1881 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004029716465310493}
[2026-04-13 01:19:32] UCB=1.2303 mu=0.9568 sigma=0.1367 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 8.035131396357773e-05}
[2026-04-13 01:19:32] UCB=1.1324 mu=0.7824 sigma=0.1750 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0008672317584574283}
[2026-04-13 01:19:32] UCB=1.0655 mu=0.8037 sigma=0.1309 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.00442640917368853}
[2026-04-13 01:19:32] UCB=1.0635 mu=0.8615 sigma=0.1010 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0011576397456445045}
[2026-04-13 01:19:32] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004029716465310493, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:34] [AutoResearch] Launching job: n_steer=3 n_throttle=2 lr=0.004030
[2026-04-13 01:19:43] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:19:43] [AutoResearch] mean_reward=59.4183
[2026-04-13 01:19:43] [AutoResearch] === Trial 30 Summary ===
[2026-04-13 01:19:43] Total runs in history: 148
[2026-04-13 01:19:43] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:43] Top 5 results:
[2026-04-13 01:19:43] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:43] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:43] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:43] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:43] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:45]
[AutoResearch] ========== Trial 31/200 ==========
[2026-04-13 01:19:45] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:19:45] UCB=1.3736 mu=0.9843 sigma=0.1947 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.003611947623128948}
[2026-04-13 01:19:45] UCB=1.3348 mu=0.9326 sigma=0.2011 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0034200248800445436}
[2026-04-13 01:19:45] UCB=1.1771 mu=0.7977 sigma=0.1897 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.004026745587661155}
[2026-04-13 01:19:45] UCB=1.0869 mu=0.6999 sigma=0.1935 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004051540281344283}
[2026-04-13 01:19:45] UCB=1.0787 mu=0.7030 sigma=0.1879 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.003995175838657314}
[2026-04-13 01:19:45] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.003611947623128948, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:47] [AutoResearch] Launching job: n_steer=5 n_throttle=3 lr=0.003612
[2026-04-13 01:19:56] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:19:56] [AutoResearch] mean_reward=56.2969
[2026-04-13 01:19:56] [AutoResearch] === Trial 31 Summary ===
[2026-04-13 01:19:56] Total runs in history: 149
[2026-04-13 01:19:56] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:56] Top 5 results:
[2026-04-13 01:19:56] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:56] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:56] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:56] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:56] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:19:58]
[AutoResearch] ========== Trial 32/200 ==========
[2026-04-13 01:19:58] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:19:58] UCB=1.0136 mu=0.7507 sigma=0.1315 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.003206013880889007}
[2026-04-13 01:19:58] UCB=1.0060 mu=0.8131 sigma=0.0964 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004970968585686247}
[2026-04-13 01:19:58] UCB=0.9855 mu=0.6672 sigma=0.1592 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.0030958734916456594}
[2026-04-13 01:19:58] UCB=0.9660 mu=0.6777 sigma=0.1441 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0032926419954475428}
[2026-04-13 01:19:58] UCB=0.9589 mu=0.6317 sigma=0.1636 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.000988240246804709}
[2026-04-13 01:19:58] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.003206013880889007, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:00] [AutoResearch] Launching job: n_steer=8 n_throttle=2 lr=0.003206
[2026-04-13 01:20:09] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:20:09] [AutoResearch] mean_reward=67.0593
[2026-04-13 01:20:09] [AutoResearch] === Trial 32 Summary ===
[2026-04-13 01:20:09] Total runs in history: 150
[2026-04-13 01:20:09] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:09] Top 5 results:
[2026-04-13 01:20:09] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:09] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:09] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:09] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:09] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:11]
[AutoResearch] ========== Trial 33/200 ==========
[2026-04-13 01:20:11] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:20:11] UCB=0.9524 mu=0.6000 sigma=0.1762 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.001007796212762873}
[2026-04-13 01:20:11] UCB=0.9394 mu=0.7498 sigma=0.0948 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0019447691730077933}
[2026-04-13 01:20:11] UCB=0.9112 mu=0.8372 sigma=0.0370 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012143381465046827}
[2026-04-13 01:20:11] UCB=0.8652 mu=0.5706 sigma=0.1473 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035895217378103324}
[2026-04-13 01:20:11] UCB=0.8448 mu=0.4829 sigma=0.1810 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012209539547954227}
[2026-04-13 01:20:11] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.001007796212762873, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:13] [AutoResearch] Launching job: n_steer=6 n_throttle=3 lr=0.001008
[2026-04-13 01:20:22] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:20:22] [AutoResearch] mean_reward=86.2483
[2026-04-13 01:20:22] [AutoResearch] === Trial 33 Summary ===
[2026-04-13 01:20:22] Total runs in history: 151
[2026-04-13 01:20:22] Best so far: mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:22] Top 5 results:
[2026-04-13 01:20:22] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:22] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:22] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:22] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:22] mean_reward=105.4572 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033568431430984467, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:24]
[AutoResearch] ========== Trial 34/200 ==========
[2026-04-13 01:20:24] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:20:24] UCB=1.2866 mu=0.9362 sigma=0.1752 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085}
[2026-04-13 01:20:24] UCB=1.2266 mu=0.9742 sigma=0.1262 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.00010078382915981222}
[2026-04-13 01:20:24] UCB=1.2194 mu=0.8588 sigma=0.1803 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0014682918947926053}
[2026-04-13 01:20:24] UCB=1.1793 mu=0.9904 sigma=0.0945 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0007773034679732442}
[2026-04-13 01:20:24] UCB=1.1478 mu=0.8002 sigma=0.1738 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0011446137587055234}
[2026-04-13 01:20:24] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:26] [AutoResearch] Launching job: n_steer=6 n_throttle=2 lr=0.001222
[2026-04-13 01:20:35] [AutoResearch] Job finished in 9.4s, returncode=0
[2026-04-13 01:20:35] [AutoResearch] mean_reward=122.297
[2026-04-13 01:20:35] [AutoResearch] === Trial 34 Summary ===
[2026-04-13 01:20:35] Total runs in history: 152
[2026-04-13 01:20:35] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:35] Top 5 results:
[2026-04-13 01:20:35] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:35] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:35] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:35] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:35] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:37]
[AutoResearch] ========== Trial 35/200 ==========
[2026-04-13 01:20:38] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:20:38] UCB=2.6313 mu=2.4886 sigma=0.0714 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012429907446823646}
[2026-04-13 01:20:38] UCB=2.4834 mu=2.3257 sigma=0.0789 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0013102018742646637}
[2026-04-13 01:20:38] UCB=2.2150 mu=1.8930 sigma=0.1610 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0008631448817525436}
[2026-04-13 01:20:38] UCB=2.0784 mu=1.8567 sigma=0.1109 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0006093918352688042}
[2026-04-13 01:20:38] UCB=2.0239 mu=1.6846 sigma=0.1696 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0013776767009479637}
[2026-04-13 01:20:38] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012429907446823646, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:40] [AutoResearch] Launching job: n_steer=6 n_throttle=2 lr=0.001243
[2026-04-13 01:20:48] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:20:48] [AutoResearch] mean_reward=59.9342
[2026-04-13 01:20:48] [AutoResearch] === Trial 35 Summary ===
[2026-04-13 01:20:48] Total runs in history: 153
[2026-04-13 01:20:48] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:48] Top 5 results:
[2026-04-13 01:20:48] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:48] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:48] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:48] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:48] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:50]
[AutoResearch] ========== Trial 36/200 ==========
[2026-04-13 01:20:50] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:20:50] UCB=2.0121 mu=1.6721 sigma=0.1700 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0009754999027616014}
[2026-04-13 01:20:50] UCB=1.5467 mu=1.1966 sigma=0.1750 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0012248560240971222}
[2026-04-13 01:20:50] UCB=1.4984 mu=1.2434 sigma=0.1275 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0012654046047480955}
[2026-04-13 01:20:50] UCB=1.4588 mu=1.2598 sigma=0.0995 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0010213317543270196}
[2026-04-13 01:20:50] UCB=1.2400 mu=0.8977 sigma=0.1712 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0009136515757976173}
[2026-04-13 01:20:50] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0009754999027616014, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:20:52] [AutoResearch] Launching job: n_steer=6 n_throttle=2 lr=0.000975
[2026-04-13 01:21:01] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:21:01] [AutoResearch] mean_reward=77.9726
[2026-04-13 01:21:01] [AutoResearch] === Trial 36 Summary ===
[2026-04-13 01:21:01] Total runs in history: 154
[2026-04-13 01:21:01] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:01] Top 5 results:
[2026-04-13 01:21:01] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:01] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:01] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:01] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:01] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:03]
[AutoResearch] ========== Trial 37/200 ==========
[2026-04-13 01:21:03] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:21:03] UCB=1.8357 mu=1.4851 sigma=0.1753 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012352462437364739}
[2026-04-13 01:21:03] UCB=1.6914 mu=1.3742 sigma=0.1586 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0013804087163857923}
[2026-04-13 01:21:03] UCB=1.6328 mu=1.3681 sigma=0.1323 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001145934294785194}
[2026-04-13 01:21:03] UCB=1.6047 mu=1.2578 sigma=0.1734 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0011940267430182114}
[2026-04-13 01:21:03] UCB=1.3095 mu=1.0807 sigma=0.1144 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.001202729865697338}
[2026-04-13 01:21:03] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012352462437364739, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:05] [AutoResearch] Launching job: n_steer=6 n_throttle=2 lr=0.001235
[2026-04-13 01:21:13] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:21:13] [AutoResearch] mean_reward=67.3062
[2026-04-13 01:21:13] [AutoResearch] === Trial 37 Summary ===
[2026-04-13 01:21:13] Total runs in history: 155
[2026-04-13 01:21:13] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:13] Top 5 results:
[2026-04-13 01:21:13] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:13] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:13] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:13] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:13] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:15]
[AutoResearch] ========== Trial 38/200 ==========
[2026-04-13 01:21:16] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:21:16] UCB=1.1004 mu=0.8162 sigma=0.1421 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0012072808064251595}
[2026-04-13 01:21:16] UCB=0.9878 mu=0.7003 sigma=0.1437 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0033109813835161604}
[2026-04-13 01:21:16] UCB=0.8893 mu=0.6174 sigma=0.1360 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0008890351599402595}
[2026-04-13 01:21:16] UCB=0.8847 mu=0.7182 sigma=0.0833 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005478127758922886}
[2026-04-13 01:21:16] UCB=0.8533 mu=0.5041 sigma=0.1746 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.003178108768998331}
[2026-04-13 01:21:16] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0012072808064251595, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:18] [AutoResearch] Launching job: n_steer=7 n_throttle=3 lr=0.001207
[2026-04-13 01:21:26] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:21:26] [AutoResearch] mean_reward=61.3275
[2026-04-13 01:21:26] [AutoResearch] === Trial 38 Summary ===
[2026-04-13 01:21:26] Total runs in history: 156
[2026-04-13 01:21:26] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:26] Top 5 results:
[2026-04-13 01:21:26] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:26] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:26] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:26] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:26] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:28]
[AutoResearch] ========== Trial 39/200 ==========
[2026-04-13 01:21:28] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:21:28] UCB=1.4849 mu=1.1644 sigma=0.1603 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0010522373443849866}
[2026-04-13 01:21:28] UCB=1.4384 mu=1.1332 sigma=0.1526 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0011058152947954838}
[2026-04-13 01:21:28] UCB=1.4062 mu=1.0610 sigma=0.1726 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0009937952424397879}
[2026-04-13 01:21:28] UCB=1.3430 mu=1.0409 sigma=0.1511 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0012395868402214566}
[2026-04-13 01:21:28] UCB=1.3312 mu=1.0297 sigma=0.1508 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.000891563663214404}
[2026-04-13 01:21:28] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0010522373443849866, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:30] [AutoResearch] Launching job: n_steer=6 n_throttle=2 lr=0.001052
[2026-04-13 01:21:39] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:21:39] [AutoResearch] mean_reward=46.6116
[2026-04-13 01:21:39] [AutoResearch] === Trial 39 Summary ===
[2026-04-13 01:21:39] Total runs in history: 157
[2026-04-13 01:21:39] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:39] Top 5 results:
[2026-04-13 01:21:39] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:39] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:39] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:39] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:39] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:41]
[AutoResearch] ========== Trial 40/200 ==========
[2026-04-13 01:21:41] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:21:41] UCB=1.2833 mu=1.0235 sigma=0.1299 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.0027068452722899747}
[2026-04-13 01:21:41] UCB=1.0920 mu=0.9257 sigma=0.0832 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0013635837521765387}
[2026-04-13 01:21:41] UCB=1.0824 mu=0.7692 sigma=0.1566 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0011350451328971305}
[2026-04-13 01:21:41] UCB=1.0633 mu=0.8378 sigma=0.1127 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.0027371620795865366}
[2026-04-13 01:21:41] UCB=1.0294 mu=0.8237 sigma=0.1028 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0012009620002527183}
[2026-04-13 01:21:41] [AutoResearch] Proposed params: {'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.0027068452722899747, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:43] [AutoResearch] Launching job: n_steer=9 n_throttle=3 lr=0.002707
[2026-04-13 01:21:52] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:21:52] [AutoResearch] mean_reward=78.3482
[2026-04-13 01:21:52] [AutoResearch] === Trial 40 Summary ===
[2026-04-13 01:21:52] Total runs in history: 158
[2026-04-13 01:21:52] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:52] Top 5 results:
[2026-04-13 01:21:52] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:52] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:52] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:52] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:52] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:54]
[AutoResearch] ========== Trial 41/200 ==========
[2026-04-13 01:21:54] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:21:54] UCB=1.4482 mu=1.0993 sigma=0.1745 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001131716769063254}
[2026-04-13 01:21:54] UCB=1.4221 mu=1.1081 sigma=0.1570 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0010354887934262314}
[2026-04-13 01:21:54] UCB=1.3494 mu=1.0433 sigma=0.1530 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0011049930144957942}
[2026-04-13 01:21:54] UCB=1.2198 mu=0.9217 sigma=0.1491 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0010883843644875955}
[2026-04-13 01:21:54] UCB=1.1841 mu=0.9972 sigma=0.0934 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0009412900894604573}
[2026-04-13 01:21:54] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001131716769063254, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:21:56] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.001132
[2026-04-13 01:22:04] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:22:04] [AutoResearch] mean_reward=54.5905
[2026-04-13 01:22:04] [AutoResearch] === Trial 41 Summary ===
[2026-04-13 01:22:04] Total runs in history: 159
[2026-04-13 01:22:04] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:04] Top 5 results:
[2026-04-13 01:22:04] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:04] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:04] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:04] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:04] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:06]
[AutoResearch] ========== Trial 42/200 ==========
[2026-04-13 01:22:06] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:22:06] UCB=1.2245 mu=0.8947 sigma=0.1649 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.000941574442631983}
[2026-04-13 01:22:06] UCB=1.0108 mu=0.7354 sigma=0.1377 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.003426767993054967}
[2026-04-13 01:22:06] UCB=0.9872 mu=0.6316 sigma=0.1778 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004200012122546207}
[2026-04-13 01:22:06] UCB=0.9771 mu=0.6903 sigma=0.1434 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0022896317636002333}
[2026-04-13 01:22:06] UCB=0.9610 mu=0.6117 sigma=0.1746 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004242965977952805}
[2026-04-13 01:22:06] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.000941574442631983, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:08] [AutoResearch] Launching job: n_steer=6 n_throttle=2 lr=0.000942
[2026-04-13 01:22:17] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:22:17] [AutoResearch] mean_reward=62.0247
[2026-04-13 01:22:17] [AutoResearch] === Trial 42 Summary ===
[2026-04-13 01:22:17] Total runs in history: 160
[2026-04-13 01:22:17] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:17] Top 5 results:
[2026-04-13 01:22:17] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:17] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:17] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:17] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:17] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:19]
[AutoResearch] ========== Trial 43/200 ==========
[2026-04-13 01:22:19] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:22:19] UCB=1.2645 mu=0.9472 sigma=0.1586 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010736340863620952}
[2026-04-13 01:22:19] UCB=1.2426 mu=0.9763 sigma=0.1331 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0011786124032765097}
[2026-04-13 01:22:19] UCB=1.0220 mu=0.7505 sigma=0.1358 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0033724081772814754}
[2026-04-13 01:22:19] UCB=0.8977 mu=0.7359 sigma=0.0809 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0010637334250634778}
[2026-04-13 01:22:19] UCB=0.8973 mu=0.5724 sigma=0.1625 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0006713724305939915}
[2026-04-13 01:22:19] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010736340863620952, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:21] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.001074
[2026-04-13 01:22:30] [AutoResearch] Job finished in 8.9s, returncode=0
[2026-04-13 01:22:30] [AutoResearch] mean_reward=83.9954
[2026-04-13 01:22:30] [AutoResearch] === Trial 43 Summary ===
[2026-04-13 01:22:30] Total runs in history: 161
[2026-04-13 01:22:30] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:30] Top 5 results:
[2026-04-13 01:22:30] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:30] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:30] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:30] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:30] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:32]
[AutoResearch] ========== Trial 44/200 ==========
[2026-04-13 01:22:32] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:22:32] UCB=1.1601 mu=0.8619 sigma=0.1491 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0009799227402454674}
[2026-04-13 01:22:32] UCB=1.0391 mu=0.7275 sigma=0.1558 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.003125699253808702}
[2026-04-13 01:22:32] UCB=0.9382 mu=0.6419 sigma=0.1482 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.00014864744574033547}
[2026-04-13 01:22:32] UCB=0.9127 mu=0.5765 sigma=0.1681 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0007901282298058858}
[2026-04-13 01:22:32] UCB=0.8994 mu=0.7219 sigma=0.0887 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 7.842110682579884e-05}
[2026-04-13 01:22:32] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0009799227402454674, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:34] [AutoResearch] Launching job: n_steer=5 n_throttle=3 lr=0.000980
[2026-04-13 01:22:43] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:22:43] [AutoResearch] mean_reward=60.2581
[2026-04-13 01:22:43] [AutoResearch] === Trial 44 Summary ===
[2026-04-13 01:22:43] Total runs in history: 162
[2026-04-13 01:22:43] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:43] Top 5 results:
[2026-04-13 01:22:43] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:43] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:43] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:43] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:43] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:45]
[AutoResearch] ========== Trial 45/200 ==========
[2026-04-13 01:22:45] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:22:45] UCB=1.3327 mu=1.0073 sigma=0.1627 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0008982835770520015}
[2026-04-13 01:22:45] UCB=1.1537 mu=0.9067 sigma=0.1235 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0010698498524485394}
[2026-04-13 01:22:45] UCB=1.0587 mu=0.7633 sigma=0.1477 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.002978380700568231}
[2026-04-13 01:22:45] UCB=1.0408 mu=0.7790 sigma=0.1309 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.002600460469488493}
[2026-04-13 01:22:45] UCB=1.0076 mu=0.6636 sigma=0.1720 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0007021173532083369}
[2026-04-13 01:22:45] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0008982835770520015, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:47] [AutoResearch] Launching job: n_steer=6 n_throttle=3 lr=0.000898
[2026-04-13 01:22:56] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:22:56] [AutoResearch] mean_reward=88.724
[2026-04-13 01:22:56] [AutoResearch] === Trial 45 Summary ===
[2026-04-13 01:22:56] Total runs in history: 163
[2026-04-13 01:22:56] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:56] Top 5 results:
[2026-04-13 01:22:56] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:56] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:56] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:56] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:56] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:22:58]
[AutoResearch] ========== Trial 46/200 ==========
[2026-04-13 01:22:58] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:22:58] UCB=1.4320 mu=1.1516 sigma=0.1402 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0012001839351679785}
[2026-04-13 01:22:58] UCB=1.3665 mu=1.0186 sigma=0.1739 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0011071668898296153}
[2026-04-13 01:22:58] UCB=1.1915 mu=0.9253 sigma=0.1331 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001139965546364589}
[2026-04-13 01:22:58] UCB=1.1722 mu=0.8537 sigma=0.1592 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010190957207702471}
[2026-04-13 01:22:58] UCB=1.1216 mu=0.8350 sigma=0.1433 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.00012495829394204312}
[2026-04-13 01:22:58] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0012001839351679785, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:00] [AutoResearch] Launching job: n_steer=6 n_throttle=3 lr=0.001200
[2026-04-13 01:23:09] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:23:09] [AutoResearch] mean_reward=64.019
[2026-04-13 01:23:09] [AutoResearch] === Trial 46 Summary ===
[2026-04-13 01:23:09] Total runs in history: 164
[2026-04-13 01:23:09] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:09] Top 5 results:
[2026-04-13 01:23:09] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:09] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:09] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:09] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:09] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:11]
[AutoResearch] ========== Trial 47/200 ==========
[2026-04-13 01:23:11] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:23:11] UCB=1.3799 mu=1.0344 sigma=0.1728 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0009786447902364636}
[2026-04-13 01:23:11] UCB=0.9991 mu=0.6547 sigma=0.1722 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0018641187335541585}
[2026-04-13 01:23:11] UCB=0.9796 mu=0.6552 sigma=0.1622 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0016798012587621342}
[2026-04-13 01:23:11] UCB=0.9710 mu=0.5889 sigma=0.1910 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0037503790854853227}
[2026-04-13 01:23:11] UCB=0.9219 mu=0.5796 sigma=0.1712 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0038952079829483225}
[2026-04-13 01:23:11] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0009786447902364636, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:13] [AutoResearch] Launching job: n_steer=6 n_throttle=2 lr=0.000979
[2026-04-13 01:23:22] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:23:22] [AutoResearch] mean_reward=68.3048
[2026-04-13 01:23:22] [AutoResearch] === Trial 47 Summary ===
[2026-04-13 01:23:22] Total runs in history: 165
[2026-04-13 01:23:22] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:22] Top 5 results:
[2026-04-13 01:23:22] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:22] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:22] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:22] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:22] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:24]
[AutoResearch] ========== Trial 48/200 ==========
[2026-04-13 01:23:24] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:23:24] UCB=1.2654 mu=0.9191 sigma=0.1731 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010966832209934026}
[2026-04-13 01:23:24] UCB=1.1292 mu=0.8065 sigma=0.1614 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020996867370792686}
[2026-04-13 01:23:24] UCB=1.0579 mu=0.6681 sigma=0.1949 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0042507899002680195}
[2026-04-13 01:23:24] UCB=1.0377 mu=0.8591 sigma=0.0893 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 7.331560535576644e-05}
[2026-04-13 01:23:24] UCB=0.9711 mu=0.7229 sigma=0.1241 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035194406883770238}
[2026-04-13 01:23:24] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010966832209934026, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:26] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.001097
[2026-04-13 01:23:34] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:23:34] [AutoResearch] mean_reward=62.2961
[2026-04-13 01:23:34] [AutoResearch] === Trial 48 Summary ===
[2026-04-13 01:23:34] Total runs in history: 166
[2026-04-13 01:23:34] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:34] Top 5 results:
[2026-04-13 01:23:34] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:34] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:34] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:34] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:34] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:36]
[AutoResearch] ========== Trial 49/200 ==========
[2026-04-13 01:23:36] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:23:36] UCB=1.2018 mu=0.9385 sigma=0.1317 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0009902425688755937}
[2026-04-13 01:23:36] UCB=1.1523 mu=0.8448 sigma=0.1538 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0009133280694210631}
[2026-04-13 01:23:36] UCB=1.1439 mu=0.8553 sigma=0.1443 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0011699104069449256}
[2026-04-13 01:23:36] UCB=0.9898 mu=0.6048 sigma=0.1925 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004317864407365183}
[2026-04-13 01:23:36] UCB=0.9352 mu=0.5789 sigma=0.1781 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.003741575983627763}
[2026-04-13 01:23:36] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0009902425688755937, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:38] [AutoResearch] Launching job: n_steer=6 n_throttle=3 lr=0.000990
[2026-04-13 01:23:46] [AutoResearch] Job finished in 8.1s, returncode=0
[2026-04-13 01:23:46] [AutoResearch] mean_reward=53.2362
[2026-04-13 01:23:46] [AutoResearch] === Trial 49 Summary ===
[2026-04-13 01:23:46] Total runs in history: 167
[2026-04-13 01:23:46] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:46] Top 5 results:
[2026-04-13 01:23:46] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:46] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:46] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:46] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:46] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:48]
[AutoResearch] ========== Trial 50/200 ==========
[2026-04-13 01:23:48] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:23:48] UCB=1.0982 mu=0.8189 sigma=0.1396 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004999023093426963}
[2026-04-13 01:23:48] UCB=1.0855 mu=0.7849 sigma=0.1503 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.0027109853981565043}
[2026-04-13 01:23:48] UCB=1.0688 mu=0.8086 sigma=0.1301 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0018615154329199358}
[2026-04-13 01:23:48] UCB=1.0506 mu=0.8108 sigma=0.1199 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 6.931458432109274e-05}
[2026-04-13 01:23:48] UCB=1.0105 mu=0.6867 sigma=0.1619 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004159305817761329}
[2026-04-13 01:23:48] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004999023093426963, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:50] [AutoResearch] Launching job: n_steer=3 n_throttle=3 lr=0.004999
[2026-04-13 01:23:59] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:23:59] [AutoResearch] mean_reward=79.2458
[2026-04-13 01:23:59] [AutoResearch] === Trial 50 Summary ===
[2026-04-13 01:23:59] Total runs in history: 168
[2026-04-13 01:23:59] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:59] Top 5 results:
[2026-04-13 01:23:59] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:59] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:59] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:59] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:23:59] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:01]
[AutoResearch] ========== Trial 51/200 ==========
[2026-04-13 01:24:01] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:24:01] UCB=1.1816 mu=0.8889 sigma=0.1464 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.00011188116023695013}
[2026-04-13 01:24:01] UCB=1.0453 mu=0.6746 sigma=0.1854 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004152164448299693}
[2026-04-13 01:24:01] UCB=1.0448 mu=0.7235 sigma=0.1607 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002273279923679517}
[2026-04-13 01:24:01] UCB=1.0356 mu=0.7393 sigma=0.1482 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.003072468451790972}
[2026-04-13 01:24:01] UCB=0.9534 mu=0.7143 sigma=0.1195 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.00344459224498233}
[2026-04-13 01:24:01] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.00011188116023695013, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:03] [AutoResearch] Launching job: n_steer=5 n_throttle=3 lr=0.000112
[2026-04-13 01:24:11] [AutoResearch] Job finished in 8.2s, returncode=0
[2026-04-13 01:24:11] [AutoResearch] mean_reward=48.6379
[2026-04-13 01:24:11] [AutoResearch] === Trial 51 Summary ===
[2026-04-13 01:24:11] Total runs in history: 169
[2026-04-13 01:24:11] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:11] Top 5 results:
[2026-04-13 01:24:11] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:11] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:11] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:11] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:11] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:13]
[AutoResearch] ========== Trial 52/200 ==========
[2026-04-13 01:24:13] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:24:13] UCB=1.1885 mu=0.8403 sigma=0.1741 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0011796051797273704}
[2026-04-13 01:24:13] UCB=1.1595 mu=0.8142 sigma=0.1727 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0011179088435417206}
[2026-04-13 01:24:13] UCB=1.0595 mu=0.7667 sigma=0.1464 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0034532151716747802}
[2026-04-13 01:24:13] UCB=1.0486 mu=0.7094 sigma=0.1696 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004233831273072446}
[2026-04-13 01:24:13] UCB=1.0400 mu=0.7543 sigma=0.1429 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.003464146283125912}
[2026-04-13 01:24:13] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0011796051797273704, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:15] [AutoResearch] Launching job: n_steer=6 n_throttle=2 lr=0.001180
[2026-04-13 01:24:24] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:24:24] [AutoResearch] mean_reward=82.1774
[2026-04-13 01:24:24] [AutoResearch] === Trial 52 Summary ===
[2026-04-13 01:24:24] Total runs in history: 170
[2026-04-13 01:24:24] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:24] Top 5 results:
[2026-04-13 01:24:24] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:24] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:24] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:24] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:24] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:26]
[AutoResearch] ========== Trial 53/200 ==========
[2026-04-13 01:24:26] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:24:26] UCB=0.9948 mu=0.7647 sigma=0.1151 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.001232240761408109}
[2026-04-13 01:24:26] UCB=0.9641 mu=0.5741 sigma=0.1950 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.004318806097538942}
[2026-04-13 01:24:26] UCB=0.9473 mu=0.7437 sigma=0.1018 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0008057327912095038}
[2026-04-13 01:24:26] UCB=0.9257 mu=0.5495 sigma=0.1881 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004468933926736107}
[2026-04-13 01:24:26] UCB=0.9054 mu=0.5575 sigma=0.1740 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003962539587525891}
[2026-04-13 01:24:26] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.001232240761408109, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:28] [AutoResearch] Launching job: n_steer=6 n_throttle=2 lr=0.001232
[2026-04-13 01:24:37] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:24:37] [AutoResearch] mean_reward=63.6418
[2026-04-13 01:24:37] [AutoResearch] === Trial 53 Summary ===
[2026-04-13 01:24:37] Total runs in history: 171
[2026-04-13 01:24:37] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:37] Top 5 results:
[2026-04-13 01:24:37] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:37] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:37] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:37] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:37] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:39]
[AutoResearch] ========== Trial 54/200 ==========
[2026-04-13 01:24:39] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:24:39] UCB=1.2301 mu=0.8823 sigma=0.1739 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010158267933267258}
[2026-04-13 01:24:39] UCB=1.0792 mu=0.7306 sigma=0.1743 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0009958638543499133}
[2026-04-13 01:24:39] UCB=1.0427 mu=0.8113 sigma=0.1157 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.002760890351015242}
[2026-04-13 01:24:39] UCB=0.9598 mu=0.8039 sigma=0.0780 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.0030302621599551157}
[2026-04-13 01:24:39] UCB=0.9510 mu=0.8140 sigma=0.0685 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.004976120288752267}
[2026-04-13 01:24:39] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010158267933267258, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:41] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.001016
[2026-04-13 01:24:49] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:24:49] [AutoResearch] mean_reward=83.2882
[2026-04-13 01:24:49] [AutoResearch] === Trial 54 Summary ===
[2026-04-13 01:24:49] Total runs in history: 172
[2026-04-13 01:24:49] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:49] Top 5 results:
[2026-04-13 01:24:49] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:49] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:49] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:49] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:49] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:51]
[AutoResearch] ========== Trial 55/200 ==========
[2026-04-13 01:24:51] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:24:51] UCB=1.0509 mu=0.7800 sigma=0.1354 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033457737537196707}
[2026-04-13 01:24:51] UCB=1.0379 mu=0.6510 sigma=0.1934 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0040524449909916304}
[2026-04-13 01:24:51] UCB=1.0254 mu=0.7546 sigma=0.1354 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0007404272235286298}
[2026-04-13 01:24:51] UCB=1.0140 mu=0.7175 sigma=0.1482 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 6.243931960329284e-05}
[2026-04-13 01:24:51] UCB=1.0132 mu=0.6425 sigma=0.1853 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004335103194003657}
[2026-04-13 01:24:51] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033457737537196707, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:24:53] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.003346
[2026-04-13 01:25:02] [AutoResearch] Job finished in 8.1s, returncode=0
[2026-04-13 01:25:02] [AutoResearch] mean_reward=47.1845
[2026-04-13 01:25:02] [AutoResearch] === Trial 55 Summary ===
[2026-04-13 01:25:02] Total runs in history: 173
[2026-04-13 01:25:02] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:02] Top 5 results:
[2026-04-13 01:25:02] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:02] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:02] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:02] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:02] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:04]
[AutoResearch] ========== Trial 56/200 ==========
[2026-04-13 01:25:04] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:25:04] UCB=1.1061 mu=0.7700 sigma=0.1681 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0009429099573211977}
[2026-04-13 01:25:04] UCB=1.0867 mu=0.7652 sigma=0.1607 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.000877404222061965}
[2026-04-13 01:25:04] UCB=1.0159 mu=0.6879 sigma=0.1640 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0013509822573340438}
[2026-04-13 01:25:04] UCB=1.0029 mu=0.6564 sigma=0.1733 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.002844978284451131}
[2026-04-13 01:25:04] UCB=0.9917 mu=0.6813 sigma=0.1552 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0010329300382464497}
[2026-04-13 01:25:04] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0009429099573211977, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:06] [AutoResearch] Launching job: n_steer=6 n_throttle=3 lr=0.000943
[2026-04-13 01:25:14] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:25:14] [AutoResearch] mean_reward=62.1694
[2026-04-13 01:25:14] [AutoResearch] === Trial 56 Summary ===
[2026-04-13 01:25:14] Total runs in history: 174
[2026-04-13 01:25:14] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:14] Top 5 results:
[2026-04-13 01:25:14] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:14] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:14] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:14] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:14] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:16]
[AutoResearch] ========== Trial 57/200 ==========
[2026-04-13 01:25:16] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:25:16] UCB=0.9692 mu=0.6963 sigma=0.1364 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0004026901083100558}
[2026-04-13 01:25:16] UCB=0.9049 mu=0.6869 sigma=0.1090 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0007762356053490825}
[2026-04-13 01:25:16] UCB=0.9017 mu=0.6136 sigma=0.1440 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.00013340085206036144}
[2026-04-13 01:25:16] UCB=0.8900 mu=0.6747 sigma=0.1076 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0005280026299433064}
[2026-04-13 01:25:16] UCB=0.8794 mu=0.5353 sigma=0.1720 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0008481995397061724}
[2026-04-13 01:25:16] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0004026901083100558, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:18] [AutoResearch] Launching job: n_steer=7 n_throttle=3 lr=0.000403
[2026-04-13 01:25:26] [AutoResearch] Job finished in 7.9s, returncode=0
[2026-04-13 01:25:26] [AutoResearch] mean_reward=42.3952
[2026-04-13 01:25:26] [AutoResearch] === Trial 57 Summary ===
[2026-04-13 01:25:26] Total runs in history: 175
[2026-04-13 01:25:26] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:26] Top 5 results:
[2026-04-13 01:25:26] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:26] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:26] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:26] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:26] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:28]
[AutoResearch] ========== Trial 58/200 ==========
[2026-04-13 01:25:28] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:25:28] UCB=0.9772 mu=0.6290 sigma=0.1741 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0013562014767354756}
[2026-04-13 01:25:28] UCB=0.9699 mu=0.5798 sigma=0.1950 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004165997352752082}
[2026-04-13 01:25:28] UCB=0.9400 mu=0.6790 sigma=0.1305 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.0025384578184663247}
[2026-04-13 01:25:28] UCB=0.9189 mu=0.5389 sigma=0.1900 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.00439523067398733}
[2026-04-13 01:25:28] UCB=0.9047 mu=0.5199 sigma=0.1924 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004321391129972184}
[2026-04-13 01:25:28] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0013562014767354756, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:30] [AutoResearch] Launching job: n_steer=6 n_throttle=2 lr=0.001356
[2026-04-13 01:25:39] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:25:39] [AutoResearch] mean_reward=73.3629
[2026-04-13 01:25:39] [AutoResearch] === Trial 58 Summary ===
[2026-04-13 01:25:39] Total runs in history: 176
[2026-04-13 01:25:39] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:39] Top 5 results:
[2026-04-13 01:25:39] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:39] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:39] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:39] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:39] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:41]
[AutoResearch] ========== Trial 59/200 ==========
[2026-04-13 01:25:41] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:25:41] UCB=1.2019 mu=0.8635 sigma=0.1692 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.00110273326265245}
[2026-04-13 01:25:41] UCB=1.1597 mu=0.8254 sigma=0.1672 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020336776032648827}
[2026-04-13 01:25:41] UCB=1.1429 mu=0.8299 sigma=0.1565 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0011578062532476806}
[2026-04-13 01:25:41] UCB=1.1365 mu=0.8023 sigma=0.1671 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.001965308386558463}
[2026-04-13 01:25:41] UCB=1.0969 mu=0.7502 sigma=0.1733 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001838795613939342}
[2026-04-13 01:25:41] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.00110273326265245, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:43] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.001103
[2026-04-13 01:25:52] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:25:52] [AutoResearch] mean_reward=70.3633
[2026-04-13 01:25:52] [AutoResearch] === Trial 59 Summary ===
[2026-04-13 01:25:52] Total runs in history: 177
[2026-04-13 01:25:52] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:52] Top 5 results:
[2026-04-13 01:25:52] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:52] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:52] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:52] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:52] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:54]
[AutoResearch] ========== Trial 60/200 ==========
[2026-04-13 01:25:54] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:25:54] UCB=1.1488 mu=0.8096 sigma=0.1696 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0018936405085969582}
[2026-04-13 01:25:54] UCB=0.9929 mu=0.6498 sigma=0.1715 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.002774441373992652}
[2026-04-13 01:25:54] UCB=0.9524 mu=0.6342 sigma=0.1591 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0006788804970097635}
[2026-04-13 01:25:54] UCB=0.9113 mu=0.5395 sigma=0.1859 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003935473116864854}
[2026-04-13 01:25:54] UCB=0.8725 mu=0.5719 sigma=0.1503 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0013521279279444264}
[2026-04-13 01:25:54] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0018936405085969582, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:25:56] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.001894
[2026-04-13 01:26:05] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:26:05] [AutoResearch] mean_reward=71.2732
[2026-04-13 01:26:05] [AutoResearch] === Trial 60 Summary ===
[2026-04-13 01:26:05] Total runs in history: 178
[2026-04-13 01:26:05] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:05] Top 5 results:
[2026-04-13 01:26:05] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:05] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:05] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:05] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:05] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:07]
[AutoResearch] ========== Trial 61/200 ==========
[2026-04-13 01:26:07] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:26:07] UCB=1.0973 mu=0.7719 sigma=0.1627 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0018205638930811675}
[2026-04-13 01:26:07] UCB=1.0743 mu=0.7669 sigma=0.1537 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0009908723842467835}
[2026-04-13 01:26:07] UCB=1.0550 mu=0.7387 sigma=0.1581 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0008886194695426734}
[2026-04-13 01:26:07] UCB=0.9368 mu=0.6301 sigma=0.1533 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0009213606922432187}
[2026-04-13 01:26:07] UCB=0.9235 mu=0.6395 sigma=0.1420 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002392914641742015}
[2026-04-13 01:26:07] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0018205638930811675, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:09] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.001821
[2026-04-13 01:26:17] [AutoResearch] Job finished in 7.8s, returncode=0
[2026-04-13 01:26:17] [AutoResearch] mean_reward=33.5928
[2026-04-13 01:26:17] [AutoResearch] === Trial 61 Summary ===
[2026-04-13 01:26:17] Total runs in history: 179
[2026-04-13 01:26:17] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:17] Top 5 results:
[2026-04-13 01:26:17] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:17] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:17] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:17] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:17] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:19]
[AutoResearch] ========== Trial 62/200 ==========
[2026-04-13 01:26:19] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:26:19] UCB=1.2230 mu=0.9143 sigma=0.1543 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.0028837097235016166}
[2026-04-13 01:26:19] UCB=1.1789 mu=0.8462 sigma=0.1664 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0009685877079852755}
[2026-04-13 01:26:19] UCB=1.1638 mu=0.8157 sigma=0.1741 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.001174309173411514}
[2026-04-13 01:26:19] UCB=1.0807 mu=0.8015 sigma=0.1396 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.002699253736197411}
[2026-04-13 01:26:19] UCB=1.0280 mu=0.6838 sigma=0.1721 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.002380542540778746}
[2026-04-13 01:26:19] [AutoResearch] Proposed params: {'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.0028837097235016166, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:21] [AutoResearch] Launching job: n_steer=9 n_throttle=3 lr=0.002884
[2026-04-13 01:26:29] [AutoResearch] Job finished in 8.2s, returncode=0
[2026-04-13 01:26:29] [AutoResearch] mean_reward=57.9037
[2026-04-13 01:26:29] [AutoResearch] === Trial 62 Summary ===
[2026-04-13 01:26:29] Total runs in history: 180
[2026-04-13 01:26:29] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:29] Top 5 results:
[2026-04-13 01:26:29] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:29] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:29] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:29] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:29] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:31]
[AutoResearch] ========== Trial 63/200 ==========
[2026-04-13 01:26:32] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:26:32] UCB=1.1778 mu=0.4115 sigma=0.3831 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.0045935988112811715}
[2026-04-13 01:26:32] UCB=1.1220 mu=0.3386 sigma=0.3917 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.004545995779885068}
[2026-04-13 01:26:32] UCB=1.0835 mu=0.2693 sigma=0.4071 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.004613659590048317}
[2026-04-13 01:26:32] UCB=0.9919 mu=0.6467 sigma=0.1726 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0019839118149626933}
[2026-04-13 01:26:32] UCB=0.9792 mu=0.6510 sigma=0.1641 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0010266414531083872}
[2026-04-13 01:26:32] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.0045935988112811715, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:34] [AutoResearch] Launching job: n_steer=5 n_throttle=4 lr=0.004594
[2026-04-13 01:26:42] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:26:42] [AutoResearch] mean_reward=53.3764
[2026-04-13 01:26:42] [AutoResearch] === Trial 63 Summary ===
[2026-04-13 01:26:42] Total runs in history: 181
[2026-04-13 01:26:42] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:42] Top 5 results:
[2026-04-13 01:26:42] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:42] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:42] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:42] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:42] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:44]
[AutoResearch] ========== Trial 64/200 ==========
[2026-04-13 01:26:44] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:26:44] UCB=1.1345 mu=0.7985 sigma=0.1680 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0009553657070767133}
[2026-04-13 01:26:44] UCB=0.9968 mu=0.7299 sigma=0.1335 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020088432571567063}
[2026-04-13 01:26:44] UCB=0.9662 mu=0.6247 sigma=0.1707 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0009946233217824852}
[2026-04-13 01:26:44] UCB=0.9292 mu=0.5679 sigma=0.1806 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003809154060616564}
[2026-04-13 01:26:44] UCB=0.9175 mu=0.6108 sigma=0.1533 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.003917855677501139}
[2026-04-13 01:26:44] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0009553657070767133, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:46] [AutoResearch] Launching job: n_steer=6 n_throttle=3 lr=0.000955
[2026-04-13 01:26:55] [AutoResearch] Job finished in 8.9s, returncode=0
[2026-04-13 01:26:55] [AutoResearch] mean_reward=86.437
[2026-04-13 01:26:55] [AutoResearch] === Trial 64 Summary ===
[2026-04-13 01:26:55] Total runs in history: 182
[2026-04-13 01:26:55] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:55] Top 5 results:
[2026-04-13 01:26:55] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:55] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:55] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:55] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:55] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:57]
[AutoResearch] ========== Trial 65/200 ==========
[2026-04-13 01:26:57] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:26:57] UCB=1.0664 mu=0.7727 sigma=0.1469 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0012308127186181443}
[2026-04-13 01:26:57] UCB=1.0380 mu=0.6964 sigma=0.1708 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.001136518218220036}
[2026-04-13 01:26:57] UCB=1.0342 mu=0.6814 sigma=0.1764 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0042716402868481955}
[2026-04-13 01:26:57] UCB=1.0127 mu=0.7732 sigma=0.1198 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0009405489493717941}
[2026-04-13 01:26:57] UCB=0.9481 mu=0.6267 sigma=0.1607 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004276310376128396}
[2026-04-13 01:26:57] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0012308127186181443, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:26:59] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.001231
[2026-04-13 01:27:08] [AutoResearch] Job finished in 9.2s, returncode=0
[2026-04-13 01:27:08] [AutoResearch] mean_reward=87.1236
[2026-04-13 01:27:08] [AutoResearch] === Trial 65 Summary ===
[2026-04-13 01:27:08] Total runs in history: 183
[2026-04-13 01:27:08] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:08] Top 5 results:
[2026-04-13 01:27:08] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:08] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:08] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:08] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:08] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:10]
[AutoResearch] ========== Trial 66/200 ==========
[2026-04-13 01:27:10] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:27:10] UCB=1.0769 mu=0.7813 sigma=0.1478 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.00010736862966274818}
[2026-04-13 01:27:10] UCB=1.0657 mu=0.7009 sigma=0.1824 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004185644811830243}
[2026-04-13 01:27:10] UCB=0.9409 mu=0.7231 sigma=0.1089 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004951350441450762}
[2026-04-13 01:27:10] UCB=0.9291 mu=0.7343 sigma=0.0974 params={'n_steer': 4, 'n_throttle': 4, 'learning_rate': 8.888464331266834e-05}
[2026-04-13 01:27:10] UCB=0.9274 mu=0.6299 sigma=0.1487 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0009324164208905265}
[2026-04-13 01:27:10] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.00010736862966274818, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:12] [AutoResearch] Launching job: n_steer=7 n_throttle=4 lr=0.000107
[2026-04-13 01:27:20] [AutoResearch] Job finished in 8.1s, returncode=0
[2026-04-13 01:27:20] [AutoResearch] mean_reward=44.0804
[2026-04-13 01:27:20] [AutoResearch] === Trial 66 Summary ===
[2026-04-13 01:27:20] Total runs in history: 184
[2026-04-13 01:27:20] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:20] Top 5 results:
[2026-04-13 01:27:20] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:20] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:20] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:20] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:20] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:22]
[AutoResearch] ========== Trial 67/200 ==========
[2026-04-13 01:27:22] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:27:22] UCB=1.1993 mu=0.8865 sigma=0.1564 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0010948049824650245}
[2026-04-13 01:27:22] UCB=1.1298 mu=0.8436 sigma=0.1431 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001010192769896707}
[2026-04-13 01:27:22] UCB=1.0212 mu=0.7021 sigma=0.1596 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0008454205318987869}
[2026-04-13 01:27:22] UCB=0.9683 mu=0.6262 sigma=0.1710 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0008316509495681354}
[2026-04-13 01:27:22] UCB=0.9590 mu=0.7296 sigma=0.1147 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012186226317534188}
[2026-04-13 01:27:22] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0010948049824650245, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:25] [AutoResearch] Launching job: n_steer=6 n_throttle=2 lr=0.001095
[2026-04-13 01:27:33] [AutoResearch] Job finished in 8.0s, returncode=0
[2026-04-13 01:27:33] [AutoResearch] mean_reward=54.1815
[2026-04-13 01:27:33] [AutoResearch] === Trial 67 Summary ===
[2026-04-13 01:27:33] Total runs in history: 185
[2026-04-13 01:27:33] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:33] Top 5 results:
[2026-04-13 01:27:33] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:33] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:33] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:33] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:33] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:35]
[AutoResearch] ========== Trial 68/200 ==========
[2026-04-13 01:27:35] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:27:35] UCB=1.1373 mu=0.7982 sigma=0.1696 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0010613546107872205}
[2026-04-13 01:27:35] UCB=1.0593 mu=0.7199 sigma=0.1697 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0007750060520610024}
[2026-04-13 01:27:35] UCB=1.0480 mu=0.6791 sigma=0.1845 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.00417824991468941}
[2026-04-13 01:27:35] UCB=0.9816 mu=0.6105 sigma=0.1855 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.00416268786705356}
[2026-04-13 01:27:35] UCB=0.9712 mu=0.6117 sigma=0.1798 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004118198700172941}
[2026-04-13 01:27:35] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0010613546107872205, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:37] [AutoResearch] Launching job: n_steer=6 n_throttle=2 lr=0.001061
[2026-04-13 01:27:45] [AutoResearch] Job finished in 8.0s, returncode=0
[2026-04-13 01:27:45] [AutoResearch] mean_reward=43.9887
[2026-04-13 01:27:45] [AutoResearch] === Trial 68 Summary ===
[2026-04-13 01:27:45] Total runs in history: 186
[2026-04-13 01:27:45] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:45] Top 5 results:
[2026-04-13 01:27:45] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:45] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:45] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:45] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:45] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:47]
[AutoResearch] ========== Trial 69/200 ==========
[2026-04-13 01:27:47] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:27:47] UCB=1.0759 mu=0.7305 sigma=0.1727 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.00412433764093791}
[2026-04-13 01:27:47] UCB=1.0214 mu=0.8213 sigma=0.1001 params={'n_steer': 6, 'n_throttle': 5, 'learning_rate': 0.0032438545249371033}
[2026-04-13 01:27:47] UCB=0.9706 mu=0.6702 sigma=0.1502 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004431330336123322}
[2026-04-13 01:27:47] UCB=0.8676 mu=0.6973 sigma=0.0852 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0008264900948862796}
[2026-04-13 01:27:47] UCB=0.8506 mu=0.5033 sigma=0.1736 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0006607050545501232}
[2026-04-13 01:27:47] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.00412433764093791, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:49] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.004124
[2026-04-13 01:27:58] [AutoResearch] Job finished in 8.9s, returncode=0
[2026-04-13 01:27:58] [AutoResearch] mean_reward=82.0013
[2026-04-13 01:27:58] [AutoResearch] === Trial 69 Summary ===
[2026-04-13 01:27:58] Total runs in history: 187
[2026-04-13 01:27:58] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:58] Top 5 results:
[2026-04-13 01:27:58] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:58] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:58] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:58] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:27:58] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:00]
[AutoResearch] ========== Trial 70/200 ==========
[2026-04-13 01:28:00] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:28:00] UCB=1.0991 mu=0.7924 sigma=0.1533 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004404880317934756}
[2026-04-13 01:28:00] UCB=1.0572 mu=0.8000 sigma=0.1286 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.00452749718796997}
[2026-04-13 01:28:00] UCB=0.9652 mu=0.7233 sigma=0.1210 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012081333414378833}
[2026-04-13 01:28:00] UCB=0.9610 mu=0.6153 sigma=0.1728 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.004118079022372628}
[2026-04-13 01:28:00] UCB=0.9065 mu=0.7317 sigma=0.0874 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0045666097233897154}
[2026-04-13 01:28:00] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004404880317934756, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:02] [AutoResearch] Launching job: n_steer=3 n_throttle=2 lr=0.004405
[2026-04-13 01:28:10] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:28:10] [AutoResearch] mean_reward=45.3415
[2026-04-13 01:28:10] [AutoResearch] === Trial 70 Summary ===
[2026-04-13 01:28:10] Total runs in history: 188
[2026-04-13 01:28:10] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:10] Top 5 results:
[2026-04-13 01:28:10] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:10] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:10] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:10] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:10] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:12]
[AutoResearch] ========== Trial 71/200 ==========
[2026-04-13 01:28:12] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:28:12] UCB=1.0001 mu=0.6655 sigma=0.1673 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.001147868046009807}
[2026-04-13 01:28:12] UCB=0.9797 mu=0.7060 sigma=0.1369 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004137140744767535}
[2026-04-13 01:28:12] UCB=0.9287 mu=0.5892 sigma=0.1698 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0007026716323244885}
[2026-04-13 01:28:12] UCB=0.8835 mu=0.5360 sigma=0.1738 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004387787486781375}
[2026-04-13 01:28:12] UCB=0.8831 mu=0.5418 sigma=0.1707 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0010390905296179618}
[2026-04-13 01:28:12] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.001147868046009807, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:14] [AutoResearch] Launching job: n_steer=6 n_throttle=3 lr=0.001148
[2026-04-13 01:28:23] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:28:23] [AutoResearch] mean_reward=54.677
[2026-04-13 01:28:23] [AutoResearch] === Trial 71 Summary ===
[2026-04-13 01:28:23] Total runs in history: 189
[2026-04-13 01:28:23] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:23] Top 5 results:
[2026-04-13 01:28:23] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:23] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:23] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:23] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:23] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:25]
[AutoResearch] ========== Trial 72/200 ==========
[2026-04-13 01:28:25] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:28:25] UCB=1.0872 mu=0.7262 sigma=0.1805 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0038482348857310084}
[2026-04-13 01:28:25] UCB=0.9497 mu=0.6408 sigma=0.1545 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0018011787808786743}
[2026-04-13 01:28:25] UCB=0.9152 mu=0.7082 sigma=0.1035 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0024167457893668107}
[2026-04-13 01:28:25] UCB=0.8816 mu=0.5415 sigma=0.1701 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020786269928949976}
[2026-04-13 01:28:25] UCB=0.8769 mu=0.6050 sigma=0.1359 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0010307831486864125}
[2026-04-13 01:28:25] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0038482348857310084, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:27] [AutoResearch] Launching job: n_steer=5 n_throttle=3 lr=0.003848
[2026-04-13 01:28:36] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:28:36] [AutoResearch] mean_reward=71.0598
[2026-04-13 01:28:36] [AutoResearch] === Trial 72 Summary ===
[2026-04-13 01:28:36] Total runs in history: 190
[2026-04-13 01:28:36] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:36] Top 5 results:
[2026-04-13 01:28:36] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:36] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:36] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:36] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:36] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:38]
[AutoResearch] ========== Trial 73/200 ==========
[2026-04-13 01:28:38] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:28:38] UCB=0.9851 mu=0.6927 sigma=0.1462 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.001007259782796827}
[2026-04-13 01:28:38] UCB=0.9818 mu=0.7182 sigma=0.1318 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0008635201406077614}
[2026-04-13 01:28:38] UCB=0.8865 mu=0.6227 sigma=0.1319 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0037974374493285386}
[2026-04-13 01:28:38] UCB=0.8837 mu=0.5630 sigma=0.1604 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.004195102732874855}
[2026-04-13 01:28:38] UCB=0.8718 mu=0.6176 sigma=0.1271 params={'n_steer': 6, 'n_throttle': 4, 'learning_rate': 0.00014330466131677553}
[2026-04-13 01:28:38] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.001007259782796827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:40] [AutoResearch] Launching job: n_steer=6 n_throttle=3 lr=0.001007
[2026-04-13 01:28:48] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:28:48] [AutoResearch] mean_reward=58.4808
[2026-04-13 01:28:48] [AutoResearch] === Trial 73 Summary ===
[2026-04-13 01:28:48] Total runs in history: 191
[2026-04-13 01:28:48] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:48] Top 5 results:
[2026-04-13 01:28:48] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:48] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:48] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:48] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:48] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:50]
[AutoResearch] ========== Trial 74/200 ==========
[2026-04-13 01:28:50] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:28:50] UCB=1.1168 mu=0.7661 sigma=0.1754 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.004011977434863096}
[2026-04-13 01:28:50] UCB=1.0951 mu=0.7651 sigma=0.1650 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004165631771410099}
[2026-04-13 01:28:50] UCB=1.0833 mu=0.7483 sigma=0.1675 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0009689480836419862}
[2026-04-13 01:28:50] UCB=0.9140 mu=0.5586 sigma=0.1777 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0043898239957204455}
[2026-04-13 01:28:50] UCB=0.9123 mu=0.5686 sigma=0.1719 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0035982368873298393}
[2026-04-13 01:28:50] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.004011977434863096, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:28:52] [AutoResearch] Launching job: n_steer=5 n_throttle=3 lr=0.004012
[2026-04-13 01:29:02] [AutoResearch] Job finished in 9.4s, returncode=0
[2026-04-13 01:29:02] [AutoResearch] mean_reward=101.2082
[2026-04-13 01:29:02] [AutoResearch] === Trial 74 Summary ===
[2026-04-13 01:29:02] Total runs in history: 192
[2026-04-13 01:29:02] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:02] Top 5 results:
[2026-04-13 01:29:02] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:02] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:02] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:02] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:02] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:04]
[AutoResearch] ========== Trial 75/200 ==========
[2026-04-13 01:29:04] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:29:04] UCB=1.6120 mu=1.2708 sigma=0.1706 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.003972001259370569}
[2026-04-13 01:29:04] UCB=1.5978 mu=1.3560 sigma=0.1209 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.004079656353977675}
[2026-04-13 01:29:04] UCB=1.4538 mu=1.2716 sigma=0.0911 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.00428762187842254}
[2026-04-13 01:29:04] UCB=1.4519 mu=1.0906 sigma=0.1807 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004095362390819413}
[2026-04-13 01:29:04] UCB=1.2538 mu=1.0659 sigma=0.0940 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.003974743664792731}
[2026-04-13 01:29:04] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.003972001259370569, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:06] [AutoResearch] Launching job: n_steer=5 n_throttle=3 lr=0.003972
[2026-04-13 01:29:15] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:29:15] [AutoResearch] mean_reward=66.8392
[2026-04-13 01:29:15] [AutoResearch] === Trial 75 Summary ===
[2026-04-13 01:29:15] Total runs in history: 193
[2026-04-13 01:29:15] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:15] Top 5 results:
[2026-04-13 01:29:15] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:15] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:15] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:15] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:15] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:17]
[AutoResearch] ========== Trial 76/200 ==========
[2026-04-13 01:29:17] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:29:17] UCB=1.1712 mu=0.8692 sigma=0.1510 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.004009613951890513}
[2026-04-13 01:29:17] UCB=1.1537 mu=0.7993 sigma=0.1772 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0037888829915617133}
[2026-04-13 01:29:17] UCB=1.0506 mu=0.7090 sigma=0.1708 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0008777232807240161}
[2026-04-13 01:29:17] UCB=0.9760 mu=0.6396 sigma=0.1682 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0009376619186653774}
[2026-04-13 01:29:17] UCB=0.9398 mu=0.6500 sigma=0.1449 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0008531893070653654}
[2026-04-13 01:29:17] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.004009613951890513, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:19] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.004010
[2026-04-13 01:29:28] [AutoResearch] Job finished in 9.1s, returncode=0
[2026-04-13 01:29:28] [AutoResearch] mean_reward=82.72
[2026-04-13 01:29:28] [AutoResearch] === Trial 76 Summary ===
[2026-04-13 01:29:28] Total runs in history: 194
[2026-04-13 01:29:28] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:28] Top 5 results:
[2026-04-13 01:29:28] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:28] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:28] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:28] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:28] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:30]
[AutoResearch] ========== Trial 77/200 ==========
[2026-04-13 01:29:30] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:29:30] UCB=1.3258 mu=0.9806 sigma=0.1726 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004339954912177771}
[2026-04-13 01:29:30] UCB=1.2543 mu=0.9571 sigma=0.1486 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003960570977725122}
[2026-04-13 01:29:30] UCB=1.1828 mu=0.8455 sigma=0.1686 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.004273161091846668}
[2026-04-13 01:29:30] UCB=1.1805 mu=0.8658 sigma=0.1573 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.004297083601779251}
[2026-04-13 01:29:30] UCB=1.1284 mu=0.7555 sigma=0.1864 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003862946425614067}
[2026-04-13 01:29:30] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004339954912177771, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:32] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.004340
[2026-04-13 01:29:41] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:29:41] [AutoResearch] mean_reward=63.8416
[2026-04-13 01:29:41] [AutoResearch] === Trial 77 Summary ===
[2026-04-13 01:29:41] Total runs in history: 195
[2026-04-13 01:29:41] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:41] Top 5 results:
[2026-04-13 01:29:41] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:41] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:41] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:41] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:41] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:43]
[AutoResearch] ========== Trial 78/200 ==========
[2026-04-13 01:29:43] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:29:43] UCB=1.4734 mu=1.1217 sigma=0.1758 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.004252772508346083}
[2026-04-13 01:29:43] UCB=1.3442 mu=1.0048 sigma=0.1697 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0038713739745032285}
[2026-04-13 01:29:43] UCB=1.1263 mu=0.7880 sigma=0.1691 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0036627758165331038}
[2026-04-13 01:29:43] UCB=0.9071 mu=0.5920 sigma=0.1576 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004312314561176255}
[2026-04-13 01:29:43] UCB=0.9065 mu=0.5675 sigma=0.1695 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.00379068851735736}
[2026-04-13 01:29:43] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.004252772508346083, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:45] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.004253
[2026-04-13 01:29:53] [AutoResearch] Job finished in 8.0s, returncode=0
[2026-04-13 01:29:53] [AutoResearch] mean_reward=49.275
[2026-04-13 01:29:53] [AutoResearch] === Trial 78 Summary ===
[2026-04-13 01:29:53] Total runs in history: 196
[2026-04-13 01:29:53] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:53] Top 5 results:
[2026-04-13 01:29:53] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:53] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:53] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:53] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:53] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:55]
[AutoResearch] ========== Trial 79/200 ==========
[2026-04-13 01:29:55] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:29:55] UCB=1.1623 mu=0.8101 sigma=0.1761 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0036393349516789324}
[2026-04-13 01:29:55] UCB=1.1134 mu=0.7553 sigma=0.1790 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004075635656125686}
[2026-04-13 01:29:55] UCB=1.0870 mu=0.8434 sigma=0.1218 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0037130891520516093}
[2026-04-13 01:29:55] UCB=1.0162 mu=0.7890 sigma=0.1136 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 9.374131582240222e-05}
[2026-04-13 01:29:55] UCB=0.9766 mu=0.6421 sigma=0.1673 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0008176441355730209}
[2026-04-13 01:29:55] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0036393349516789324, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:29:57] [AutoResearch] Launching job: n_steer=6 n_throttle=3 lr=0.003639
[2026-04-13 01:30:06] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:30:06] [AutoResearch] mean_reward=67.5213
[2026-04-13 01:30:06] [AutoResearch] === Trial 79 Summary ===
[2026-04-13 01:30:06] Total runs in history: 197
[2026-04-13 01:30:06] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:06] Top 5 results:
[2026-04-13 01:30:06] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:06] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:06] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:06] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:06] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:08]
[AutoResearch] ========== Trial 80/200 ==========
[2026-04-13 01:30:08] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:30:08] UCB=1.2161 mu=0.9087 sigma=0.1537 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003922957500280374}
[2026-04-13 01:30:08] UCB=1.1279 mu=0.8082 sigma=0.1599 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.003572699119620912}
[2026-04-13 01:30:08] UCB=1.1173 mu=0.8721 sigma=0.1226 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003707143125363745}
[2026-04-13 01:30:08] UCB=1.0357 mu=0.7046 sigma=0.1656 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0010013565725034228}
[2026-04-13 01:30:08] UCB=1.0231 mu=0.7839 sigma=0.1196 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0021974081125910754}
[2026-04-13 01:30:08] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003922957500280374, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:10] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.003923
[2026-04-13 01:30:19] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:30:19] [AutoResearch] mean_reward=87.1741
[2026-04-13 01:30:19] [AutoResearch] === Trial 80 Summary ===
[2026-04-13 01:30:19] Total runs in history: 198
[2026-04-13 01:30:19] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:19] Top 5 results:
[2026-04-13 01:30:19] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:19] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:19] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:19] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:19] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:21]
[AutoResearch] ========== Trial 81/200 ==========
[2026-04-13 01:30:21] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:30:21] UCB=1.5472 mu=1.1902 sigma=0.1785 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0036788253882170108}
[2026-04-13 01:30:21] UCB=1.3635 mu=1.2098 sigma=0.0768 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0035838655350918535}
[2026-04-13 01:30:21] UCB=1.1068 mu=0.8035 sigma=0.1516 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.003839178112146758}
[2026-04-13 01:30:21] UCB=1.0132 mu=0.6666 sigma=0.1733 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0042323010589454535}
[2026-04-13 01:30:21] UCB=0.9370 mu=0.6582 sigma=0.1394 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.000782603431121359}
[2026-04-13 01:30:21] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0036788253882170108, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:23] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.003679
[2026-04-13 01:30:32] [AutoResearch] Job finished in 9.3s, returncode=0
[2026-04-13 01:30:32] [AutoResearch] mean_reward=81.5059
[2026-04-13 01:30:32] [AutoResearch] === Trial 81 Summary ===
[2026-04-13 01:30:32] Total runs in history: 199
[2026-04-13 01:30:32] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:32] Top 5 results:
[2026-04-13 01:30:32] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:32] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:32] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:32] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:32] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:34]
[AutoResearch] ========== Trial 82/200 ==========
[2026-04-13 01:30:34] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:30:34] UCB=1.4445 mu=1.1036 sigma=0.1705 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003811311986822861}
[2026-04-13 01:30:34] UCB=1.3730 mu=1.0653 sigma=0.1538 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.004095467379598436}
[2026-04-13 01:30:34] UCB=1.3686 mu=1.0339 sigma=0.1673 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003986054564041316}
[2026-04-13 01:30:34] UCB=1.1194 mu=0.9388 sigma=0.0903 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0040922000146962475}
[2026-04-13 01:30:34] UCB=1.1162 mu=0.8437 sigma=0.1362 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 7.044489250831814e-05}
[2026-04-13 01:30:34] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003811311986822861, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:36] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.003811
[2026-04-13 01:30:45] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:30:45] [AutoResearch] mean_reward=62.9282
[2026-04-13 01:30:45] [AutoResearch] === Trial 82 Summary ===
[2026-04-13 01:30:45] Total runs in history: 200
[2026-04-13 01:30:45] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:45] Top 5 results:
[2026-04-13 01:30:45] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:45] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:45] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:45] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:45] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:47]
[AutoResearch] ========== Trial 83/200 ==========
[2026-04-13 01:30:47] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:30:47] UCB=0.9857 mu=0.6463 sigma=0.1697 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.00391822554963505}
[2026-04-13 01:30:47] UCB=0.9467 mu=0.6503 sigma=0.1482 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0011420363764630683}
[2026-04-13 01:30:47] UCB=0.8924 mu=0.7763 sigma=0.0581 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.004284222550119089}
[2026-04-13 01:30:47] UCB=0.8860 mu=0.5385 sigma=0.1737 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0017367275519765634}
[2026-04-13 01:30:47] UCB=0.8752 mu=0.5536 sigma=0.1608 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0036061601416834525}
[2026-04-13 01:30:47] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.00391822554963505, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:49] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.003918
[2026-04-13 01:30:57] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:30:57] [AutoResearch] mean_reward=83.0866
[2026-04-13 01:30:57] [AutoResearch] === Trial 83 Summary ===
[2026-04-13 01:30:57] Total runs in history: 201
[2026-04-13 01:30:57] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:57] Top 5 results:
[2026-04-13 01:30:57] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:57] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:57] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:57] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:57] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:30:59]
[AutoResearch] ========== Trial 84/200 ==========
[2026-04-13 01:30:59] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:30:59] UCB=1.2277 mu=1.0049 sigma=0.1114 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.004104239055479664}
[2026-04-13 01:30:59] UCB=1.1063 mu=0.7657 sigma=0.1703 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0036138187421110154}
[2026-04-13 01:30:59] UCB=1.0672 mu=0.8405 sigma=0.1133 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.004311536172115317}
[2026-04-13 01:30:59] UCB=0.9756 mu=0.6736 sigma=0.1510 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003459088566274435}
[2026-04-13 01:30:59] UCB=0.9569 mu=0.7051 sigma=0.1259 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001063579998619323}
[2026-04-13 01:30:59] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.004104239055479664, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:01] [AutoResearch] Launching job: n_steer=5 n_throttle=3 lr=0.004104
[2026-04-13 01:31:10] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:31:10] [AutoResearch] mean_reward=78.7617
[2026-04-13 01:31:10] [AutoResearch] === Trial 84 Summary ===
[2026-04-13 01:31:10] Total runs in history: 202
[2026-04-13 01:31:10] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:10] Top 5 results:
[2026-04-13 01:31:10] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:10] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:10] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:10] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:10] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:12]
[AutoResearch] ========== Trial 85/200 ==========
[2026-04-13 01:31:13] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:31:13] UCB=1.3608 mu=1.0230 sigma=0.1689 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0039008153530137113}
[2026-04-13 01:31:13] UCB=1.1797 mu=0.8681 sigma=0.1558 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0038490768722124614}
[2026-04-13 01:31:13] UCB=1.1329 mu=0.8375 sigma=0.1477 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.004010991858376517}
[2026-04-13 01:31:13] UCB=1.0679 mu=0.7339 sigma=0.1670 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.000972953367993339}
[2026-04-13 01:31:13] UCB=0.9660 mu=0.6276 sigma=0.1692 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.003477128874709628}
[2026-04-13 01:31:13] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0039008153530137113, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:15] [AutoResearch] Launching job: n_steer=5 n_throttle=3 lr=0.003901
[2026-04-13 01:31:23] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:31:23] [AutoResearch] mean_reward=69.7956
[2026-04-13 01:31:23] [AutoResearch] === Trial 85 Summary ===
[2026-04-13 01:31:23] Total runs in history: 203
[2026-04-13 01:31:23] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:23] Top 5 results:
[2026-04-13 01:31:23] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:23] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:23] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:23] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:23] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:25]
[AutoResearch] ========== Trial 86/200 ==========
[2026-04-13 01:31:25] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:31:25] UCB=1.2110 mu=0.9260 sigma=0.1425 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.004143267208460431}
[2026-04-13 01:31:25] UCB=1.0567 mu=0.7142 sigma=0.1712 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0009220078422145305}
[2026-04-13 01:31:25] UCB=1.0424 mu=0.7103 sigma=0.1660 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0019402731229971398}
[2026-04-13 01:31:25] UCB=0.9594 mu=0.6644 sigma=0.1475 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 5.1367391743226425e-05}
[2026-04-13 01:31:25] UCB=0.8903 mu=0.5751 sigma=0.1576 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.003844874192785262}
[2026-04-13 01:31:25] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.004143267208460431, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:27] [AutoResearch] Launching job: n_steer=5 n_throttle=3 lr=0.004143
[2026-04-13 01:31:35] [AutoResearch] Job finished in 8.0s, returncode=0
[2026-04-13 01:31:35] [AutoResearch] mean_reward=38.2877
[2026-04-13 01:31:35] [AutoResearch] === Trial 86 Summary ===
[2026-04-13 01:31:35] Total runs in history: 204
[2026-04-13 01:31:35] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:35] Top 5 results:
[2026-04-13 01:31:35] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:35] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:35] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:35] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:35] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:37]
[AutoResearch] ========== Trial 87/200 ==========
[2026-04-13 01:31:37] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:31:37] UCB=1.0492 mu=0.6921 sigma=0.1786 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004158452372162899}
[2026-04-13 01:31:37] UCB=1.0373 mu=0.7179 sigma=0.1597 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.003578824999326237}
[2026-04-13 01:31:37] UCB=0.9951 mu=0.6930 sigma=0.1511 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0009297466555471297}
[2026-04-13 01:31:37] UCB=0.9770 mu=0.8877 sigma=0.0446 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.0006137682830675293}
[2026-04-13 01:31:37] UCB=0.9283 mu=0.5800 sigma=0.1741 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0009220645190166993}
[2026-04-13 01:31:37] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004158452372162899, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:39] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.004158
[2026-04-13 01:31:48] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:31:48] [AutoResearch] mean_reward=51.627
[2026-04-13 01:31:48] [AutoResearch] === Trial 87 Summary ===
[2026-04-13 01:31:48] Total runs in history: 205
[2026-04-13 01:31:48] Best so far: mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:48] Top 5 results:
[2026-04-13 01:31:48] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:48] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:48] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:48] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:48] mean_reward=105.5329 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.921433664380339e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:50]
[AutoResearch] ========== Trial 88/200 ==========
[2026-04-13 01:31:50] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:31:50] UCB=1.0483 mu=0.7043 sigma=0.1720 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263}
[2026-04-13 01:31:50] UCB=0.9386 mu=0.5984 sigma=0.1701 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.003605205643479587}
[2026-04-13 01:31:50] UCB=0.9206 mu=0.6152 sigma=0.1527 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0013539755934280674}
[2026-04-13 01:31:50] UCB=0.8733 mu=0.5798 sigma=0.1468 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0020032834698165665}
[2026-04-13 01:31:50] UCB=0.8475 mu=0.5777 sigma=0.1349 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035215707422148724}
[2026-04-13 01:31:50] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:31:52] [AutoResearch] Launching job: n_steer=8 n_throttle=5 lr=0.001997
[2026-04-13 01:32:02] [AutoResearch] Job finished in 9.8s, returncode=0
[2026-04-13 01:32:02] [AutoResearch] mean_reward=125.5734
[2026-04-13 01:32:02] [AutoResearch] === Trial 88 Summary ===
[2026-04-13 01:32:02] Total runs in history: 206
[2026-04-13 01:32:02] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:02] Top 5 results:
[2026-04-13 01:32:02] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:02] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:02] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:02] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:02] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:04]
[AutoResearch] ========== Trial 89/200 ==========
[2026-04-13 01:32:04] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:32:04] UCB=1.3271 mu=1.0061 sigma=0.1605 params={'n_steer': 5, 'n_throttle': 5, 'learning_rate': 0.0038192905517542876}
[2026-04-13 01:32:04] UCB=1.2172 mu=0.8770 sigma=0.1701 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0020785386856975734}
[2026-04-13 01:32:04] UCB=1.1478 mu=0.8070 sigma=0.1704 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.003843126200374683}
[2026-04-13 01:32:04] UCB=1.1188 mu=0.8799 sigma=0.1195 params={'n_steer': 5, 'n_throttle': 5, 'learning_rate': 0.0034342476950443004}
[2026-04-13 01:32:04] UCB=1.0049 mu=0.6825 sigma=0.1612 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0037782080044796136}
[2026-04-13 01:32:04] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 5, 'learning_rate': 0.0038192905517542876, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:06] [AutoResearch] Launching job: n_steer=5 n_throttle=5 lr=0.003819
[2026-04-13 01:32:14] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:32:14] [AutoResearch] mean_reward=50.5648
[2026-04-13 01:32:14] [AutoResearch] === Trial 89 Summary ===
[2026-04-13 01:32:14] Total runs in history: 207
[2026-04-13 01:32:14] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:14] Top 5 results:
[2026-04-13 01:32:14] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:14] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:14] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:14] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:14] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:16]
[AutoResearch] ========== Trial 90/200 ==========
[2026-04-13 01:32:16] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:32:16] UCB=1.3031 mu=0.9894 sigma=0.1568 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001937220906338228}
[2026-04-13 01:32:16] UCB=1.2778 mu=0.9405 sigma=0.1687 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0019240959313003964}
[2026-04-13 01:32:16] UCB=1.2752 mu=0.9822 sigma=0.1465 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0018249984872385687}
[2026-04-13 01:32:16] UCB=1.1406 mu=0.8968 sigma=0.1219 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0018786705130629385}
[2026-04-13 01:32:16] UCB=1.0762 mu=0.7342 sigma=0.1710 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.004017428588520549}
[2026-04-13 01:32:16] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001937220906338228, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:18] [AutoResearch] Launching job: n_steer=8 n_throttle=5 lr=0.001937
[2026-04-13 01:32:27] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:32:27] [AutoResearch] mean_reward=56.0756
[2026-04-13 01:32:27] [AutoResearch] === Trial 90 Summary ===
[2026-04-13 01:32:27] Total runs in history: 208
[2026-04-13 01:32:27] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:27] Top 5 results:
[2026-04-13 01:32:27] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:27] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:27] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:27] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:27] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:29]
[AutoResearch] ========== Trial 91/200 ==========
[2026-04-13 01:32:29] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:32:29] UCB=1.0880 mu=0.7334 sigma=0.1773 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0035388036223637}
[2026-04-13 01:32:29] UCB=1.0574 mu=0.7000 sigma=0.1787 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0036052641284817395}
[2026-04-13 01:32:29] UCB=1.0231 mu=0.7223 sigma=0.1504 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004049050869987494}
[2026-04-13 01:32:29] UCB=0.9950 mu=0.6718 sigma=0.1616 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0007662501192556161}
[2026-04-13 01:32:29] UCB=0.9906 mu=0.7836 sigma=0.1035 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0034676243058722454}
[2026-04-13 01:32:29] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0035388036223637, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:31] [AutoResearch] Launching job: n_steer=6 n_throttle=2 lr=0.003539
[2026-04-13 01:32:39] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:32:39] [AutoResearch] mean_reward=55.5992
[2026-04-13 01:32:39] [AutoResearch] === Trial 91 Summary ===
[2026-04-13 01:32:39] Total runs in history: 209
[2026-04-13 01:32:39] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:39] Top 5 results:
[2026-04-13 01:32:39] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:39] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:39] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:39] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:39] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:41]
[AutoResearch] ========== Trial 92/200 ==========
[2026-04-13 01:32:41] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:32:41] UCB=1.0952 mu=0.8345 sigma=0.1303 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0021006038056172894}
[2026-04-13 01:32:41] UCB=1.0037 mu=0.6553 sigma=0.1742 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0011267897713360129}
[2026-04-13 01:32:41] UCB=0.9389 mu=0.5987 sigma=0.1701 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0036621868181581276}
[2026-04-13 01:32:41] UCB=0.8559 mu=0.5134 sigma=0.1713 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.001139441929295187}
[2026-04-13 01:32:41] UCB=0.8550 mu=0.6077 sigma=0.1237 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012346980512615785}
[2026-04-13 01:32:41] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0021006038056172894, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:43] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.002101
[2026-04-13 01:32:52] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:32:52] [AutoResearch] mean_reward=56.4164
[2026-04-13 01:32:52] [AutoResearch] === Trial 92 Summary ===
[2026-04-13 01:32:52] Total runs in history: 210
[2026-04-13 01:32:52] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:52] Top 5 results:
[2026-04-13 01:32:52] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:52] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:52] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:52] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:52] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:54]
[AutoResearch] ========== Trial 93/200 ==========
[2026-04-13 01:32:54] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:32:54] UCB=1.1803 mu=0.8168 sigma=0.1817 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003913075653306757}
[2026-04-13 01:32:54] UCB=1.0091 mu=0.7009 sigma=0.1541 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0019208000471361447}
[2026-04-13 01:32:54] UCB=0.9789 mu=0.6928 sigma=0.1430 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0009876550088847547}
[2026-04-13 01:32:54] UCB=0.9767 mu=0.6890 sigma=0.1439 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.002046253951646962}
[2026-04-13 01:32:54] UCB=0.9208 mu=0.7302 sigma=0.0953 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003743307618888262}
[2026-04-13 01:32:54] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003913075653306757, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:32:56] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.003913
[2026-04-13 01:33:05] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:33:05] [AutoResearch] mean_reward=86.2403
[2026-04-13 01:33:05] [AutoResearch] === Trial 93 Summary ===
[2026-04-13 01:33:05] Total runs in history: 211
[2026-04-13 01:33:05] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:05] Top 5 results:
[2026-04-13 01:33:05] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:05] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:05] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:05] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:05] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:07]
[AutoResearch] ========== Trial 94/200 ==========
[2026-04-13 01:33:07] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:33:07] UCB=1.2982 mu=1.0135 sigma=0.1424 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0037663797783529444}
[2026-04-13 01:33:07] UCB=1.2127 mu=0.9335 sigma=0.1396 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0041935472531786635}
[2026-04-13 01:33:07] UCB=1.1805 mu=0.8442 sigma=0.1682 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.003757901488391255}
[2026-04-13 01:33:07] UCB=1.0116 mu=0.7162 sigma=0.1477 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004925880851025704}
[2026-04-13 01:33:07] UCB=0.9581 mu=0.6300 sigma=0.1641 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003566574200062338}
[2026-04-13 01:33:07] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0037663797783529444, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:09] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.003766
[2026-04-13 01:33:18] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:33:18] [AutoResearch] mean_reward=70.4393
[2026-04-13 01:33:18] [AutoResearch] === Trial 94 Summary ===
[2026-04-13 01:33:18] Total runs in history: 212
[2026-04-13 01:33:18] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:18] Top 5 results:
[2026-04-13 01:33:18] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:18] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:18] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:18] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:18] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:20]
[AutoResearch] ========== Trial 95/200 ==========
[2026-04-13 01:33:20] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:33:20] UCB=0.9955 mu=0.7169 sigma=0.1393 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0019205885219095316}
[2026-04-13 01:33:20] UCB=0.9616 mu=0.6203 sigma=0.1706 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0009699595076067713}
[2026-04-13 01:33:20] UCB=0.9544 mu=0.6757 sigma=0.1393 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003494510846448695}
[2026-04-13 01:33:20] UCB=0.8929 mu=0.6195 sigma=0.1367 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0009718856828587633}
[2026-04-13 01:33:20] UCB=0.8924 mu=0.6615 sigma=0.1154 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0018656269121865008}
[2026-04-13 01:33:20] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0019205885219095316, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:22] [AutoResearch] Launching job: n_steer=7 n_throttle=5 lr=0.001921
[2026-04-13 01:33:31] [AutoResearch] Job finished in 8.9s, returncode=0
[2026-04-13 01:33:31] [AutoResearch] mean_reward=88.7377
[2026-04-13 01:33:31] [AutoResearch] === Trial 95 Summary ===
[2026-04-13 01:33:31] Total runs in history: 213
[2026-04-13 01:33:31] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:31] Top 5 results:
[2026-04-13 01:33:31] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:31] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:31] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:31] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:31] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:33]
[AutoResearch] ========== Trial 96/200 ==========
[2026-04-13 01:33:33] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:33:33] UCB=1.3685 mu=1.0068 sigma=0.1809 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004011941692652049}
[2026-04-13 01:33:33] UCB=1.2741 mu=0.9370 sigma=0.1685 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004112883849984757}
[2026-04-13 01:33:33] UCB=1.1645 mu=0.8448 sigma=0.1598 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0036536467318617994}
[2026-04-13 01:33:33] UCB=1.0295 mu=0.7012 sigma=0.1641 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.004005670126819884}
[2026-04-13 01:33:33] UCB=0.9116 mu=0.6398 sigma=0.1359 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.004998754001702071}
[2026-04-13 01:33:33] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004011941692652049, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:35] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.004012
[2026-04-13 01:33:44] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:33:44] [AutoResearch] mean_reward=52.9524
[2026-04-13 01:33:44] [AutoResearch] === Trial 96 Summary ===
[2026-04-13 01:33:44] Total runs in history: 214
[2026-04-13 01:33:44] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:44] Top 5 results:
[2026-04-13 01:33:44] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:44] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:44] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:44] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:44] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:46]
[AutoResearch] ========== Trial 97/200 ==========
[2026-04-13 01:33:46] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:33:46] UCB=1.3539 mu=1.0148 sigma=0.1696 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003986380872957056}
[2026-04-13 01:33:46] UCB=1.0353 mu=0.6775 sigma=0.1789 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0035734604736457087}
[2026-04-13 01:33:46] UCB=0.9902 mu=0.6526 sigma=0.1688 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0016360291705086484}
[2026-04-13 01:33:46] UCB=0.9612 mu=0.6183 sigma=0.1715 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0009632125010529058}
[2026-04-13 01:33:46] UCB=0.9601 mu=0.6974 sigma=0.1313 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010407362424779201}
[2026-04-13 01:33:46] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003986380872957056, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:48] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.003986
[2026-04-13 01:33:57] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:33:57] [AutoResearch] mean_reward=82.325
[2026-04-13 01:33:57] [AutoResearch] === Trial 97 Summary ===
[2026-04-13 01:33:57] Total runs in history: 215
[2026-04-13 01:33:57] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:57] Top 5 results:
[2026-04-13 01:33:57] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:57] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:57] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:57] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:57] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:33:59]
[AutoResearch] ========== Trial 98/200 ==========
[2026-04-13 01:33:59] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:33:59] UCB=1.2818 mu=0.9071 sigma=0.1874 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003726023318062714}
[2026-04-13 01:33:59] UCB=1.2358 mu=0.9661 sigma=0.1349 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0037834273062180295}
[2026-04-13 01:33:59] UCB=1.0678 mu=0.7925 sigma=0.1376 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004190654310114177}
[2026-04-13 01:33:59] UCB=0.9271 mu=0.7123 sigma=0.1074 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.00361087157693879}
[2026-04-13 01:33:59] UCB=0.9080 mu=0.5768 sigma=0.1656 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0009466366377835845}
[2026-04-13 01:33:59] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003726023318062714, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:01] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.003726
[2026-04-13 01:34:10] [AutoResearch] Job finished in 8.9s, returncode=0
[2026-04-13 01:34:10] [AutoResearch] mean_reward=64.1272
[2026-04-13 01:34:10] [AutoResearch] === Trial 98 Summary ===
[2026-04-13 01:34:10] Total runs in history: 216
[2026-04-13 01:34:10] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:10] Top 5 results:
[2026-04-13 01:34:10] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:10] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:10] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:10] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:10] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:12]
[AutoResearch] ========== Trial 99/200 ==========
[2026-04-13 01:34:12] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:34:12] UCB=1.0307 mu=0.7080 sigma=0.1613 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0022840543886122485}
[2026-04-13 01:34:12] UCB=0.9082 mu=0.5649 sigma=0.1717 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0036758625006716285}
[2026-04-13 01:34:12] UCB=0.9070 mu=0.7130 sigma=0.0970 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0022178442474185705}
[2026-04-13 01:34:12] UCB=0.8774 mu=0.5497 sigma=0.1639 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.003911849912546082}
[2026-04-13 01:34:12] UCB=0.8602 mu=0.5951 sigma=0.1326 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0006331478905184166}
[2026-04-13 01:34:12] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0022840543886122485, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:14] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.002284
[2026-04-13 01:34:22] [AutoResearch] Job finished in 7.8s, returncode=0
[2026-04-13 01:34:22] [AutoResearch] mean_reward=39.9343
[2026-04-13 01:34:22] [AutoResearch] === Trial 99 Summary ===
[2026-04-13 01:34:22] Total runs in history: 217
[2026-04-13 01:34:22] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:22] Top 5 results:
[2026-04-13 01:34:22] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:22] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:22] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:22] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:22] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:24]
[AutoResearch] ========== Trial 100/200 ==========
[2026-04-13 01:34:24] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:34:24] UCB=1.2107 mu=0.8559 sigma=0.1774 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003940169241721946}
[2026-04-13 01:34:24] UCB=1.0329 mu=0.6728 sigma=0.1800 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0036569303574516938}
[2026-04-13 01:34:24] UCB=1.0178 mu=0.6768 sigma=0.1705 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0022178146002845146}
[2026-04-13 01:34:24] UCB=0.9900 mu=0.6519 sigma=0.1691 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0008866125206447304}
[2026-04-13 01:34:24] UCB=0.9780 mu=0.6688 sigma=0.1546 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0036422541932803797}
[2026-04-13 01:34:24] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003940169241721946, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:26] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.003940
[2026-04-13 01:34:35] [AutoResearch] Job finished in 9.1s, returncode=0
[2026-04-13 01:34:35] [AutoResearch] mean_reward=73.4887
[2026-04-13 01:34:35] [AutoResearch] === Trial 100 Summary ===
[2026-04-13 01:34:35] Total runs in history: 218
[2026-04-13 01:34:35] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:35] Top 5 results:
[2026-04-13 01:34:35] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:35] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:35] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:35] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:35] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:37]
[AutoResearch] ========== Trial 101/200 ==========
[2026-04-13 01:34:37] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:34:37] UCB=1.2571 mu=0.8960 sigma=0.1806 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0037399306768376786}
[2026-04-13 01:34:37] UCB=1.2295 mu=0.8967 sigma=0.1664 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004021507420741623}
[2026-04-13 01:34:37] UCB=1.2042 mu=0.9877 sigma=0.1082 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003896119061894893}
[2026-04-13 01:34:37] UCB=1.0238 mu=0.7473 sigma=0.1383 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003687229564907118}
[2026-04-13 01:34:37] UCB=0.8948 mu=0.6522 sigma=0.1213 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004346205034164949}
[2026-04-13 01:34:37] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0037399306768376786, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:39] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.003740
[2026-04-13 01:34:48] [AutoResearch] Job finished in 9.1s, returncode=0
[2026-04-13 01:34:48] [AutoResearch] mean_reward=93.2171
[2026-04-13 01:34:48] [AutoResearch] === Trial 101 Summary ===
[2026-04-13 01:34:48] Total runs in history: 219
[2026-04-13 01:34:48] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:48] Top 5 results:
[2026-04-13 01:34:48] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:48] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:48] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:48] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:48] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:50]
[AutoResearch] ========== Trial 102/200 ==========
[2026-04-13 01:34:50] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:34:50] UCB=1.3475 mu=0.9851 sigma=0.1812 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003800360371605525}
[2026-04-13 01:34:50] UCB=1.1850 mu=0.9969 sigma=0.0941 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003939882128491643}
[2026-04-13 01:34:50] UCB=1.1534 mu=0.8078 sigma=0.1728 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003581088296934811}
[2026-04-13 01:34:50] UCB=1.1530 mu=0.7714 sigma=0.1908 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003537190243538867}
[2026-04-13 01:34:50] UCB=1.1026 mu=0.7714 sigma=0.1656 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.002046686920484412}
[2026-04-13 01:34:50] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003800360371605525, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:34:52] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.003800
[2026-04-13 01:35:01] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:35:01] [AutoResearch] mean_reward=74.1562
[2026-04-13 01:35:01] [AutoResearch] === Trial 102 Summary ===
[2026-04-13 01:35:01] Total runs in history: 220
[2026-04-13 01:35:01] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:01] Top 5 results:
[2026-04-13 01:35:01] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:01] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:01] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:01] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:01] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:03]
[AutoResearch] ========== Trial 103/200 ==========
[2026-04-13 01:35:03] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:35:03] UCB=1.1866 mu=0.8357 sigma=0.1755 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003993398991463532}
[2026-04-13 01:35:03] UCB=1.0361 mu=0.7175 sigma=0.1593 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0016750599480612927}
[2026-04-13 01:35:03] UCB=1.0172 mu=0.6515 sigma=0.1828 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0034752497507985723}
[2026-04-13 01:35:03] UCB=0.9603 mu=0.6455 sigma=0.1574 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0017089455139068185}
[2026-04-13 01:35:03] UCB=0.9556 mu=0.6729 sigma=0.1413 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0035391164238501175}
[2026-04-13 01:35:03] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003993398991463532, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:05] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.003993
[2026-04-13 01:35:15] [AutoResearch] Job finished in 9.4s, returncode=0
[2026-04-13 01:35:15] [AutoResearch] mean_reward=85.395
[2026-04-13 01:35:15] [AutoResearch] === Trial 103 Summary ===
[2026-04-13 01:35:15] Total runs in history: 221
[2026-04-13 01:35:15] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:15] Top 5 results:
[2026-04-13 01:35:15] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:15] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:15] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:15] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:15] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:17]
[AutoResearch] ========== Trial 104/200 ==========
[2026-04-13 01:35:17] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:35:17] UCB=1.0013 mu=0.7885 sigma=0.1064 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0018744928761491848}
[2026-04-13 01:35:17] UCB=0.9827 mu=0.8601 sigma=0.0613 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004972069545023164}
[2026-04-13 01:35:17] UCB=0.9496 mu=0.5595 sigma=0.1950 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0035229685691469128}
[2026-04-13 01:35:17] UCB=0.8573 mu=0.5158 sigma=0.1707 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.00227478563998339}
[2026-04-13 01:35:17] UCB=0.8317 mu=0.4943 sigma=0.1687 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003974829277487949}
[2026-04-13 01:35:17] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0018744928761491848, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:19] [AutoResearch] Launching job: n_steer=7 n_throttle=5 lr=0.001874
[2026-04-13 01:35:28] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:35:28] [AutoResearch] mean_reward=80.8291
[2026-04-13 01:35:28] [AutoResearch] === Trial 104 Summary ===
[2026-04-13 01:35:28] Total runs in history: 222
[2026-04-13 01:35:28] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:28] Top 5 results:
[2026-04-13 01:35:28] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:28] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:28] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:28] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:28] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:30]
[AutoResearch] ========== Trial 105/200 ==========
[2026-04-13 01:35:30] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:35:30] UCB=1.3532 mu=0.9973 sigma=0.1780 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003788515151195257}
[2026-04-13 01:35:30] UCB=1.1590 mu=0.9283 sigma=0.1154 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0036643427056572485}
[2026-04-13 01:35:30] UCB=0.9865 mu=0.6399 sigma=0.1733 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0010878749046863397}
[2026-04-13 01:35:30] UCB=0.9811 mu=0.7520 sigma=0.1146 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003946593368798988}
[2026-04-13 01:35:30] UCB=0.9333 mu=0.7129 sigma=0.1102 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0037576530476802397}
[2026-04-13 01:35:30] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003788515151195257, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:32] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.003789
[2026-04-13 01:35:41] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:35:41] [AutoResearch] mean_reward=79.304
[2026-04-13 01:35:41] [AutoResearch] === Trial 105 Summary ===
[2026-04-13 01:35:41] Total runs in history: 223
[2026-04-13 01:35:41] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:41] Top 5 results:
[2026-04-13 01:35:41] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:41] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:41] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:41] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:41] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:43]
[AutoResearch] ========== Trial 106/200 ==========
[2026-04-13 01:35:43] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:35:43] UCB=1.2707 mu=1.0313 sigma=0.1197 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003987200299359022}
[2026-04-13 01:35:43] UCB=1.1875 mu=0.8414 sigma=0.1731 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003966296752847896}
[2026-04-13 01:35:43] UCB=1.1371 mu=0.8629 sigma=0.1371 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0037470161062659276}
[2026-04-13 01:35:43] UCB=1.0331 mu=0.6747 sigma=0.1792 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0035930313965585503}
[2026-04-13 01:35:43] UCB=1.0206 mu=0.9477 sigma=0.0365 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0039652781376341055}
[2026-04-13 01:35:43] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003987200299359022, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:45] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.003987
[2026-04-13 01:35:54] [AutoResearch] Job finished in 9.3s, returncode=0
[2026-04-13 01:35:54] [AutoResearch] mean_reward=102.7403
[2026-04-13 01:35:54] [AutoResearch] === Trial 106 Summary ===
[2026-04-13 01:35:54] Total runs in history: 224
[2026-04-13 01:35:54] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:54] Top 5 results:
[2026-04-13 01:35:54] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:54] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:54] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:54] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:54] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:56]
[AutoResearch] ========== Trial 107/200 ==========
[2026-04-13 01:35:56] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:35:56] UCB=1.3444 mu=0.9848 sigma=0.1798 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003775582400866722}
[2026-04-13 01:35:56] UCB=1.1569 mu=0.8078 sigma=0.1746 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0038922263735888135}
[2026-04-13 01:35:56] UCB=1.1089 mu=0.7259 sigma=0.1915 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0035582623652896505}
[2026-04-13 01:35:56] UCB=1.0771 mu=0.9958 sigma=0.0406 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004016353777846578}
[2026-04-13 01:35:56] UCB=1.0171 mu=0.8469 sigma=0.0851 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.003962041404890936}
[2026-04-13 01:35:56] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003775582400866722, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:35:58] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.003776
[2026-04-13 01:36:07] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:36:07] [AutoResearch] mean_reward=55.5832
[2026-04-13 01:36:07] [AutoResearch] === Trial 107 Summary ===
[2026-04-13 01:36:07] Total runs in history: 225
[2026-04-13 01:36:07] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:07] Top 5 results:
[2026-04-13 01:36:07] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:07] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:07] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:07] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:07] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:09]
[AutoResearch] ========== Trial 108/200 ==========
[2026-04-13 01:36:09] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:36:09] UCB=1.5226 mu=1.1814 sigma=0.1706 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003927615678894278}
[2026-04-13 01:36:09] UCB=0.9741 mu=0.6266 sigma=0.1737 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0016442213117861368}
[2026-04-13 01:36:09] UCB=0.9454 mu=0.6310 sigma=0.1572 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0017025025429263073}
[2026-04-13 01:36:09] UCB=0.9310 mu=0.5692 sigma=0.1809 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004212447362131388}
[2026-04-13 01:36:09] UCB=0.9252 mu=0.5795 sigma=0.1728 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.002282489033939961}
[2026-04-13 01:36:09] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003927615678894278, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:11] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.003928
[2026-04-13 01:36:20] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:36:20] [AutoResearch] mean_reward=64.6677
[2026-04-13 01:36:20] [AutoResearch] === Trial 108 Summary ===
[2026-04-13 01:36:20] Total runs in history: 226
[2026-04-13 01:36:20] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:20] Top 5 results:
[2026-04-13 01:36:20] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:20] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:20] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:20] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:20] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:22]
[AutoResearch] ========== Trial 109/200 ==========
[2026-04-13 01:36:22] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:36:22] UCB=1.1106 mu=0.7951 sigma=0.1578 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0018965129732777149}
[2026-04-13 01:36:22] UCB=1.0032 mu=0.7558 sigma=0.1237 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0017094709361207478}
[2026-04-13 01:36:22] UCB=0.9255 mu=0.6034 sigma=0.1610 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.00079123438695375}
[2026-04-13 01:36:22] UCB=0.9111 mu=0.6052 sigma=0.1530 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004914984605068698}
[2026-04-13 01:36:22] UCB=0.8857 mu=0.6268 sigma=0.1294 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003612036890729518}
[2026-04-13 01:36:22] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0018965129732777149, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:24] [AutoResearch] Launching job: n_steer=8 n_throttle=5 lr=0.001897
[2026-04-13 01:36:33] [AutoResearch] Job finished in 8.9s, returncode=0
[2026-04-13 01:36:33] [AutoResearch] mean_reward=75.632
[2026-04-13 01:36:33] [AutoResearch] === Trial 109 Summary ===
[2026-04-13 01:36:33] Total runs in history: 227
[2026-04-13 01:36:33] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:33] Top 5 results:
[2026-04-13 01:36:33] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:33] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:33] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:33] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:33] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:35]
[AutoResearch] ========== Trial 110/200 ==========
[2026-04-13 01:36:35] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:36:35] UCB=1.0489 mu=0.7775 sigma=0.1357 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0036705305975700507}
[2026-04-13 01:36:35] UCB=1.0166 mu=0.6831 sigma=0.1667 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.00399345019892377}
[2026-04-13 01:36:35] UCB=1.0080 mu=0.6650 sigma=0.1715 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0018578382232881552}
[2026-04-13 01:36:35] UCB=0.9894 mu=0.6810 sigma=0.1542 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010172280987666084}
[2026-04-13 01:36:35] UCB=0.9791 mu=0.5967 sigma=0.1912 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0035502199633150825}
[2026-04-13 01:36:35] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0036705305975700507, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:37] [AutoResearch] Launching job: n_steer=5 n_throttle=3 lr=0.003671
[2026-04-13 01:36:46] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:36:46] [AutoResearch] mean_reward=54.5385
[2026-04-13 01:36:46] [AutoResearch] === Trial 110 Summary ===
[2026-04-13 01:36:46] Total runs in history: 228
[2026-04-13 01:36:46] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:46] Top 5 results:
[2026-04-13 01:36:46] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:46] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:46] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:46] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:46] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:48]
[AutoResearch] ========== Trial 111/200 ==========
[2026-04-13 01:36:48] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:36:48] UCB=0.9373 mu=0.5867 sigma=0.1753 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.004078540687137955}
[2026-04-13 01:36:48] UCB=0.9349 mu=0.5816 sigma=0.1766 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004185384730013951}
[2026-04-13 01:36:48] UCB=0.8698 mu=0.6004 sigma=0.1347 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035331180083151966}
[2026-04-13 01:36:48] UCB=0.8520 mu=0.6989 sigma=0.0765 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.0049874274999701365}
[2026-04-13 01:36:48] UCB=0.8201 mu=0.7413 sigma=0.0394 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012103567753799101}
[2026-04-13 01:36:48] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.004078540687137955, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:50] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.004079
[2026-04-13 01:36:58] [AutoResearch] Job finished in 8.2s, returncode=0
[2026-04-13 01:36:58] [AutoResearch] mean_reward=43.089
[2026-04-13 01:36:58] [AutoResearch] === Trial 111 Summary ===
[2026-04-13 01:36:58] Total runs in history: 229
[2026-04-13 01:36:58] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:58] Top 5 results:
[2026-04-13 01:36:58] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:58] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:58] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:58] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:36:58] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:00]
[AutoResearch] ========== Trial 112/200 ==========
[2026-04-13 01:37:00] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:37:00] UCB=1.1069 mu=0.9716 sigma=0.0677 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004155348505076391}
[2026-04-13 01:37:00] UCB=1.0274 mu=0.6905 sigma=0.1685 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.001960495892245515}
[2026-04-13 01:37:00] UCB=1.0241 mu=0.7317 sigma=0.1462 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020537902034990833}
[2026-04-13 01:37:00] UCB=0.9891 mu=0.6876 sigma=0.1508 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0036495021345474554}
[2026-04-13 01:37:00] UCB=0.9253 mu=0.6385 sigma=0.1434 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0009615955851657893}
[2026-04-13 01:37:00] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004155348505076391, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:02] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.004155
[2026-04-13 01:37:11] [AutoResearch] Job finished in 9.1s, returncode=0
[2026-04-13 01:37:11] [AutoResearch] mean_reward=82.5112
[2026-04-13 01:37:11] [AutoResearch] === Trial 112 Summary ===
[2026-04-13 01:37:11] Total runs in history: 230
[2026-04-13 01:37:11] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:11] Top 5 results:
[2026-04-13 01:37:11] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:11] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:11] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:11] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:11] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:13]
[AutoResearch] ========== Trial 113/200 ==========
[2026-04-13 01:37:13] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:37:13] UCB=1.2423 mu=0.8843 sigma=0.1790 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0037384159203696418}
[2026-04-13 01:37:13] UCB=1.0087 mu=0.6613 sigma=0.1737 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010453486388160205}
[2026-04-13 01:37:13] UCB=0.9567 mu=0.6589 sigma=0.1489 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0009146666382257065}
[2026-04-13 01:37:13] UCB=0.9269 mu=0.5800 sigma=0.1734 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0017087300744263918}
[2026-04-13 01:37:13] UCB=0.9104 mu=0.5818 sigma=0.1643 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0016102722309113645}
[2026-04-13 01:37:13] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0037384159203696418, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:15] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.003738
[2026-04-13 01:37:23] [AutoResearch] Job finished in 8.2s, returncode=0
[2026-04-13 01:37:23] [AutoResearch] mean_reward=50.4147
[2026-04-13 01:37:23] [AutoResearch] === Trial 113 Summary ===
[2026-04-13 01:37:23] Total runs in history: 231
[2026-04-13 01:37:23] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:23] Top 5 results:
[2026-04-13 01:37:23] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:23] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:23] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:23] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:23] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:25]
[AutoResearch] ========== Trial 114/200 ==========
[2026-04-13 01:37:26] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:37:26] UCB=0.9679 mu=0.6539 sigma=0.1570 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004386658229439259}
[2026-04-13 01:37:26] UCB=0.9408 mu=0.6412 sigma=0.1498 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0008890733256533982}
[2026-04-13 01:37:26] UCB=0.9279 mu=0.5886 sigma=0.1697 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037824753763798457}
[2026-04-13 01:37:26] UCB=0.9012 mu=0.8180 sigma=0.0416 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004919359939863535}
[2026-04-13 01:37:26] UCB=0.8685 mu=0.5212 sigma=0.1736 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0006762744817343475}
[2026-04-13 01:37:26] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004386658229439259, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:28] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.004387
[2026-04-13 01:37:36] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:37:36] [AutoResearch] mean_reward=59.8822
[2026-04-13 01:37:36] [AutoResearch] === Trial 114 Summary ===
[2026-04-13 01:37:36] Total runs in history: 232
[2026-04-13 01:37:36] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:36] Top 5 results:
[2026-04-13 01:37:36] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:36] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:36] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:36] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:36] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:38]
[AutoResearch] ========== Trial 115/200 ==========
[2026-04-13 01:37:38] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:37:38] UCB=1.1989 mu=0.9062 sigma=0.1463 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0038872570535504907}
[2026-04-13 01:37:38] UCB=1.1106 mu=0.7829 sigma=0.1639 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020269719310821856}
[2026-04-13 01:37:38] UCB=1.0723 mu=0.7657 sigma=0.1533 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020288925830227386}
[2026-04-13 01:37:38] UCB=1.0341 mu=0.6882 sigma=0.1729 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0037782413233218514}
[2026-04-13 01:37:38] UCB=1.0214 mu=0.8847 sigma=0.0683 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0039958134653334905}
[2026-04-13 01:37:38] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0038872570535504907, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:40] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.003887
[2026-04-13 01:37:49] [AutoResearch] Job finished in 8.9s, returncode=0
[2026-04-13 01:37:49] [AutoResearch] mean_reward=56.3685
[2026-04-13 01:37:49] [AutoResearch] === Trial 115 Summary ===
[2026-04-13 01:37:49] Total runs in history: 233
[2026-04-13 01:37:49] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:49] Top 5 results:
[2026-04-13 01:37:49] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:49] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:49] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:49] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:49] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:51]
[AutoResearch] ========== Trial 116/200 ==========
[2026-04-13 01:37:51] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:37:51] UCB=1.0831 mu=0.7385 sigma=0.1723 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020230081156069335}
[2026-04-13 01:37:51] UCB=0.9977 mu=0.7475 sigma=0.1251 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 7.703273875854807e-05}
[2026-04-13 01:37:51] UCB=0.9211 mu=0.5723 sigma=0.1744 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004213271649916171}
[2026-04-13 01:37:51] UCB=0.9190 mu=0.5798 sigma=0.1696 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.001096010552826292}
[2026-04-13 01:37:51] UCB=0.9102 mu=0.6121 sigma=0.1490 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0019484852788500078}
[2026-04-13 01:37:51] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020230081156069335, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:37:53] [AutoResearch] Launching job: n_steer=8 n_throttle=5 lr=0.002023
[2026-04-13 01:38:01] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:38:01] [AutoResearch] mean_reward=69.7932
[2026-04-13 01:38:01] [AutoResearch] === Trial 116 Summary ===
[2026-04-13 01:38:01] Total runs in history: 234
[2026-04-13 01:38:01] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:01] Top 5 results:
[2026-04-13 01:38:01] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:01] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:01] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:01] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:01] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:03]
[AutoResearch] ========== Trial 117/200 ==========
[2026-04-13 01:38:04] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:38:04] UCB=0.9092 mu=0.6001 sigma=0.1546 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.001986395146659377}
[2026-04-13 01:38:04] UCB=0.8605 mu=0.5163 sigma=0.1721 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0037914921190741542}
[2026-04-13 01:38:04] UCB=0.8180 mu=0.4771 sigma=0.1705 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0011530657088124206}
[2026-04-13 01:38:04] UCB=0.8151 mu=0.4754 sigma=0.1699 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003434185533238493}
[2026-04-13 01:38:04] UCB=0.8078 mu=0.5012 sigma=0.1533 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.00374992834233124}
[2026-04-13 01:38:04] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.001986395146659377, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:06] [AutoResearch] Launching job: n_steer=7 n_throttle=5 lr=0.001986
[2026-04-13 01:38:14] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:38:14] [AutoResearch] mean_reward=71.8818
[2026-04-13 01:38:14] [AutoResearch] === Trial 117 Summary ===
[2026-04-13 01:38:14] Total runs in history: 235
[2026-04-13 01:38:14] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:14] Top 5 results:
[2026-04-13 01:38:14] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:14] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:14] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:14] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:14] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:16]
[AutoResearch] ========== Trial 118/200 ==========
[2026-04-13 01:38:16] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:38:16] UCB=1.1845 mu=0.8246 sigma=0.1800 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0040258414587434855}
[2026-04-13 01:38:16] UCB=1.1736 mu=0.8342 sigma=0.1697 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004027108616647908}
[2026-04-13 01:38:16] UCB=1.0043 mu=0.7801 sigma=0.1121 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0038558067074198734}
[2026-04-13 01:38:16] UCB=0.9914 mu=0.6924 sigma=0.1495 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004209099929164683}
[2026-04-13 01:38:16] UCB=0.9424 mu=0.6045 sigma=0.1689 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0010079539178812678}
[2026-04-13 01:38:16] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0040258414587434855, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:18] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.004026
[2026-04-13 01:38:27] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:38:27] [AutoResearch] mean_reward=64.2327
[2026-04-13 01:38:27] [AutoResearch] === Trial 118 Summary ===
[2026-04-13 01:38:27] Total runs in history: 236
[2026-04-13 01:38:27] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:27] Top 5 results:
[2026-04-13 01:38:27] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:27] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:27] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:27] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:27] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:29]
[AutoResearch] ========== Trial 119/200 ==========
[2026-04-13 01:38:29] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:38:29] UCB=0.9968 mu=0.7951 sigma=0.1009 params={'n_steer': 6, 'n_throttle': 5, 'learning_rate': 0.0028061703731165105}
[2026-04-13 01:38:29] UCB=0.9610 mu=0.5936 sigma=0.1837 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0036100860207770183}
[2026-04-13 01:38:29] UCB=0.9066 mu=0.5647 sigma=0.1710 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0011979777716310113}
[2026-04-13 01:38:29] UCB=0.8962 mu=0.5704 sigma=0.1629 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0017173258709371447}
[2026-04-13 01:38:29] UCB=0.8959 mu=0.7277 sigma=0.0841 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0018397093119235795}
[2026-04-13 01:38:29] [AutoResearch] Proposed params: {'n_steer': 6, 'n_throttle': 5, 'learning_rate': 0.0028061703731165105, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:31] [AutoResearch] Launching job: n_steer=6 n_throttle=5 lr=0.002806
[2026-04-13 01:38:40] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:38:40] [AutoResearch] mean_reward=69.543
[2026-04-13 01:38:40] [AutoResearch] === Trial 119 Summary ===
[2026-04-13 01:38:40] Total runs in history: 237
[2026-04-13 01:38:40] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:40] Top 5 results:
[2026-04-13 01:38:40] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:40] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:40] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:40] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:40] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:42]
[AutoResearch] ========== Trial 120/200 ==========
[2026-04-13 01:38:42] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:38:42] UCB=0.9609 mu=0.6490 sigma=0.1560 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0008528110846352778}
[2026-04-13 01:38:42] UCB=0.9066 mu=0.5746 sigma=0.1660 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0011090617952286084}
[2026-04-13 01:38:42] UCB=0.8755 mu=0.5380 sigma=0.1688 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0006716334196276739}
[2026-04-13 01:38:42] UCB=0.8730 mu=0.6498 sigma=0.1116 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.0049859449739461945}
[2026-04-13 01:38:42] UCB=0.8644 mu=0.5476 sigma=0.1584 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0013425046931496896}
[2026-04-13 01:38:42] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0008528110846352778, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:44] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.000853
[2026-04-13 01:38:53] [AutoResearch] Job finished in 9.1s, returncode=0
[2026-04-13 01:38:53] [AutoResearch] mean_reward=88.4159
[2026-04-13 01:38:53] [AutoResearch] === Trial 120 Summary ===
[2026-04-13 01:38:53] Total runs in history: 238
[2026-04-13 01:38:53] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:53] Top 5 results:
[2026-04-13 01:38:53] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:53] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:53] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:53] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:53] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:55]
[AutoResearch] ========== Trial 121/200 ==========
[2026-04-13 01:38:55] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:38:55] UCB=1.1279 mu=0.7879 sigma=0.1700 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0038001334263111624}
[2026-04-13 01:38:55] UCB=1.0797 mu=0.7090 sigma=0.1853 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003716165645071121}
[2026-04-13 01:38:55] UCB=1.0249 mu=0.6802 sigma=0.1724 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0007351348652005105}
[2026-04-13 01:38:55] UCB=0.9582 mu=0.6193 sigma=0.1695 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0007563238557602491}
[2026-04-13 01:38:55] UCB=0.8827 mu=0.6623 sigma=0.1102 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0021070170195671614}
[2026-04-13 01:38:55] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0038001334263111624, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:38:57] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.003800
[2026-04-13 01:39:06] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:39:06] [AutoResearch] mean_reward=84.615
[2026-04-13 01:39:06] [AutoResearch] === Trial 121 Summary ===
[2026-04-13 01:39:06] Total runs in history: 239
[2026-04-13 01:39:06] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:06] Top 5 results:
[2026-04-13 01:39:06] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:06] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:06] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:06] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:06] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:08]
[AutoResearch] ========== Trial 122/200 ==========
[2026-04-13 01:39:08] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:39:08] UCB=1.1540 mu=0.8088 sigma=0.1726 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0018554613943404113}
[2026-04-13 01:39:08] UCB=0.9199 mu=0.5851 sigma=0.1674 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0019226001184988913}
[2026-04-13 01:39:08] UCB=0.8726 mu=0.5927 sigma=0.1399 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0038489199749177833}
[2026-04-13 01:39:08] UCB=0.8708 mu=0.5267 sigma=0.1720 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001681699162271549}
[2026-04-13 01:39:08] UCB=0.7634 mu=0.4192 sigma=0.1721 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004181416278950615}
[2026-04-13 01:39:08] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0018554613943404113, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:10] [AutoResearch] Launching job: n_steer=7 n_throttle=5 lr=0.001855
[2026-04-13 01:39:19] [AutoResearch] Job finished in 8.9s, returncode=0
[2026-04-13 01:39:19] [AutoResearch] mean_reward=83.0335
[2026-04-13 01:39:19] [AutoResearch] === Trial 122 Summary ===
[2026-04-13 01:39:19] Total runs in history: 240
[2026-04-13 01:39:19] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:19] Top 5 results:
[2026-04-13 01:39:19] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:19] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:19] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:19] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:19] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:21]
[AutoResearch] ========== Trial 123/200 ==========
[2026-04-13 01:39:21] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:39:21] UCB=1.0505 mu=0.7006 sigma=0.1749 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0038610695700700506}
[2026-04-13 01:39:21] UCB=0.9945 mu=0.7123 sigma=0.1411 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003749335480045088}
[2026-04-13 01:39:21] UCB=0.9537 mu=0.6521 sigma=0.1508 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0009615673275003777}
[2026-04-13 01:39:21] UCB=0.9386 mu=0.6848 sigma=0.1269 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0020399425604693488}
[2026-04-13 01:39:21] UCB=0.9035 mu=0.6799 sigma=0.1118 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020775382726420347}
[2026-04-13 01:39:21] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0038610695700700506, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:23] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.003861
[2026-04-13 01:39:32] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:39:32] [AutoResearch] mean_reward=58.8867
[2026-04-13 01:39:32] [AutoResearch] === Trial 123 Summary ===
[2026-04-13 01:39:32] Total runs in history: 241
[2026-04-13 01:39:32] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:32] Top 5 results:
[2026-04-13 01:39:32] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:32] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:32] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:32] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:32] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:34]
[AutoResearch] ========== Trial 124/200 ==========
[2026-04-13 01:39:34] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:39:34] UCB=1.1703 mu=0.8851 sigma=0.1426 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0018566411536533602}
[2026-04-13 01:39:34] UCB=1.0866 mu=0.9545 sigma=0.0661 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004981391188561929}
[2026-04-13 01:39:34] UCB=1.0817 mu=0.7746 sigma=0.1535 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003947517217025597}
[2026-04-13 01:39:34] UCB=0.9495 mu=0.7484 sigma=0.1005 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.004980263353644699}
[2026-04-13 01:39:34] UCB=0.9053 mu=0.6325 sigma=0.1364 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0022975321764403993}
[2026-04-13 01:39:34] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0018566411536533602, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:36] [AutoResearch] Launching job: n_steer=8 n_throttle=5 lr=0.001857
[2026-04-13 01:39:44] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:39:44] [AutoResearch] mean_reward=56.839
[2026-04-13 01:39:44] [AutoResearch] === Trial 124 Summary ===
[2026-04-13 01:39:44] Total runs in history: 242
[2026-04-13 01:39:44] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:44] Top 5 results:
[2026-04-13 01:39:44] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:44] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:44] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:44] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:44] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:46]
[AutoResearch] ========== Trial 125/200 ==========
[2026-04-13 01:39:46] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:39:46] UCB=1.1201 mu=0.7743 sigma=0.1729 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.001849059250410137}
[2026-04-13 01:39:46] UCB=1.0901 mu=0.7264 sigma=0.1818 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003822591950597879}
[2026-04-13 01:39:46] UCB=0.9973 mu=0.6950 sigma=0.1511 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0010927245694079787}
[2026-04-13 01:39:46] UCB=0.9911 mu=0.6285 sigma=0.1813 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0036699090525270095}
[2026-04-13 01:39:46] UCB=0.8810 mu=0.5800 sigma=0.1505 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012317073543293732}
[2026-04-13 01:39:46] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.001849059250410137, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:48] [AutoResearch] Launching job: n_steer=7 n_throttle=5 lr=0.001849
[2026-04-13 01:39:56] [AutoResearch] Job finished in 7.9s, returncode=0
[2026-04-13 01:39:56] [AutoResearch] mean_reward=46.1192
[2026-04-13 01:39:56] [AutoResearch] === Trial 125 Summary ===
[2026-04-13 01:39:56] Total runs in history: 243
[2026-04-13 01:39:56] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:56] Top 5 results:
[2026-04-13 01:39:56] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:56] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:56] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:56] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:56] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:39:58]
[AutoResearch] ========== Trial 126/200 ==========
[2026-04-13 01:39:58] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:39:58] UCB=1.1800 mu=0.8286 sigma=0.1757 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0038351549981542003}
[2026-04-13 01:39:58] UCB=1.1033 mu=0.7599 sigma=0.1717 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.002001200254453849}
[2026-04-13 01:39:58] UCB=0.9466 mu=0.6042 sigma=0.1712 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003658508321851182}
[2026-04-13 01:39:58] UCB=0.8280 mu=0.5060 sigma=0.1610 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0013504935582237707}
[2026-04-13 01:39:58] UCB=0.8274 mu=0.6185 sigma=0.1045 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0010395074012916408}
[2026-04-13 01:39:58] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0038351549981542003, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:00] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.003835
[2026-04-13 01:40:10] [AutoResearch] Job finished in 9.6s, returncode=0
[2026-04-13 01:40:10] [AutoResearch] mean_reward=90.2338
[2026-04-13 01:40:10] [AutoResearch] === Trial 126 Summary ===
[2026-04-13 01:40:10] Total runs in history: 244
[2026-04-13 01:40:10] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:10] Top 5 results:
[2026-04-13 01:40:10] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:10] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:10] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:10] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:10] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:12]
[AutoResearch] ========== Trial 127/200 ==========
[2026-04-13 01:40:12] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:40:12] UCB=1.0270 mu=0.6905 sigma=0.1682 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020959092272320582}
[2026-04-13 01:40:12] UCB=0.9534 mu=0.6054 sigma=0.1740 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004128990435571124}
[2026-04-13 01:40:12] UCB=0.8675 mu=0.5359 sigma=0.1658 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0006478408159984413}
[2026-04-13 01:40:12] UCB=0.8629 mu=0.5158 sigma=0.1736 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0006058169972894313}
[2026-04-13 01:40:12] UCB=0.8548 mu=0.5128 sigma=0.1710 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0024183183616760823}
[2026-04-13 01:40:12] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020959092272320582, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:14] [AutoResearch] Launching job: n_steer=8 n_throttle=5 lr=0.002096
[2026-04-13 01:40:23] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:40:23] [AutoResearch] mean_reward=66.1001
[2026-04-13 01:40:23] [AutoResearch] === Trial 127 Summary ===
[2026-04-13 01:40:23] Total runs in history: 245
[2026-04-13 01:40:23] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:23] Top 5 results:
[2026-04-13 01:40:23] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:23] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:23] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:23] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:23] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:25]
[AutoResearch] ========== Trial 128/200 ==========
[2026-04-13 01:40:25] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:40:25] UCB=1.0365 mu=0.7375 sigma=0.1495 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004102273378470343}
[2026-04-13 01:40:25] UCB=1.0235 mu=0.8410 sigma=0.0912 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004964254218401196}
[2026-04-13 01:40:25] UCB=0.8535 mu=0.5780 sigma=0.1377 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035570707267241156}
[2026-04-13 01:40:25] UCB=0.8486 mu=0.7518 sigma=0.0484 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0014882515895030537}
[2026-04-13 01:40:25] UCB=0.8398 mu=0.5957 sigma=0.1221 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0033997611388226637}
[2026-04-13 01:40:25] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004102273378470343, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:27] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.004102
[2026-04-13 01:40:36] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:40:36] [AutoResearch] mean_reward=76.9958
[2026-04-13 01:40:36] [AutoResearch] === Trial 128 Summary ===
[2026-04-13 01:40:36] Total runs in history: 246
[2026-04-13 01:40:36] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:36] Top 5 results:
[2026-04-13 01:40:36] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:36] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:36] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:36] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:36] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:38]
[AutoResearch] ========== Trial 129/200 ==========
[2026-04-13 01:40:38] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:40:38] UCB=1.0172 mu=0.7127 sigma=0.1522 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0038041411666529575}
[2026-04-13 01:40:38] UCB=0.9270 mu=0.5951 sigma=0.1659 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0010035366674232592}
[2026-04-13 01:40:38] UCB=0.9206 mu=0.5904 sigma=0.1651 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0007274983433505704}
[2026-04-13 01:40:38] UCB=0.8945 mu=0.5437 sigma=0.1754 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004042389410219968}
[2026-04-13 01:40:38] UCB=0.8565 mu=0.5472 sigma=0.1546 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0018454332683544878}
[2026-04-13 01:40:38] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0038041411666529575, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:40] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.003804
[2026-04-13 01:40:49] [AutoResearch] Job finished in 9.3s, returncode=0
[2026-04-13 01:40:49] [AutoResearch] mean_reward=88.6229
[2026-04-13 01:40:49] [AutoResearch] === Trial 129 Summary ===
[2026-04-13 01:40:49] Total runs in history: 247
[2026-04-13 01:40:49] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:49] Top 5 results:
[2026-04-13 01:40:49] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:49] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:49] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:49] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:49] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:51]
[AutoResearch] ========== Trial 130/200 ==========
[2026-04-13 01:40:51] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:40:51] UCB=1.1896 mu=0.8225 sigma=0.1835 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003799123148974186}
[2026-04-13 01:40:51] UCB=1.0409 mu=0.7006 sigma=0.1701 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004110569940599718}
[2026-04-13 01:40:51] UCB=1.0296 mu=0.6636 sigma=0.1830 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003682868499527552}
[2026-04-13 01:40:51] UCB=1.0067 mu=0.6518 sigma=0.1774 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0036419887046515134}
[2026-04-13 01:40:51] UCB=1.0034 mu=0.6592 sigma=0.1721 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003814405593260346}
[2026-04-13 01:40:51] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003799123148974186, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:40:53] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.003799
[2026-04-13 01:41:03] [AutoResearch] Job finished in 9.6s, returncode=0
[2026-04-13 01:41:03] [AutoResearch] mean_reward=103.6463
[2026-04-13 01:41:03] [AutoResearch] === Trial 130 Summary ===
[2026-04-13 01:41:03] Total runs in history: 248
[2026-04-13 01:41:03] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:03] Top 5 results:
[2026-04-13 01:41:03] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:03] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:03] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:03] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:03] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:05]
[AutoResearch] ========== Trial 131/200 ==========
[2026-04-13 01:41:05] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:41:05] UCB=1.1985 mu=0.8288 sigma=0.1848 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0037435444308027113}
[2026-04-13 01:41:05] UCB=1.0447 mu=0.6862 sigma=0.1793 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0037405881542587143}
[2026-04-13 01:41:05] UCB=1.0356 mu=0.7315 sigma=0.1521 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004953922797551403}
[2026-04-13 01:41:05] UCB=0.9351 mu=0.5984 sigma=0.1684 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0007190405544208439}
[2026-04-13 01:41:05] UCB=0.9213 mu=0.6594 sigma=0.1310 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0035780453336246116}
[2026-04-13 01:41:05] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0037435444308027113, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:07] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.003744
[2026-04-13 01:41:16] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:41:16] [AutoResearch] mean_reward=63.0327
[2026-04-13 01:41:16] [AutoResearch] === Trial 131 Summary ===
[2026-04-13 01:41:16] Total runs in history: 249
[2026-04-13 01:41:16] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:16] Top 5 results:
[2026-04-13 01:41:16] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:16] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:16] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:16] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:16] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:18]
[AutoResearch] ========== Trial 132/200 ==========
[2026-04-13 01:41:18] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:41:18] UCB=0.9920 mu=0.6790 sigma=0.1565 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003919406024189691}
[2026-04-13 01:41:18] UCB=0.9712 mu=0.6167 sigma=0.1772 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.003734567638763917}
[2026-04-13 01:41:18] UCB=0.9678 mu=0.6538 sigma=0.1570 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004168499347792139}
[2026-04-13 01:41:18] UCB=0.8773 mu=0.5696 sigma=0.1539 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0037920258959817984}
[2026-04-13 01:41:18] UCB=0.8235 mu=0.6273 sigma=0.0981 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004078997997351073}
[2026-04-13 01:41:18] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003919406024189691, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:20] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.003919
[2026-04-13 01:41:29] [AutoResearch] Job finished in 8.2s, returncode=0
[2026-04-13 01:41:29] [AutoResearch] mean_reward=47.3461
[2026-04-13 01:41:29] [AutoResearch] === Trial 132 Summary ===
[2026-04-13 01:41:29] Total runs in history: 250
[2026-04-13 01:41:29] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:29] Top 5 results:
[2026-04-13 01:41:29] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:29] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:29] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:29] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:29] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:31]
[AutoResearch] ========== Trial 133/200 ==========
[2026-04-13 01:41:31] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:41:31] UCB=0.9718 mu=0.6757 sigma=0.1481 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 6.786958634023708e-05}
[2026-04-13 01:41:31] UCB=0.9491 mu=0.6966 sigma=0.1262 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.00367824609310371}
[2026-04-13 01:41:31] UCB=0.8355 mu=0.6175 sigma=0.1090 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0018626223290310545}
[2026-04-13 01:41:31] UCB=0.8207 mu=0.6146 sigma=0.1030 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.004964969280506222}
[2026-04-13 01:41:31] UCB=0.8111 mu=0.5277 sigma=0.1417 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.00013841828223153525}
[2026-04-13 01:41:31] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 4, 'learning_rate': 6.786958634023708e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:33] [AutoResearch] Launching job: n_steer=5 n_throttle=4 lr=0.000068
[2026-04-13 01:41:42] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:41:42] [AutoResearch] mean_reward=75.5195
[2026-04-13 01:41:42] [AutoResearch] === Trial 133 Summary ===
[2026-04-13 01:41:42] Total runs in history: 251
[2026-04-13 01:41:42] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:42] Top 5 results:
[2026-04-13 01:41:42] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:42] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:42] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:42] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:42] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:44]
[AutoResearch] ========== Trial 134/200 ==========
[2026-04-13 01:41:44] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:41:44] UCB=0.9579 mu=0.6681 sigma=0.1449 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003702506356856968}
[2026-04-13 01:41:44] UCB=0.9521 mu=0.5925 sigma=0.1798 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003726294765067452}
[2026-04-13 01:41:44] UCB=0.8973 mu=0.6701 sigma=0.1136 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012204123942446716}
[2026-04-13 01:41:44] UCB=0.8563 mu=0.5364 sigma=0.1600 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.003724959448275549}
[2026-04-13 01:41:44] UCB=0.8543 mu=0.5296 sigma=0.1624 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037214937510614255}
[2026-04-13 01:41:44] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003702506356856968, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:46] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.003703
[2026-04-13 01:41:54] [AutoResearch] Job finished in 7.9s, returncode=0
[2026-04-13 01:41:54] [AutoResearch] mean_reward=50.4312
[2026-04-13 01:41:54] [AutoResearch] === Trial 134 Summary ===
[2026-04-13 01:41:54] Total runs in history: 252
[2026-04-13 01:41:54] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:54] Top 5 results:
[2026-04-13 01:41:54] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:54] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:54] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:54] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:54] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:56]
[AutoResearch] ========== Trial 135/200 ==========
[2026-04-13 01:41:56] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:41:56] UCB=1.0164 mu=0.6705 sigma=0.1730 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0018090429692356102}
[2026-04-13 01:41:56] UCB=0.8897 mu=0.5218 sigma=0.1839 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0040194693533968946}
[2026-04-13 01:41:56] UCB=0.8783 mu=0.6569 sigma=0.1107 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0009935095619630191}
[2026-04-13 01:41:56] UCB=0.8642 mu=0.5696 sigma=0.1473 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 8.090367605016716e-05}
[2026-04-13 01:41:56] UCB=0.8584 mu=0.5165 sigma=0.1710 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0035390113345405852}
[2026-04-13 01:41:56] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0018090429692356102, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:41:58] [AutoResearch] Launching job: n_steer=8 n_throttle=5 lr=0.001809
[2026-04-13 01:42:06] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:42:06] [AutoResearch] mean_reward=59.8302
[2026-04-13 01:42:06] [AutoResearch] === Trial 135 Summary ===
[2026-04-13 01:42:06] Total runs in history: 253
[2026-04-13 01:42:06] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:06] Top 5 results:
[2026-04-13 01:42:06] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:06] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:06] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:06] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:06] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:08]
[AutoResearch] ========== Trial 136/200 ==========
[2026-04-13 01:42:09] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:42:09] UCB=0.9693 mu=0.6334 sigma=0.1680 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0007838229083152588}
[2026-04-13 01:42:09] UCB=0.9090 mu=0.5900 sigma=0.1595 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0037001090210330855}
[2026-04-13 01:42:09] UCB=0.9058 mu=0.5314 sigma=0.1872 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003580745319343262}
[2026-04-13 01:42:09] UCB=0.9034 mu=0.7592 sigma=0.0721 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0013285697696221932}
[2026-04-13 01:42:09] UCB=0.8505 mu=0.5082 sigma=0.1712 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.003498077307904384}
[2026-04-13 01:42:09] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0007838229083152588, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:11] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.000784
[2026-04-13 01:42:19] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:42:19] [AutoResearch] mean_reward=74.5225
[2026-04-13 01:42:19] [AutoResearch] === Trial 136 Summary ===
[2026-04-13 01:42:19] Total runs in history: 254
[2026-04-13 01:42:19] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:19] Top 5 results:
[2026-04-13 01:42:19] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:19] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:19] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:19] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:19] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:21]
[AutoResearch] ========== Trial 137/200 ==========
[2026-04-13 01:42:22] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:42:22] UCB=0.9256 mu=0.6046 sigma=0.1605 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0009401742986201011}
[2026-04-13 01:42:22] UCB=0.8549 mu=0.6160 sigma=0.1194 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.00082315042636751}
[2026-04-13 01:42:22] UCB=0.8378 mu=0.4963 sigma=0.1708 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0037972968543128426}
[2026-04-13 01:42:22] UCB=0.8202 mu=0.4624 sigma=0.1789 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0037134179548849755}
[2026-04-13 01:42:22] UCB=0.8113 mu=0.5455 sigma=0.1329 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0006503794626082374}
[2026-04-13 01:42:22] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0009401742986201011, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:24] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.000940
[2026-04-13 01:42:32] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:42:32] [AutoResearch] mean_reward=55.0907
[2026-04-13 01:42:32] [AutoResearch] === Trial 137 Summary ===
[2026-04-13 01:42:32] Total runs in history: 255
[2026-04-13 01:42:32] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:32] Top 5 results:
[2026-04-13 01:42:32] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:32] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:32] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:32] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:32] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:34]
[AutoResearch] ========== Trial 138/200 ==========
[2026-04-13 01:42:34] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:42:34] UCB=1.0545 mu=0.6781 sigma=0.1882 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0037210086121528133}
[2026-04-13 01:42:34] UCB=1.0144 mu=0.6706 sigma=0.1719 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020537610015480233}
[2026-04-13 01:42:34] UCB=0.7777 mu=0.6903 sigma=0.0437 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003914016502755188}
[2026-04-13 01:42:34] UCB=0.7593 mu=0.4290 sigma=0.1651 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0008466232736711985}
[2026-04-13 01:42:34] UCB=0.7588 mu=0.4948 sigma=0.1320 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0006918480692170758}
[2026-04-13 01:42:34] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0037210086121528133, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:36] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.003721
[2026-04-13 01:42:45] [AutoResearch] Job finished in 8.9s, returncode=0
[2026-04-13 01:42:45] [AutoResearch] mean_reward=77.7193
[2026-04-13 01:42:45] [AutoResearch] === Trial 138 Summary ===
[2026-04-13 01:42:45] Total runs in history: 256
[2026-04-13 01:42:45] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:45] Top 5 results:
[2026-04-13 01:42:45] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:45] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:45] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:45] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:45] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:47]
[AutoResearch] ========== Trial 139/200 ==========
[2026-04-13 01:42:47] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:42:47] UCB=0.9779 mu=0.6370 sigma=0.1704 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.000874825266158358}
[2026-04-13 01:42:47] UCB=0.8643 mu=0.5498 sigma=0.1573 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0006681114320921753}
[2026-04-13 01:42:47] UCB=0.8618 mu=0.5260 sigma=0.1679 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0038300182159274835}
[2026-04-13 01:42:47] UCB=0.8403 mu=0.4648 sigma=0.1877 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003732144284898933}
[2026-04-13 01:42:47] UCB=0.8384 mu=0.4719 sigma=0.1833 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0036678906554996464}
[2026-04-13 01:42:47] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.000874825266158358, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:49] [AutoResearch] Launching job: n_steer=5 n_throttle=3 lr=0.000875
[2026-04-13 01:42:58] [AutoResearch] Job finished in 8.0s, returncode=0
[2026-04-13 01:42:58] [AutoResearch] mean_reward=38.6928
[2026-04-13 01:42:58] [AutoResearch] === Trial 139 Summary ===
[2026-04-13 01:42:58] Total runs in history: 257
[2026-04-13 01:42:58] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:58] Top 5 results:
[2026-04-13 01:42:58] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:58] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:58] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:58] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:42:58] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:00]
[AutoResearch] ========== Trial 140/200 ==========
[2026-04-13 01:43:00] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:43:00] UCB=1.0910 mu=0.7397 sigma=0.1756 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004028509063201252}
[2026-04-13 01:43:00] UCB=0.9617 mu=0.6731 sigma=0.1443 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002138508200249192}
[2026-04-13 01:43:00] UCB=0.8695 mu=0.5549 sigma=0.1573 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.003567842996973346}
[2026-04-13 01:43:00] UCB=0.8474 mu=0.7410 sigma=0.0532 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0013990646635575876}
[2026-04-13 01:43:00] UCB=0.8377 mu=0.6570 sigma=0.0904 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.0049718250744616825}
[2026-04-13 01:43:00] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004028509063201252, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:02] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.004029
[2026-04-13 01:43:11] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:43:11] [AutoResearch] mean_reward=81.2028
[2026-04-13 01:43:11] [AutoResearch] === Trial 140 Summary ===
[2026-04-13 01:43:11] Total runs in history: 258
[2026-04-13 01:43:11] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:11] Top 5 results:
[2026-04-13 01:43:11] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:11] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:11] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:11] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:11] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:13]
[AutoResearch] ========== Trial 141/200 ==========
[2026-04-13 01:43:13] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:43:13] UCB=1.0079 mu=0.6549 sigma=0.1765 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003663582929800669}
[2026-04-13 01:43:13] UCB=0.9692 mu=0.6551 sigma=0.1570 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003765873392871993}
[2026-04-13 01:43:13] UCB=0.9683 mu=0.6125 sigma=0.1779 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0036068764270613144}
[2026-04-13 01:43:13] UCB=0.9581 mu=0.8655 sigma=0.0463 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00041616538090230197}
[2026-04-13 01:43:13] UCB=0.8750 mu=0.5381 sigma=0.1685 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003796702716938051}
[2026-04-13 01:43:13] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003663582929800669, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:15] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.003664
[2026-04-13 01:43:23] [AutoResearch] Job finished in 8.0s, returncode=0
[2026-04-13 01:43:23] [AutoResearch] mean_reward=43.4001
[2026-04-13 01:43:23] [AutoResearch] === Trial 141 Summary ===
[2026-04-13 01:43:23] Total runs in history: 259
[2026-04-13 01:43:23] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:23] Top 5 results:
[2026-04-13 01:43:23] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:23] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:23] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:23] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:23] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:25]
[AutoResearch] ========== Trial 142/200 ==========
[2026-04-13 01:43:25] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:43:25] UCB=1.1708 mu=0.8418 sigma=0.1645 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0038149691408171965}
[2026-04-13 01:43:25] UCB=0.8378 mu=0.4982 sigma=0.1698 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0034945820601627697}
[2026-04-13 01:43:25] UCB=0.8285 mu=0.4898 sigma=0.1693 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003427405328408437}
[2026-04-13 01:43:25] UCB=0.7931 mu=0.4496 sigma=0.1717 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003786985906956135}
[2026-04-13 01:43:25] UCB=0.7909 mu=0.5510 sigma=0.1200 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0008159939786992965}
[2026-04-13 01:43:25] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0038149691408171965, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:27] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.003815
[2026-04-13 01:43:35] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:43:35] [AutoResearch] mean_reward=77.5819
[2026-04-13 01:43:35] [AutoResearch] === Trial 142 Summary ===
[2026-04-13 01:43:35] Total runs in history: 260
[2026-04-13 01:43:35] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:35] Top 5 results:
[2026-04-13 01:43:35] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:35] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:35] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:35] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:35] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:38]
[AutoResearch] ========== Trial 143/200 ==========
[2026-04-13 01:43:38] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:43:38] UCB=0.8979 mu=0.5569 sigma=0.1705 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036009030607319905}
[2026-04-13 01:43:38] UCB=0.7981 mu=0.4659 sigma=0.1661 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001956759362826256}
[2026-04-13 01:43:38] UCB=0.7754 mu=0.4432 sigma=0.1661 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.0028645671916918266}
[2026-04-13 01:43:38] UCB=0.7579 mu=0.6155 sigma=0.0712 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0006337158706190145}
[2026-04-13 01:43:38] UCB=0.7443 mu=0.4757 sigma=0.1343 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.003422220382080945}
[2026-04-13 01:43:38] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036009030607319905, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:40] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003601
[2026-04-13 01:43:49] [AutoResearch] Job finished in 8.9s, returncode=0
[2026-04-13 01:43:49] [AutoResearch] mean_reward=97.7013
[2026-04-13 01:43:49] [AutoResearch] === Trial 143 Summary ===
[2026-04-13 01:43:49] Total runs in history: 261
[2026-04-13 01:43:49] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:49] Top 5 results:
[2026-04-13 01:43:49] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:49] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:49] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:49] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:49] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:51]
[AutoResearch] ========== Trial 144/200 ==========
[2026-04-13 01:43:51] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:43:51] UCB=1.0033 mu=0.8188 sigma=0.0923 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035481519078291797}
[2026-04-13 01:43:51] UCB=0.9764 mu=0.8093 sigma=0.0836 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035023720505701784}
[2026-04-13 01:43:51] UCB=0.9383 mu=0.7405 sigma=0.0989 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0033247219903769303}
[2026-04-13 01:43:51] UCB=0.8596 mu=0.7360 sigma=0.0618 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0033631011073772853}
[2026-04-13 01:43:51] UCB=0.8561 mu=0.7252 sigma=0.0655 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.003797729948885283}
[2026-04-13 01:43:51] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035481519078291797, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:43:53] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003548
[2026-04-13 01:44:01] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:44:01] [AutoResearch] mean_reward=79.2693
[2026-04-13 01:44:01] [AutoResearch] === Trial 144 Summary ===
[2026-04-13 01:44:01] Total runs in history: 262
[2026-04-13 01:44:01] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:01] Top 5 results:
[2026-04-13 01:44:01] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:01] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:01] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:01] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:01] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:03]
[AutoResearch] ========== Trial 145/200 ==========
[2026-04-13 01:44:03] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:44:03] UCB=1.0088 mu=0.6727 sigma=0.1681 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003552699900963468}
[2026-04-13 01:44:03] UCB=0.9543 mu=0.6737 sigma=0.1403 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036489741799300424}
[2026-04-13 01:44:03] UCB=0.9484 mu=0.7512 sigma=0.0986 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.00368232066077994}
[2026-04-13 01:44:03] UCB=0.9358 mu=0.6116 sigma=0.1621 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0033820363903321778}
[2026-04-13 01:44:03] UCB=0.8933 mu=0.5538 sigma=0.1698 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003429479318975156}
[2026-04-13 01:44:03] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003552699900963468, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:05] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003553
[2026-04-13 01:44:14] [AutoResearch] Job finished in 8.2s, returncode=0
[2026-04-13 01:44:14] [AutoResearch] mean_reward=52.7759
[2026-04-13 01:44:14] [AutoResearch] === Trial 145 Summary ===
[2026-04-13 01:44:14] Total runs in history: 263
[2026-04-13 01:44:14] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:14] Top 5 results:
[2026-04-13 01:44:14] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:14] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:14] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:14] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:14] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:16]
[AutoResearch] ========== Trial 146/200 ==========
[2026-04-13 01:44:16] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:44:16] UCB=1.0540 mu=0.7527 sigma=0.1507 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004960473592165943}
[2026-04-13 01:44:16] UCB=0.9803 mu=0.6625 sigma=0.1589 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001970154046319278}
[2026-04-13 01:44:16] UCB=0.9289 mu=0.5752 sigma=0.1769 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0037856547529217127}
[2026-04-13 01:44:16] UCB=0.8331 mu=0.5348 sigma=0.1492 params={'n_steer': 6, 'n_throttle': 4, 'learning_rate': 0.00010268493069901359}
[2026-04-13 01:44:16] UCB=0.8232 mu=0.4940 sigma=0.1646 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035070323785793647}
[2026-04-13 01:44:16] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.004960473592165943, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:18] [AutoResearch] Launching job: n_steer=3 n_throttle=3 lr=0.004960
[2026-04-13 01:44:27] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:44:27] [AutoResearch] mean_reward=77.6223
[2026-04-13 01:44:27] [AutoResearch] === Trial 146 Summary ===
[2026-04-13 01:44:27] Total runs in history: 264
[2026-04-13 01:44:27] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:27] Top 5 results:
[2026-04-13 01:44:27] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:27] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:27] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:27] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:27] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:29]
[AutoResearch] ========== Trial 147/200 ==========
[2026-04-13 01:44:29] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:44:29] UCB=1.0743 mu=0.7529 sigma=0.1607 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004005256499040783}
[2026-04-13 01:44:29] UCB=0.7949 mu=0.6641 sigma=0.0654 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012136689883710733}
[2026-04-13 01:44:29] UCB=0.7593 mu=0.4700 sigma=0.1446 params={'n_steer': 6, 'n_throttle': 4, 'learning_rate': 0.00012713322669290127}
[2026-04-13 01:44:29] UCB=0.7509 mu=0.4107 sigma=0.1701 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.001954799602334939}
[2026-04-13 01:44:29] UCB=0.7269 mu=0.3808 sigma=0.1731 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012127879157741764}
[2026-04-13 01:44:29] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004005256499040783, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:31] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.004005
[2026-04-13 01:44:39] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:44:39] [AutoResearch] mean_reward=78.1393
[2026-04-13 01:44:39] [AutoResearch] === Trial 147 Summary ===
[2026-04-13 01:44:39] Total runs in history: 265
[2026-04-13 01:44:39] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:39] Top 5 results:
[2026-04-13 01:44:39] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:39] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:39] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:39] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:39] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:41]
[AutoResearch] ========== Trial 148/200 ==========
[2026-04-13 01:44:42] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:44:42] UCB=1.0955 mu=0.7357 sigma=0.1799 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003965811222343017}
[2026-04-13 01:44:42] UCB=1.0343 mu=0.8650 sigma=0.0847 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.004999185748739202}
[2026-04-13 01:44:42] UCB=0.8775 mu=0.5333 sigma=0.1721 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0017434302848358184}
[2026-04-13 01:44:42] UCB=0.7967 mu=0.5154 sigma=0.1406 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0007708670783818601}
[2026-04-13 01:44:42] UCB=0.7850 mu=0.5086 sigma=0.1382 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.001823316164969434}
[2026-04-13 01:44:42] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003965811222343017, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:44] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.003966
[2026-04-13 01:44:52] [AutoResearch] Job finished in 8.2s, returncode=0
[2026-04-13 01:44:52] [AutoResearch] mean_reward=55.2785
[2026-04-13 01:44:52] [AutoResearch] === Trial 148 Summary ===
[2026-04-13 01:44:52] Total runs in history: 266
[2026-04-13 01:44:52] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:52] Top 5 results:
[2026-04-13 01:44:52] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:52] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:52] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:52] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:52] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:54]
[AutoResearch] ========== Trial 149/200 ==========
[2026-04-13 01:44:54] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:44:54] UCB=0.8915 mu=0.5690 sigma=0.1613 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0037585379193959233}
[2026-04-13 01:44:54] UCB=0.8379 mu=0.5093 sigma=0.1643 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0010194047078467302}
[2026-04-13 01:44:54] UCB=0.8368 mu=0.6418 sigma=0.0975 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004164198392078201}
[2026-04-13 01:44:54] UCB=0.8184 mu=0.4766 sigma=0.1709 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0035247727190728688}
[2026-04-13 01:44:54] UCB=0.8049 mu=0.5883 sigma=0.1083 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.000740098205123042}
[2026-04-13 01:44:54] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0037585379193959233, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:44:56] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.003759
[2026-04-13 01:45:04] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:45:04] [AutoResearch] mean_reward=58.3853
[2026-04-13 01:45:04] [AutoResearch] === Trial 149 Summary ===
[2026-04-13 01:45:04] Total runs in history: 267
[2026-04-13 01:45:04] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:04] Top 5 results:
[2026-04-13 01:45:04] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:04] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:04] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:04] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:04] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:06]
[AutoResearch] ========== Trial 150/200 ==========
[2026-04-13 01:45:06] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:45:06] UCB=0.9276 mu=0.5867 sigma=0.1704 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0019772829129634257}
[2026-04-13 01:45:06] UCB=0.9020 mu=0.5540 sigma=0.1740 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.004008786354733741}
[2026-04-13 01:45:06] UCB=0.8361 mu=0.5076 sigma=0.1642 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0038067684509739378}
[2026-04-13 01:45:06] UCB=0.7662 mu=0.5665 sigma=0.0999 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.00012338655995693264}
[2026-04-13 01:45:06] UCB=0.7625 mu=0.4722 sigma=0.1451 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0017112047257719027}
[2026-04-13 01:45:06] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0019772829129634257, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:08] [AutoResearch] Launching job: n_steer=7 n_throttle=4 lr=0.001977
[2026-04-13 01:45:17] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:45:17] [AutoResearch] mean_reward=57.3676
[2026-04-13 01:45:17] [AutoResearch] === Trial 150 Summary ===
[2026-04-13 01:45:17] Total runs in history: 268
[2026-04-13 01:45:17] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:17] Top 5 results:
[2026-04-13 01:45:17] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:17] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:17] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:17] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:17] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:19]
[AutoResearch] ========== Trial 151/200 ==========
[2026-04-13 01:45:19] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:45:19] UCB=0.9430 mu=0.6436 sigma=0.1497 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0019098436167027417}
[2026-04-13 01:45:19] UCB=0.9282 mu=0.7119 sigma=0.1082 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0038616125596556993}
[2026-04-13 01:45:19] UCB=0.8536 mu=0.5305 sigma=0.1615 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.00369615512677948}
[2026-04-13 01:45:19] UCB=0.8386 mu=0.5703 sigma=0.1342 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0034986394086115873}
[2026-04-13 01:45:19] UCB=0.8293 mu=0.4877 sigma=0.1708 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.003516854134952913}
[2026-04-13 01:45:19] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0019098436167027417, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:21] [AutoResearch] Launching job: n_steer=7 n_throttle=4 lr=0.001910
[2026-04-13 01:45:29] [AutoResearch] Job finished in 8.0s, returncode=0
[2026-04-13 01:45:29] [AutoResearch] mean_reward=46.2365
[2026-04-13 01:45:29] [AutoResearch] === Trial 151 Summary ===
[2026-04-13 01:45:29] Total runs in history: 269
[2026-04-13 01:45:29] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:29] Top 5 results:
[2026-04-13 01:45:29] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:29] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:29] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:29] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:29] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:31]
[AutoResearch] ========== Trial 152/200 ==========
[2026-04-13 01:45:31] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:45:31] UCB=1.0423 mu=0.7235 sigma=0.1594 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0038485374729066227}
[2026-04-13 01:45:31] UCB=0.8573 mu=0.5243 sigma=0.1665 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.0049704837326351855}
[2026-04-13 01:45:31] UCB=0.8517 mu=0.5751 sigma=0.1383 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.001858059267059204}
[2026-04-13 01:45:31] UCB=0.8404 mu=0.5282 sigma=0.1561 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035935192399074944}
[2026-04-13 01:45:31] UCB=0.8129 mu=0.4813 sigma=0.1658 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.003763076525772448}
[2026-04-13 01:45:31] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0038485374729066227, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:33] [AutoResearch] Launching job: n_steer=5 n_throttle=3 lr=0.003849
[2026-04-13 01:45:42] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:45:42] [AutoResearch] mean_reward=49.256
[2026-04-13 01:45:42] [AutoResearch] === Trial 152 Summary ===
[2026-04-13 01:45:42] Total runs in history: 270
[2026-04-13 01:45:42] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:42] Top 5 results:
[2026-04-13 01:45:42] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:42] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:42] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:42] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:42] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:44]
[AutoResearch] ========== Trial 153/200 ==========
[2026-04-13 01:45:44] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:45:44] UCB=1.0414 mu=0.7253 sigma=0.1581 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0038015029551744226}
[2026-04-13 01:45:44] UCB=0.9898 mu=0.6450 sigma=0.1724 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0021963493327190158}
[2026-04-13 01:45:44] UCB=0.9756 mu=0.8026 sigma=0.0865 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 5.4542495046720746e-05}
[2026-04-13 01:45:44] UCB=0.8613 mu=0.5828 sigma=0.1393 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0037495678576254347}
[2026-04-13 01:45:44] UCB=0.8454 mu=0.5819 sigma=0.1317 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0012022788037046642}
[2026-04-13 01:45:44] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0038015029551744226, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:46] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.003802
[2026-04-13 01:45:54] [AutoResearch] Job finished in 8.2s, returncode=0
[2026-04-13 01:45:54] [AutoResearch] mean_reward=48.3437
[2026-04-13 01:45:54] [AutoResearch] === Trial 153 Summary ===
[2026-04-13 01:45:54] Total runs in history: 271
[2026-04-13 01:45:54] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:54] Top 5 results:
[2026-04-13 01:45:54] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:54] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:54] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:54] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:54] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:56]
[AutoResearch] ========== Trial 154/200 ==========
[2026-04-13 01:45:56] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:45:56] UCB=0.9973 mu=0.6257 sigma=0.1858 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0037856890259167415}
[2026-04-13 01:45:56] UCB=0.8688 mu=0.7403 sigma=0.0643 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0014536245335418439}
[2026-04-13 01:45:56] UCB=0.8608 mu=0.5276 sigma=0.1666 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004212621805290471}
[2026-04-13 01:45:56] UCB=0.8337 mu=0.6395 sigma=0.0971 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0011580008559038504}
[2026-04-13 01:45:56] UCB=0.8221 mu=0.7253 sigma=0.0484 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.001223375858125639}
[2026-04-13 01:45:56] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0037856890259167415, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:45:58] [AutoResearch] Launching job: n_steer=4 n_throttle=2 lr=0.003786
[2026-04-13 01:46:06] [AutoResearch] Job finished in 7.7s, returncode=0
[2026-04-13 01:46:06] [AutoResearch] mean_reward=34.8139
[2026-04-13 01:46:06] [AutoResearch] === Trial 154 Summary ===
[2026-04-13 01:46:06] Total runs in history: 272
[2026-04-13 01:46:06] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:06] Top 5 results:
[2026-04-13 01:46:06] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:06] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:06] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:06] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:06] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:08]
[AutoResearch] ========== Trial 155/200 ==========
[2026-04-13 01:46:08] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:46:08] UCB=0.8532 mu=0.5577 sigma=0.1478 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035819883277487504}
[2026-04-13 01:46:08] UCB=0.8259 mu=0.5349 sigma=0.1455 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035146865214253434}
[2026-04-13 01:46:08] UCB=0.8222 mu=0.4930 sigma=0.1646 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0014120881240376056}
[2026-04-13 01:46:08] UCB=0.8056 mu=0.5139 sigma=0.1459 params={'n_steer': 6, 'n_throttle': 4, 'learning_rate': 0.00010650068227422561}
[2026-04-13 01:46:08] UCB=0.7969 mu=0.5098 sigma=0.1435 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 0.00015878718864780094}
[2026-04-13 01:46:08] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035819883277487504, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:10] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003582
[2026-04-13 01:46:19] [AutoResearch] Job finished in 8.9s, returncode=0
[2026-04-13 01:46:19] [AutoResearch] mean_reward=91.9871
[2026-04-13 01:46:19] [AutoResearch] === Trial 155 Summary ===
[2026-04-13 01:46:19] Total runs in history: 273
[2026-04-13 01:46:19] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:19] Top 5 results:
[2026-04-13 01:46:19] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:19] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:19] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:19] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:19] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:21]
[AutoResearch] ========== Trial 156/200 ==========
[2026-04-13 01:46:21] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:46:21] UCB=1.0824 mu=0.7242 sigma=0.1791 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003954456510628469}
[2026-04-13 01:46:21] UCB=1.0035 mu=0.6584 sigma=0.1726 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0020306150991573524}
[2026-04-13 01:46:21] UCB=0.9647 mu=0.7146 sigma=0.1251 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.00399804294101836}
[2026-04-13 01:46:21] UCB=0.9157 mu=0.5480 sigma=0.1838 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0038452761381874852}
[2026-04-13 01:46:21] UCB=0.8901 mu=0.7781 sigma=0.0560 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.0005109776685317931}
[2026-04-13 01:46:21] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003954456510628469, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:23] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.003954
[2026-04-13 01:46:32] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:46:32] [AutoResearch] mean_reward=53.2133
[2026-04-13 01:46:32] [AutoResearch] === Trial 156 Summary ===
[2026-04-13 01:46:32] Total runs in history: 274
[2026-04-13 01:46:32] Best so far: mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:32] Top 5 results:
[2026-04-13 01:46:32] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:32] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:32] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:32] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:32] mean_reward=106.2747 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003537015910569086, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:34]
[AutoResearch] ========== Trial 157/200 ==========
[2026-04-13 01:46:34] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:46:34] UCB=0.9811 mu=0.6623 sigma=0.1594 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734}
[2026-04-13 01:46:34] UCB=0.9459 mu=0.7984 sigma=0.0737 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 6.083073699739592e-05}
[2026-04-13 01:46:34] UCB=0.9031 mu=0.5723 sigma=0.1654 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035019996788284605}
[2026-04-13 01:46:34] UCB=0.8323 mu=0.6028 sigma=0.1148 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0022468667787073657}
[2026-04-13 01:46:34] UCB=0.8220 mu=0.5037 sigma=0.1591 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.003831524404516114}
[2026-04-13 01:46:34] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:36] [AutoResearch] Launching job: n_steer=8 n_throttle=5 lr=0.002024
[2026-04-13 01:46:45] [AutoResearch] Job finished in 9.7s, returncode=0
[2026-04-13 01:46:45] [AutoResearch] mean_reward=141.8524
[2026-04-13 01:46:45] [AutoResearch] === Trial 157 Summary ===
[2026-04-13 01:46:45] Total runs in history: 275
[2026-04-13 01:46:45] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:45] Top 5 results:
[2026-04-13 01:46:45] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:45] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:45] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:45] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:45] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:47]
[AutoResearch] ========== Trial 158/200 ==========
[2026-04-13 01:46:47] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:46:47] UCB=0.8747 mu=0.5860 sigma=0.1444 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035318237798399182}
[2026-04-13 01:46:47] UCB=0.8691 mu=0.5293 sigma=0.1699 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0024638665227811178}
[2026-04-13 01:46:47] UCB=0.8535 mu=0.6221 sigma=0.1157 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0036093977212524777}
[2026-04-13 01:46:47] UCB=0.8203 mu=0.7063 sigma=0.0570 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.0004904202362515397}
[2026-04-13 01:46:47] UCB=0.8108 mu=0.5161 sigma=0.1474 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0010558647447061352}
[2026-04-13 01:46:47] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035318237798399182, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:49] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003532
[2026-04-13 01:46:58] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:46:58] [AutoResearch] mean_reward=67.8909
[2026-04-13 01:46:58] [AutoResearch] === Trial 158 Summary ===
[2026-04-13 01:46:58] Total runs in history: 276
[2026-04-13 01:46:58] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:58] Top 5 results:
[2026-04-13 01:46:58] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:58] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:58] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:58] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:46:58] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:00]
[AutoResearch] ========== Trial 159/200 ==========
[2026-04-13 01:47:00] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:47:00] UCB=0.9906 mu=0.6500 sigma=0.1703 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020941821717380057}
[2026-04-13 01:47:00] UCB=0.9476 mu=0.6433 sigma=0.1522 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0040814201603721215}
[2026-04-13 01:47:00] UCB=0.8781 mu=0.5078 sigma=0.1852 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003761767023332683}
[2026-04-13 01:47:00] UCB=0.8469 mu=0.5455 sigma=0.1507 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003727963871927738}
[2026-04-13 01:47:00] UCB=0.8250 mu=0.6815 sigma=0.0717 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00116819352909946}
[2026-04-13 01:47:00] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020941821717380057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:02] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.002094
[2026-04-13 01:47:11] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:47:11] [AutoResearch] mean_reward=58.8493
[2026-04-13 01:47:11] [AutoResearch] === Trial 159 Summary ===
[2026-04-13 01:47:11] Total runs in history: 277
[2026-04-13 01:47:11] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:11] Top 5 results:
[2026-04-13 01:47:11] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:11] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:11] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:11] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:11] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:13]
[AutoResearch] ========== Trial 160/200 ==========
[2026-04-13 01:47:13] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:47:13] UCB=0.8702 mu=0.5265 sigma=0.1719 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.003649961947893182}
[2026-04-13 01:47:13] UCB=0.8632 mu=0.6520 sigma=0.1056 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0014405020095145686}
[2026-04-13 01:47:13] UCB=0.8442 mu=0.5125 sigma=0.1659 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.001019081344081621}
[2026-04-13 01:47:13] UCB=0.8115 mu=0.6937 sigma=0.0589 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0016076340411399909}
[2026-04-13 01:47:13] UCB=0.7915 mu=0.4593 sigma=0.1661 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.001173627247746085}
[2026-04-13 01:47:13] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.003649961947893182, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:15] [AutoResearch] Launching job: n_steer=7 n_throttle=2 lr=0.003650
[2026-04-13 01:47:23] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:47:23] [AutoResearch] mean_reward=66.5789
[2026-04-13 01:47:23] [AutoResearch] === Trial 160 Summary ===
[2026-04-13 01:47:23] Total runs in history: 278
[2026-04-13 01:47:23] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:23] Top 5 results:
[2026-04-13 01:47:23] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:23] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:23] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:23] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:23] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:25]
[AutoResearch] ========== Trial 161/200 ==========
[2026-04-13 01:47:25] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:47:25] UCB=0.9032 mu=0.5584 sigma=0.1724 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004085229861037777}
[2026-04-13 01:47:25] UCB=0.8141 mu=0.4919 sigma=0.1611 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0011283424913702263}
[2026-04-13 01:47:25] UCB=0.7965 mu=0.4763 sigma=0.1601 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.001054170647178148}
[2026-04-13 01:47:25] UCB=0.7704 mu=0.4437 sigma=0.1634 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.00369077920598478}
[2026-04-13 01:47:25] UCB=0.7406 mu=0.6596 sigma=0.0405 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0015692558386461739}
[2026-04-13 01:47:25] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004085229861037777, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:27] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.004085
[2026-04-13 01:47:35] [AutoResearch] Job finished in 8.0s, returncode=0
[2026-04-13 01:47:35] [AutoResearch] mean_reward=44.9587
[2026-04-13 01:47:35] [AutoResearch] === Trial 161 Summary ===
[2026-04-13 01:47:35] Total runs in history: 279
[2026-04-13 01:47:35] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:35] Top 5 results:
[2026-04-13 01:47:35] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:35] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:35] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:35] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:35] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:37]
[AutoResearch] ========== Trial 162/200 ==========
[2026-04-13 01:47:37] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:47:37] UCB=0.9745 mu=0.6125 sigma=0.1810 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0038353861625994375}
[2026-04-13 01:47:37] UCB=0.9558 mu=0.5916 sigma=0.1821 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.0038601145913871077}
[2026-04-13 01:47:37] UCB=0.7998 mu=0.5347 sigma=0.1326 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036224833158078167}
[2026-04-13 01:47:37] UCB=0.7819 mu=0.4606 sigma=0.1606 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0038228517499287813}
[2026-04-13 01:47:37] UCB=0.7743 mu=0.4556 sigma=0.1594 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.001173383115073788}
[2026-04-13 01:47:37] [AutoResearch] Proposed params: {'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0038353861625994375, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:39] [AutoResearch] Launching job: n_steer=4 n_throttle=3 lr=0.003835
[2026-04-13 01:47:48] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:47:48] [AutoResearch] mean_reward=55.6142
[2026-04-13 01:47:48] [AutoResearch] === Trial 162 Summary ===
[2026-04-13 01:47:48] Total runs in history: 280
[2026-04-13 01:47:48] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:48] Top 5 results:
[2026-04-13 01:47:48] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:48] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:48] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:48] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:48] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:50]
[AutoResearch] ========== Trial 163/200 ==========
[2026-04-13 01:47:50] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:47:50] UCB=0.9313 mu=0.5969 sigma=0.1672 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0022059815087590168}
[2026-04-13 01:47:50] UCB=0.8921 mu=0.6131 sigma=0.1395 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.002061513719951825}
[2026-04-13 01:47:50] UCB=0.8516 mu=0.4938 sigma=0.1789 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.00399826668589754}
[2026-04-13 01:47:50] UCB=0.8418 mu=0.4984 sigma=0.1717 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0035819575586993934}
[2026-04-13 01:47:50] UCB=0.8108 mu=0.4672 sigma=0.1718 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.003736254638999268}
[2026-04-13 01:47:50] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0022059815087590168, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:47:52] [AutoResearch] Launching job: n_steer=7 n_throttle=5 lr=0.002206
[2026-04-13 01:48:00] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:48:00] [AutoResearch] mean_reward=54.5912
[2026-04-13 01:48:00] [AutoResearch] === Trial 163 Summary ===
[2026-04-13 01:48:00] Total runs in history: 281
[2026-04-13 01:48:00] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:00] Top 5 results:
[2026-04-13 01:48:00] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:00] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:00] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:00] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:00] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:02]
[AutoResearch] ========== Trial 164/200 ==========
[2026-04-13 01:48:02] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:48:02] UCB=0.8714 mu=0.5653 sigma=0.1531 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036716104969230414}
[2026-04-13 01:48:02] UCB=0.8028 mu=0.5580 sigma=0.1224 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.001130037980832555}
[2026-04-13 01:48:02] UCB=0.7995 mu=0.4979 sigma=0.1508 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0038857454545401264}
[2026-04-13 01:48:02] UCB=0.7954 mu=0.4388 sigma=0.1783 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003718589526779072}
[2026-04-13 01:48:02] UCB=0.7840 mu=0.4249 sigma=0.1796 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.004015861239120415}
[2026-04-13 01:48:02] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036716104969230414, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:05] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003672
[2026-04-13 01:48:14] [AutoResearch] Job finished in 9.1s, returncode=0
[2026-04-13 01:48:14] [AutoResearch] mean_reward=92.892
[2026-04-13 01:48:14] [AutoResearch] === Trial 164 Summary ===
[2026-04-13 01:48:14] Total runs in history: 282
[2026-04-13 01:48:14] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:14] Top 5 results:
[2026-04-13 01:48:14] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:14] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:14] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:14] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:14] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:16]
[AutoResearch] ========== Trial 165/200 ==========
[2026-04-13 01:48:16] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:48:16] UCB=1.0187 mu=0.6853 sigma=0.1667 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0018064370937087909}
[2026-04-13 01:48:16] UCB=0.8939 mu=0.5811 sigma=0.1564 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002314721167117874}
[2026-04-13 01:48:16] UCB=0.8912 mu=0.5891 sigma=0.1510 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.002196967291501117}
[2026-04-13 01:48:16] UCB=0.8866 mu=0.6285 sigma=0.1290 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.00375375305407539}
[2026-04-13 01:48:16] UCB=0.8418 mu=0.4978 sigma=0.1720 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.003920276268348746}
[2026-04-13 01:48:16] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0018064370937087909, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:18] [AutoResearch] Launching job: n_steer=8 n_throttle=5 lr=0.001806
[2026-04-13 01:48:26] [AutoResearch] Job finished in 8.1s, returncode=0
[2026-04-13 01:48:26] [AutoResearch] mean_reward=50.3114
[2026-04-13 01:48:26] [AutoResearch] === Trial 165 Summary ===
[2026-04-13 01:48:26] Total runs in history: 283
[2026-04-13 01:48:26] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:26] Top 5 results:
[2026-04-13 01:48:26] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:26] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:26] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:26] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:26] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:28]
[AutoResearch] ========== Trial 166/200 ==========
[2026-04-13 01:48:28] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:48:28] UCB=0.8730 mu=0.6360 sigma=0.1185 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.001224508662447414}
[2026-04-13 01:48:28] UCB=0.8578 mu=0.5144 sigma=0.1717 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.000860552409247052}
[2026-04-13 01:48:28] UCB=0.8306 mu=0.5588 sigma=0.1359 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0034831338146445795}
[2026-04-13 01:48:28] UCB=0.8277 mu=0.4851 sigma=0.1713 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.002254261015907784}
[2026-04-13 01:48:28] UCB=0.7904 mu=0.4334 sigma=0.1785 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.001976477057124745}
[2026-04-13 01:48:28] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.001224508662447414, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:30] [AutoResearch] Launching job: n_steer=8 n_throttle=2 lr=0.001225
[2026-04-13 01:48:39] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:48:39] [AutoResearch] mean_reward=64.1748
[2026-04-13 01:48:39] [AutoResearch] === Trial 166 Summary ===
[2026-04-13 01:48:39] Total runs in history: 284
[2026-04-13 01:48:39] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:39] Top 5 results:
[2026-04-13 01:48:39] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:39] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:39] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:39] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:39] mean_reward=106.8657 params={'n_steer': 3, 'n_throttle': 2, 'learning_rate': 0.004941536515712236, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:41]
[AutoResearch] ========== Trial 167/200 ==========
[2026-04-13 01:48:41] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:48:41] UCB=0.9399 mu=0.6123 sigma=0.1638 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827}
[2026-04-13 01:48:41] UCB=0.8855 mu=0.5596 sigma=0.1629 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0009548249782238416}
[2026-04-13 01:48:41] UCB=0.8114 mu=0.5193 sigma=0.1460 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036666499329695264}
[2026-04-13 01:48:41] UCB=0.8107 mu=0.6219 sigma=0.0944 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037176189964106213}
[2026-04-13 01:48:41] UCB=0.8087 mu=0.4705 sigma=0.1691 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0039791775533499826}
[2026-04-13 01:48:41] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:43] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003644
[2026-04-13 01:48:53] [AutoResearch] Job finished in 9.6s, returncode=0
[2026-04-13 01:48:53] [AutoResearch] mean_reward=117.3069
[2026-04-13 01:48:53] [AutoResearch] === Trial 167 Summary ===
[2026-04-13 01:48:53] Total runs in history: 285
[2026-04-13 01:48:53] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:53] Top 5 results:
[2026-04-13 01:48:53] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:53] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:53] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:53] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:53] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:55]
[AutoResearch] ========== Trial 168/200 ==========
[2026-04-13 01:48:55] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:48:55] UCB=1.0394 mu=0.6949 sigma=0.1722 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020521993454920237}
[2026-04-13 01:48:55] UCB=0.9812 mu=0.6489 sigma=0.1661 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020564328969684817}
[2026-04-13 01:48:55] UCB=0.8545 mu=0.5066 sigma=0.1739 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004081342057018184}
[2026-04-13 01:48:55] UCB=0.8470 mu=0.5799 sigma=0.1335 params={'n_steer': 4, 'n_throttle': 4, 'learning_rate': 5.0930975641172e-05}
[2026-04-13 01:48:55] UCB=0.8183 mu=0.7021 sigma=0.0581 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003912560376450068}
[2026-04-13 01:48:55] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020521993454920237, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:48:57] [AutoResearch] Launching job: n_steer=8 n_throttle=5 lr=0.002052
[2026-04-13 01:49:05] [AutoResearch] Job finished in 8.2s, returncode=0
[2026-04-13 01:49:05] [AutoResearch] mean_reward=58.0071
[2026-04-13 01:49:05] [AutoResearch] === Trial 168 Summary ===
[2026-04-13 01:49:05] Total runs in history: 286
[2026-04-13 01:49:05] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:05] Top 5 results:
[2026-04-13 01:49:05] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:05] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:05] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:05] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:05] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:07]
[AutoResearch] ========== Trial 169/200 ==========
[2026-04-13 01:49:07] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:49:07] UCB=0.8545 mu=0.5174 sigma=0.1686 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.002198297812268199}
[2026-04-13 01:49:07] UCB=0.7961 mu=0.4613 sigma=0.1674 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0024136263084211843}
[2026-04-13 01:49:07] UCB=0.7517 mu=0.6400 sigma=0.0559 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0006320436444330081}
[2026-04-13 01:49:07] UCB=0.7455 mu=0.6226 sigma=0.0615 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0038370269729647924}
[2026-04-13 01:49:07] UCB=0.7447 mu=0.4350 sigma=0.1549 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.0037504663215938983}
[2026-04-13 01:49:07] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.002198297812268199, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:09] [AutoResearch] Launching job: n_steer=7 n_throttle=4 lr=0.002198
[2026-04-13 01:49:18] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:49:18] [AutoResearch] mean_reward=74.3732
[2026-04-13 01:49:18] [AutoResearch] === Trial 169 Summary ===
[2026-04-13 01:49:18] Total runs in history: 287
[2026-04-13 01:49:18] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:18] Top 5 results:
[2026-04-13 01:49:18] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:18] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:18] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:18] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:18] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:20]
[AutoResearch] ========== Trial 170/200 ==========
[2026-04-13 01:49:20] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:49:20] UCB=0.9310 mu=0.7836 sigma=0.0737 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0034649641648052875}
[2026-04-13 01:49:20] UCB=0.8496 mu=0.4826 sigma=0.1835 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003900460467561357}
[2026-04-13 01:49:20] UCB=0.8413 mu=0.4977 sigma=0.1718 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.002389945162411186}
[2026-04-13 01:49:20] UCB=0.8360 mu=0.5102 sigma=0.1629 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0008769539772108456}
[2026-04-13 01:49:20] UCB=0.8151 mu=0.5343 sigma=0.1404 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0008787043637543843}
[2026-04-13 01:49:20] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0034649641648052875, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:22] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003465
[2026-04-13 01:49:31] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:49:31] [AutoResearch] mean_reward=71.8223
[2026-04-13 01:49:31] [AutoResearch] === Trial 170 Summary ===
[2026-04-13 01:49:31] Total runs in history: 288
[2026-04-13 01:49:31] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:31] Top 5 results:
[2026-04-13 01:49:31] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:31] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:31] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:31] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:31] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:33]
[AutoResearch] ========== Trial 171/200 ==========
[2026-04-13 01:49:33] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:49:33] UCB=1.0867 mu=0.7880 sigma=0.1494 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0038492682866230693}
[2026-04-13 01:49:33] UCB=0.8853 mu=0.5825 sigma=0.1514 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0021103040013165843}
[2026-04-13 01:49:33] UCB=0.8662 mu=0.5367 sigma=0.1648 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0009340195240954506}
[2026-04-13 01:49:33] UCB=0.8237 mu=0.4650 sigma=0.1794 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0039482936174207745}
[2026-04-13 01:49:33] UCB=0.8076 mu=0.5696 sigma=0.1190 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002026535474657009}
[2026-04-13 01:49:33] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0038492682866230693, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:35] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003849
[2026-04-13 01:49:44] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:49:44] [AutoResearch] mean_reward=76.6723
[2026-04-13 01:49:44] [AutoResearch] === Trial 171 Summary ===
[2026-04-13 01:49:44] Total runs in history: 289
[2026-04-13 01:49:44] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:44] Top 5 results:
[2026-04-13 01:49:44] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:44] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:44] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:44] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:44] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:46]
[AutoResearch] ========== Trial 172/200 ==========
[2026-04-13 01:49:46] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:49:46] UCB=0.8926 mu=0.5579 sigma=0.1673 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0019119517499715658}
[2026-04-13 01:49:46] UCB=0.8870 mu=0.5731 sigma=0.1569 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.003757338930937929}
[2026-04-13 01:49:46] UCB=0.8802 mu=0.5835 sigma=0.1483 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 7.834093406057774e-05}
[2026-04-13 01:49:46] UCB=0.8421 mu=0.4973 sigma=0.1724 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.004014537856838027}
[2026-04-13 01:49:46] UCB=0.8421 mu=0.7025 sigma=0.0698 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0034379324321722068}
[2026-04-13 01:49:46] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0019119517499715658, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:48] [AutoResearch] Launching job: n_steer=7 n_throttle=5 lr=0.001912
[2026-04-13 01:49:56] [AutoResearch] Job finished in 7.9s, returncode=0
[2026-04-13 01:49:56] [AutoResearch] mean_reward=44.18
[2026-04-13 01:49:56] [AutoResearch] === Trial 172 Summary ===
[2026-04-13 01:49:56] Total runs in history: 290
[2026-04-13 01:49:56] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:56] Top 5 results:
[2026-04-13 01:49:56] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:56] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:56] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:56] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:56] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:49:58]
[AutoResearch] ========== Trial 173/200 ==========
[2026-04-13 01:49:58] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:49:58] UCB=0.9274 mu=0.7944 sigma=0.0665 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003785959624358219}
[2026-04-13 01:49:58] UCB=0.9106 mu=0.7351 sigma=0.0878 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 6.860354732686536e-05}
[2026-04-13 01:49:58] UCB=0.8873 mu=0.5430 sigma=0.1721 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.003636004233969052}
[2026-04-13 01:49:58] UCB=0.8408 mu=0.4840 sigma=0.1784 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.0019926357693607418}
[2026-04-13 01:49:58] UCB=0.8351 mu=0.5549 sigma=0.1401 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0008376387726759843}
[2026-04-13 01:49:58] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003785959624358219, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:00] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003786
[2026-04-13 01:50:09] [AutoResearch] Job finished in 9.3s, returncode=0
[2026-04-13 01:50:09] [AutoResearch] mean_reward=87.5374
[2026-04-13 01:50:09] [AutoResearch] === Trial 173 Summary ===
[2026-04-13 01:50:09] Total runs in history: 291
[2026-04-13 01:50:09] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:09] Top 5 results:
[2026-04-13 01:50:09] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:09] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:09] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:09] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:09] mean_reward=114.5598 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0020783633254979773, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:11]
[AutoResearch] ========== Trial 174/200 ==========
[2026-04-13 01:50:11] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:50:11] UCB=1.0777 mu=0.7843 sigma=0.1467 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057}
[2026-04-13 01:50:11] UCB=1.0225 mu=0.8158 sigma=0.1034 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0038650066620055535}
[2026-04-13 01:50:11] UCB=0.8882 mu=0.5742 sigma=0.1570 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.003678532363494295}
[2026-04-13 01:50:11] UCB=0.8710 mu=0.6305 sigma=0.1203 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0034175447952622395}
[2026-04-13 01:50:11] UCB=0.8706 mu=0.5805 sigma=0.1450 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036578880878419795}
[2026-04-13 01:50:11] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:13] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003726
[2026-04-13 01:50:23] [AutoResearch] Job finished in 9.5s, returncode=0
[2026-04-13 01:50:23] [AutoResearch] mean_reward=120.0185
[2026-04-13 01:50:23] [AutoResearch] === Trial 174 Summary ===
[2026-04-13 01:50:23] Total runs in history: 292
[2026-04-13 01:50:23] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:23] Top 5 results:
[2026-04-13 01:50:23] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:23] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:23] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:23] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:23] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:25]
[AutoResearch] ========== Trial 175/200 ==========
[2026-04-13 01:50:25] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:50:25] UCB=1.1676 mu=1.0071 sigma=0.0803 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0038391196734918536}
[2026-04-13 01:50:25] UCB=0.9380 mu=0.8133 sigma=0.0623 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.00395442506062643}
[2026-04-13 01:50:25] UCB=0.9192 mu=0.5996 sigma=0.1598 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0034115819763085955}
[2026-04-13 01:50:25] UCB=0.9181 mu=0.6338 sigma=0.1421 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.002091759716422658}
[2026-04-13 01:50:25] UCB=0.8895 mu=0.5520 sigma=0.1688 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.003514532289284362}
[2026-04-13 01:50:25] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0038391196734918536, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:27] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003839
[2026-04-13 01:50:36] [AutoResearch] Job finished in 9.2s, returncode=0
[2026-04-13 01:50:36] [AutoResearch] mean_reward=100.686
[2026-04-13 01:50:36] [AutoResearch] === Trial 175 Summary ===
[2026-04-13 01:50:36] Total runs in history: 293
[2026-04-13 01:50:36] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:36] Top 5 results:
[2026-04-13 01:50:36] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:36] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:36] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:36] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:36] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:38]
[AutoResearch] ========== Trial 176/200 ==========
[2026-04-13 01:50:39] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:50:39] UCB=1.1888 mu=0.8763 sigma=0.1563 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.004025176404937744}
[2026-04-13 01:50:39] UCB=1.0788 mu=0.9548 sigma=0.0620 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036043890776644564}
[2026-04-13 01:50:39] UCB=1.0645 mu=0.8727 sigma=0.0959 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.004137254884963912}
[2026-04-13 01:50:39] UCB=1.0465 mu=0.9106 sigma=0.0680 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.004045267143934102}
[2026-04-13 01:50:39] UCB=0.9084 mu=0.6849 sigma=0.1118 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.003746648657153557}
[2026-04-13 01:50:39] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.004025176404937744, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:41] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.004025
[2026-04-13 01:50:49] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:50:49] [AutoResearch] mean_reward=44.1892
[2026-04-13 01:50:49] [AutoResearch] === Trial 176 Summary ===
[2026-04-13 01:50:49] Total runs in history: 294
[2026-04-13 01:50:49] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:49] Top 5 results:
[2026-04-13 01:50:49] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:49] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:49] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:49] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:49] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:51]
[AutoResearch] ========== Trial 177/200 ==========
[2026-04-13 01:50:51] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:50:51] UCB=1.0804 mu=0.9040 sigma=0.0882 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036437431438426705}
[2026-04-13 01:50:51] UCB=1.0256 mu=0.8042 sigma=0.1107 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003423525273105999}
[2026-04-13 01:50:51] UCB=0.9807 mu=0.6991 sigma=0.1408 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036861506948891723}
[2026-04-13 01:50:51] UCB=0.9564 mu=0.7510 sigma=0.1027 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003580260626675254}
[2026-04-13 01:50:51] UCB=0.9268 mu=0.7640 sigma=0.0814 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003549532764811867}
[2026-04-13 01:50:51] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036437431438426705, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:50:53] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003644
[2026-04-13 01:51:02] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:51:02] [AutoResearch] mean_reward=67.3735
[2026-04-13 01:51:02] [AutoResearch] === Trial 177 Summary ===
[2026-04-13 01:51:02] Total runs in history: 295
[2026-04-13 01:51:02] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:02] Top 5 results:
[2026-04-13 01:51:02] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:02] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:02] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:02] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:02] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:04]
[AutoResearch] ========== Trial 178/200 ==========
[2026-04-13 01:51:04] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:51:04] UCB=0.9542 mu=0.8494 sigma=0.0524 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00040478947425976174}
[2026-04-13 01:51:04] UCB=0.9129 mu=0.5795 sigma=0.1667 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003468370848144715}
[2026-04-13 01:51:04] UCB=0.9044 mu=0.6114 sigma=0.1465 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0033886458072858593}
[2026-04-13 01:51:04] UCB=0.9000 mu=0.5670 sigma=0.1665 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0034652716432347266}
[2026-04-13 01:51:04] UCB=0.8787 mu=0.5730 sigma=0.1529 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0037109603654534927}
[2026-04-13 01:51:04] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00040478947425976174, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:06] [AutoResearch] Launching job: n_steer=3 n_throttle=5 lr=0.000405
[2026-04-13 01:51:15] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:51:15] [AutoResearch] mean_reward=67.4358
[2026-04-13 01:51:15] [AutoResearch] === Trial 178 Summary ===
[2026-04-13 01:51:15] Total runs in history: 296
[2026-04-13 01:51:15] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:15] Top 5 results:
[2026-04-13 01:51:15] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:15] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:15] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:15] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:15] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:17]
[AutoResearch] ========== Trial 179/200 ==========
[2026-04-13 01:51:18] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:51:18] UCB=0.9227 mu=0.8397 sigma=0.0415 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037292824709774922}
[2026-04-13 01:51:18] UCB=0.9220 mu=0.7920 sigma=0.0650 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003761388102621209}
[2026-04-13 01:51:18] UCB=0.9144 mu=0.8162 sigma=0.0491 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035609322412469236}
[2026-04-13 01:51:18] UCB=0.8886 mu=0.5552 sigma=0.1667 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003352041735087034}
[2026-04-13 01:51:18] UCB=0.8287 mu=0.4963 sigma=0.1662 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0019486688991881937}
[2026-04-13 01:51:18] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037292824709774922, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:20] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003729
[2026-04-13 01:51:29] [AutoResearch] Job finished in 9.5s, returncode=0
[2026-04-13 01:51:29] [AutoResearch] mean_reward=85.085
[2026-04-13 01:51:29] [AutoResearch] === Trial 179 Summary ===
[2026-04-13 01:51:29] Total runs in history: 297
[2026-04-13 01:51:29] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:29] Top 5 results:
[2026-04-13 01:51:29] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:29] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:29] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:29] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:29] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:31]
[AutoResearch] ========== Trial 180/200 ==========
[2026-04-13 01:51:31] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:51:31] UCB=0.9333 mu=0.6243 sigma=0.1545 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003491600228346355}
[2026-04-13 01:51:31] UCB=0.9012 mu=0.6157 sigma=0.1428 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036161208321681742}
[2026-04-13 01:51:31] UCB=0.8500 mu=0.5139 sigma=0.1680 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002225900270994388}
[2026-04-13 01:51:31] UCB=0.8091 mu=0.5330 sigma=0.1381 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0007832591340163999}
[2026-04-13 01:51:31] UCB=0.7943 mu=0.4971 sigma=0.1486 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.004986321693039314}
[2026-04-13 01:51:31] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003491600228346355, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:33] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003492
[2026-04-13 01:51:42] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:51:42] [AutoResearch] mean_reward=69.8318
[2026-04-13 01:51:42] [AutoResearch] === Trial 180 Summary ===
[2026-04-13 01:51:42] Total runs in history: 298
[2026-04-13 01:51:42] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:42] Top 5 results:
[2026-04-13 01:51:42] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:42] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:42] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:42] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:42] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:44]
[AutoResearch] ========== Trial 181/200 ==========
[2026-04-13 01:51:44] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:51:44] UCB=1.0324 mu=0.7126 sigma=0.1599 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003623077185471837}
[2026-04-13 01:51:44] UCB=0.9116 mu=0.5706 sigma=0.1705 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035997060442987685}
[2026-04-13 01:51:44] UCB=0.8559 mu=0.5715 sigma=0.1422 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0033790983701108696}
[2026-04-13 01:51:44] UCB=0.8355 mu=0.5305 sigma=0.1525 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0035392808291132196}
[2026-04-13 01:51:44] UCB=0.7545 mu=0.5628 sigma=0.0959 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0006271815362759663}
[2026-04-13 01:51:44] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003623077185471837, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:46] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003623
[2026-04-13 01:51:55] [AutoResearch] Job finished in 9.1s, returncode=0
[2026-04-13 01:51:55] [AutoResearch] mean_reward=96.3322
[2026-04-13 01:51:55] [AutoResearch] === Trial 181 Summary ===
[2026-04-13 01:51:55] Total runs in history: 299
[2026-04-13 01:51:55] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:55] Top 5 results:
[2026-04-13 01:51:55] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:55] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:55] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:55] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:55] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:57]
[AutoResearch] ========== Trial 182/200 ==========
[2026-04-13 01:51:57] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:51:57] UCB=1.0314 mu=0.7078 sigma=0.1618 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003818357491026087}
[2026-04-13 01:51:57] UCB=0.8027 mu=0.5292 sigma=0.1367 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.000805032577040455}
[2026-04-13 01:51:57] UCB=0.7942 mu=0.4893 sigma=0.1524 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.00407920982038762}
[2026-04-13 01:51:57] UCB=0.7771 mu=0.4776 sigma=0.1497 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0009551306864251269}
[2026-04-13 01:51:57] UCB=0.7502 mu=0.5078 sigma=0.1212 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003465366785071685}
[2026-04-13 01:51:57] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003818357491026087, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:51:59] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003818
[2026-04-13 01:52:08] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:52:08] [AutoResearch] mean_reward=60.4213
[2026-04-13 01:52:08] [AutoResearch] === Trial 182 Summary ===
[2026-04-13 01:52:08] Total runs in history: 300
[2026-04-13 01:52:08] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:08] Top 5 results:
[2026-04-13 01:52:08] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:08] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:08] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:08] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:08] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:10]
[AutoResearch] ========== Trial 183/200 ==========
[2026-04-13 01:52:10] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:52:10] UCB=0.9555 mu=0.6852 sigma=0.1352 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035225077447470937}
[2026-04-13 01:52:10] UCB=0.9221 mu=0.6299 sigma=0.1461 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003508792155037348}
[2026-04-13 01:52:10] UCB=0.8474 mu=0.7509 sigma=0.0482 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036586415321887464}
[2026-04-13 01:52:10] UCB=0.8451 mu=0.5180 sigma=0.1636 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0039537502522221275}
[2026-04-13 01:52:10] UCB=0.8385 mu=0.6438 sigma=0.0974 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.028288783330288e-05}
[2026-04-13 01:52:10] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035225077447470937, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:12] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003523
[2026-04-13 01:52:21] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:52:21] [AutoResearch] mean_reward=85.328
[2026-04-13 01:52:21] [AutoResearch] === Trial 183 Summary ===
[2026-04-13 01:52:21] Total runs in history: 301
[2026-04-13 01:52:21] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:21] Top 5 results:
[2026-04-13 01:52:21] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:21] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:21] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:21] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:21] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:23]
[AutoResearch] ========== Trial 184/200 ==========
[2026-04-13 01:52:23] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:52:23] UCB=0.9606 mu=0.7241 sigma=0.1183 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0034430849838173647}
[2026-04-13 01:52:23] UCB=0.9027 mu=0.7640 sigma=0.0693 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035624662982350606}
[2026-04-13 01:52:23] UCB=0.8446 mu=0.7290 sigma=0.0578 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0033879645014999005}
[2026-04-13 01:52:23] UCB=0.8213 mu=0.5558 sigma=0.1328 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0033749869017482533}
[2026-04-13 01:52:23] UCB=0.7538 mu=0.4916 sigma=0.1311 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.003746502824299608}
[2026-04-13 01:52:23] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0034430849838173647, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:25] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003443
[2026-04-13 01:52:34] [AutoResearch] Job finished in 8.7s, returncode=0
[2026-04-13 01:52:34] [AutoResearch] mean_reward=51.9965
[2026-04-13 01:52:34] [AutoResearch] === Trial 184 Summary ===
[2026-04-13 01:52:34] Total runs in history: 302
[2026-04-13 01:52:34] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:34] Top 5 results:
[2026-04-13 01:52:34] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:34] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:34] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:34] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:34] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:36]
[AutoResearch] ========== Trial 185/200 ==========
[2026-04-13 01:52:36] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:52:36] UCB=1.1251 mu=0.8198 sigma=0.1527 params={'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.0049939230441137655}
[2026-04-13 01:52:36] UCB=0.8823 mu=0.5335 sigma=0.1744 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.003907972025437071}
[2026-04-13 01:52:36] UCB=0.8037 mu=0.5104 sigma=0.1466 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0021090259051973308}
[2026-04-13 01:52:36] UCB=0.8004 mu=0.4624 sigma=0.1690 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.003477605271030014}
[2026-04-13 01:52:36] UCB=0.7984 mu=0.4606 sigma=0.1689 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0021038292329371698}
[2026-04-13 01:52:36] [AutoResearch] Proposed params: {'n_steer': 3, 'n_throttle': 3, 'learning_rate': 0.0049939230441137655, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:38] [AutoResearch] Launching job: n_steer=3 n_throttle=3 lr=0.004994
[2026-04-13 01:52:46] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:52:46] [AutoResearch] mean_reward=63.7252
[2026-04-13 01:52:46] [AutoResearch] === Trial 185 Summary ===
[2026-04-13 01:52:46] Total runs in history: 303
[2026-04-13 01:52:46] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:46] Top 5 results:
[2026-04-13 01:52:46] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:46] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:46] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:46] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:46] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:48]
[AutoResearch] ========== Trial 186/200 ==========
[2026-04-13 01:52:49] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:52:49] UCB=0.9933 mu=0.7268 sigma=0.1332 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.00370419095252343}
[2026-04-13 01:52:49] UCB=0.9315 mu=0.6369 sigma=0.1473 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 5.287858413858992e-05}
[2026-04-13 01:52:49] UCB=0.8261 mu=0.4893 sigma=0.1684 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.000919912353828159}
[2026-04-13 01:52:49] UCB=0.7864 mu=0.4532 sigma=0.1666 params={'n_steer': 7, 'n_throttle': 2, 'learning_rate': 0.003816712202626492}
[2026-04-13 01:52:49] UCB=0.7757 mu=0.4764 sigma=0.1497 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0025609904625457027}
[2026-04-13 01:52:49] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.00370419095252343, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:51] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003704
[2026-04-13 01:52:59] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:52:59] [AutoResearch] mean_reward=62.2993
[2026-04-13 01:52:59] [AutoResearch] === Trial 186 Summary ===
[2026-04-13 01:52:59] Total runs in history: 304
[2026-04-13 01:52:59] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:59] Top 5 results:
[2026-04-13 01:52:59] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:59] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:59] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:59] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:52:59] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:01]
[AutoResearch] ========== Trial 187/200 ==========
[2026-04-13 01:53:02] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:53:02] UCB=0.8362 mu=0.5129 sigma=0.1616 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.002187813189418104}
[2026-04-13 01:53:02] UCB=0.8317 mu=0.4921 sigma=0.1698 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002362637607936012}
[2026-04-13 01:53:02] UCB=0.8185 mu=0.5381 sigma=0.1402 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035097416292239276}
[2026-04-13 01:53:02] UCB=0.8072 mu=0.4954 sigma=0.1559 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004013556596163426}
[2026-04-13 01:53:02] UCB=0.7941 mu=0.4512 sigma=0.1715 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0034551247905998234}
[2026-04-13 01:53:02] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.002187813189418104, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:04] [AutoResearch] Launching job: n_steer=8 n_throttle=5 lr=0.002188
[2026-04-13 01:53:12] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:53:12] [AutoResearch] mean_reward=61.3429
[2026-04-13 01:53:12] [AutoResearch] === Trial 187 Summary ===
[2026-04-13 01:53:12] Total runs in history: 305
[2026-04-13 01:53:12] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:12] Top 5 results:
[2026-04-13 01:53:12] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:12] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:12] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:12] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:12] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:14]
[AutoResearch] ========== Trial 188/200 ==========
[2026-04-13 01:53:14] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:53:14] UCB=0.9404 mu=0.6622 sigma=0.1391 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035970039225449468}
[2026-04-13 01:53:14] UCB=0.9046 mu=0.6201 sigma=0.1423 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003679821085505559}
[2026-04-13 01:53:14] UCB=0.8930 mu=0.7185 sigma=0.0873 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003745307890517056}
[2026-04-13 01:53:14] UCB=0.8357 mu=0.4729 sigma=0.1814 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.00400644557520237}
[2026-04-13 01:53:14] UCB=0.8100 mu=0.4659 sigma=0.1720 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.002413438379886847}
[2026-04-13 01:53:14] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035970039225449468, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:16] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003597
[2026-04-13 01:53:25] [AutoResearch] Job finished in 8.9s, returncode=0
[2026-04-13 01:53:25] [AutoResearch] mean_reward=93.2864
[2026-04-13 01:53:25] [AutoResearch] === Trial 188 Summary ===
[2026-04-13 01:53:25] Total runs in history: 306
[2026-04-13 01:53:25] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:25] Top 5 results:
[2026-04-13 01:53:25] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:25] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:25] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:25] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:25] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:27]
[AutoResearch] ========== Trial 189/200 ==========
[2026-04-13 01:53:27] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:53:27] UCB=0.9114 mu=0.6295 sigma=0.1410 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.298799597311507e-05}
[2026-04-13 01:53:27] UCB=0.8861 mu=0.5497 sigma=0.1682 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.003585127640097398}
[2026-04-13 01:53:27] UCB=0.8810 mu=0.5399 sigma=0.1706 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.002269200870334638}
[2026-04-13 01:53:27] UCB=0.8469 mu=0.5408 sigma=0.1531 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001974644266152381}
[2026-04-13 01:53:27] UCB=0.8467 mu=0.5562 sigma=0.1452 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.003515227339221353}
[2026-04-13 01:53:27] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.298799597311507e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:29] [AutoResearch] Launching job: n_steer=5 n_throttle=4 lr=0.000083
[2026-04-13 01:53:38] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:53:38] [AutoResearch] mean_reward=84.6081
[2026-04-13 01:53:38] [AutoResearch] === Trial 189 Summary ===
[2026-04-13 01:53:38] Total runs in history: 307
[2026-04-13 01:53:38] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:38] Top 5 results:
[2026-04-13 01:53:38] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:38] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:38] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:38] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:38] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:40]
[AutoResearch] ========== Trial 190/200 ==========
[2026-04-13 01:53:40] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:53:40] UCB=0.9063 mu=0.6423 sigma=0.1320 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0034939908567109316}
[2026-04-13 01:53:40] UCB=0.8867 mu=0.6583 sigma=0.1142 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035386445188696747}
[2026-04-13 01:53:40] UCB=0.8571 mu=0.4976 sigma=0.1797 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0039034944089353855}
[2026-04-13 01:53:40] UCB=0.8454 mu=0.5531 sigma=0.1462 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0035687083277951373}
[2026-04-13 01:53:40] UCB=0.7883 mu=0.5182 sigma=0.1351 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003493395913651652}
[2026-04-13 01:53:40] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0034939908567109316, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:42] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003494
[2026-04-13 01:53:51] [AutoResearch] Job finished in 8.8s, returncode=0
[2026-04-13 01:53:51] [AutoResearch] mean_reward=73.9234
[2026-04-13 01:53:51] [AutoResearch] === Trial 190 Summary ===
[2026-04-13 01:53:51] Total runs in history: 308
[2026-04-13 01:53:51] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:51] Top 5 results:
[2026-04-13 01:53:51] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:51] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:51] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:51] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:51] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:53]
[AutoResearch] ========== Trial 191/200 ==========
[2026-04-13 01:53:53] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:53:53] UCB=0.8738 mu=0.5834 sigma=0.1452 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 6.437147161114935e-05}
[2026-04-13 01:53:53] UCB=0.8654 mu=0.5222 sigma=0.1716 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.003913756893933623}
[2026-04-13 01:53:53] UCB=0.8259 mu=0.6305 sigma=0.0977 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0034955035175064098}
[2026-04-13 01:53:53] UCB=0.7909 mu=0.6335 sigma=0.0787 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.003706130646033743}
[2026-04-13 01:53:53] UCB=0.7807 mu=0.4617 sigma=0.1595 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0008492808246747831}
[2026-04-13 01:53:53] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 3, 'learning_rate': 6.437147161114935e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:53:55] [AutoResearch] Launching job: n_steer=5 n_throttle=3 lr=0.000064
[2026-04-13 01:54:04] [AutoResearch] Job finished in 8.6s, returncode=0
[2026-04-13 01:54:04] [AutoResearch] mean_reward=71.3288
[2026-04-13 01:54:04] [AutoResearch] === Trial 191 Summary ===
[2026-04-13 01:54:04] Total runs in history: 309
[2026-04-13 01:54:04] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:04] Top 5 results:
[2026-04-13 01:54:04] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:04] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:04] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:04] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:04] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:06]
[AutoResearch] ========== Trial 192/200 ==========
[2026-04-13 01:54:06] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:54:06] UCB=0.8763 mu=0.5316 sigma=0.1724 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0021693436639211063}
[2026-04-13 01:54:06] UCB=0.8472 mu=0.7028 sigma=0.0722 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003640920976101544}
[2026-04-13 01:54:06] UCB=0.7787 mu=0.4465 sigma=0.1661 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0024718061544978026}
[2026-04-13 01:54:06] UCB=0.7757 mu=0.4812 sigma=0.1473 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0033862291143332574}
[2026-04-13 01:54:06] UCB=0.7572 mu=0.4208 sigma=0.1682 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0009641410366605733}
[2026-04-13 01:54:06] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.0021693436639211063, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:08] [AutoResearch] Launching job: n_steer=7 n_throttle=5 lr=0.002169
[2026-04-13 01:54:17] [AutoResearch] Job finished in 9.0s, returncode=0
[2026-04-13 01:54:17] [AutoResearch] mean_reward=74.053
[2026-04-13 01:54:17] [AutoResearch] === Trial 192 Summary ===
[2026-04-13 01:54:17] Total runs in history: 310
[2026-04-13 01:54:17] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:17] Top 5 results:
[2026-04-13 01:54:17] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:17] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:17] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:17] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:17] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:19]
[AutoResearch] ========== Trial 193/200 ==========
[2026-04-13 01:54:19] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:54:19] UCB=0.8808 mu=0.5381 sigma=0.1713 params={'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.002021260667478087}
[2026-04-13 01:54:19] UCB=0.8737 mu=0.5086 sigma=0.1825 params={'n_steer': 4, 'n_throttle': 2, 'learning_rate': 0.003880587419006791}
[2026-04-13 01:54:19] UCB=0.7961 mu=0.5206 sigma=0.1377 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0007455705713928767}
[2026-04-13 01:54:19] UCB=0.7724 mu=0.4494 sigma=0.1615 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.001055184904588385}
[2026-04-13 01:54:19] UCB=0.7527 mu=0.4196 sigma=0.1665 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0008413615930839286}
[2026-04-13 01:54:19] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 5, 'learning_rate': 0.002021260667478087, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:21] [AutoResearch] Launching job: n_steer=7 n_throttle=5 lr=0.002021
[2026-04-13 01:54:29] [AutoResearch] Job finished in 8.2s, returncode=0
[2026-04-13 01:54:29] [AutoResearch] mean_reward=53.1
[2026-04-13 01:54:29] [AutoResearch] === Trial 193 Summary ===
[2026-04-13 01:54:29] Total runs in history: 311
[2026-04-13 01:54:29] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:29] Top 5 results:
[2026-04-13 01:54:29] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:29] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:29] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:29] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:29] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:31]
[AutoResearch] ========== Trial 194/200 ==========
[2026-04-13 01:54:32] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:54:32] UCB=0.9154 mu=0.5900 sigma=0.1627 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.003844169255029291}
[2026-04-13 01:54:32] UCB=0.8310 mu=0.5017 sigma=0.1646 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.003519645449999925}
[2026-04-13 01:54:32] UCB=0.8029 mu=0.4510 sigma=0.1760 params={'n_steer': 9, 'n_throttle': 2, 'learning_rate': 0.002300268268208318}
[2026-04-13 01:54:32] UCB=0.7740 mu=0.6144 sigma=0.0798 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0007771763955422609}
[2026-04-13 01:54:32] UCB=0.7557 mu=0.4167 sigma=0.1695 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.0007372537186135668}
[2026-04-13 01:54:32] [AutoResearch] Proposed params: {'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.003844169255029291, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:34] [AutoResearch] Launching job: n_steer=7 n_throttle=3 lr=0.003844
[2026-04-13 01:54:42] [AutoResearch] Job finished in 8.2s, returncode=0
[2026-04-13 01:54:42] [AutoResearch] mean_reward=43.1928
[2026-04-13 01:54:42] [AutoResearch] === Trial 194 Summary ===
[2026-04-13 01:54:42] Total runs in history: 312
[2026-04-13 01:54:42] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:42] Top 5 results:
[2026-04-13 01:54:42] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:42] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:42] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:42] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:42] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:44]
[AutoResearch] ========== Trial 195/200 ==========
[2026-04-13 01:54:44] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:54:44] UCB=0.8897 mu=0.6459 sigma=0.1219 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003646711706724205}
[2026-04-13 01:54:44] UCB=0.8881 mu=0.5426 sigma=0.1727 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0019222075484476822}
[2026-04-13 01:54:44] UCB=0.8543 mu=0.5106 sigma=0.1718 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0034521524018910556}
[2026-04-13 01:54:44] UCB=0.8410 mu=0.6662 sigma=0.0874 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035256062972769163}
[2026-04-13 01:54:44] UCB=0.8335 mu=0.6934 sigma=0.0701 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003650043164571141}
[2026-04-13 01:54:44] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.003646711706724205, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:46] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003647
[2026-04-13 01:54:55] [AutoResearch] Job finished in 8.5s, returncode=0
[2026-04-13 01:54:55] [AutoResearch] mean_reward=56.4788
[2026-04-13 01:54:55] [AutoResearch] === Trial 195 Summary ===
[2026-04-13 01:54:55] Total runs in history: 313
[2026-04-13 01:54:55] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:55] Top 5 results:
[2026-04-13 01:54:55] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:55] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:55] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:55] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:55] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:57]
[AutoResearch] ========== Trial 196/200 ==========
[2026-04-13 01:54:57] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:54:57] UCB=0.8269 mu=0.4678 sigma=0.1795 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003911730307839594}
[2026-04-13 01:54:57] UCB=0.8166 mu=0.4740 sigma=0.1713 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.00350103575195631}
[2026-04-13 01:54:57] UCB=0.7707 mu=0.4468 sigma=0.1619 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0008109564223723561}
[2026-04-13 01:54:57] UCB=0.7592 mu=0.5335 sigma=0.1129 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0006805373824626318}
[2026-04-13 01:54:57] UCB=0.7363 mu=0.5441 sigma=0.0961 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0033555486297841856}
[2026-04-13 01:54:57] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.003911730307839594, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:54:59] [AutoResearch] Launching job: n_steer=5 n_throttle=2 lr=0.003912
[2026-04-13 01:55:07] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:55:07] [AutoResearch] mean_reward=50.413
[2026-04-13 01:55:07] [AutoResearch] === Trial 196 Summary ===
[2026-04-13 01:55:07] Total runs in history: 314
[2026-04-13 01:55:07] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:07] Top 5 results:
[2026-04-13 01:55:07] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:07] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:07] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:07] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:07] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:09]
[AutoResearch] ========== Trial 197/200 ==========
[2026-04-13 01:55:09] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:55:09] UCB=0.8504 mu=0.6243 sigma=0.1130 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035821770162068104}
[2026-04-13 01:55:09] UCB=0.8364 mu=0.5112 sigma=0.1626 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0008983777144142389}
[2026-04-13 01:55:09] UCB=0.8269 mu=0.5347 sigma=0.1461 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 9.033387351833335e-05}
[2026-04-13 01:55:09] UCB=0.8086 mu=0.6483 sigma=0.0802 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036137283648080033}
[2026-04-13 01:55:09] UCB=0.7728 mu=0.4598 sigma=0.1565 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.00414878270096977}
[2026-04-13 01:55:09] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035821770162068104, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:11] [AutoResearch] Launching job: n_steer=8 n_throttle=3 lr=0.003582
[2026-04-13 01:55:20] [AutoResearch] Job finished in 8.3s, returncode=0
[2026-04-13 01:55:20] [AutoResearch] mean_reward=58.4384
[2026-04-13 01:55:20] [AutoResearch] === Trial 197 Summary ===
[2026-04-13 01:55:20] Total runs in history: 315
[2026-04-13 01:55:20] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:20] Top 5 results:
[2026-04-13 01:55:20] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:20] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:20] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:20] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:20] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:22]
[AutoResearch] ========== Trial 198/200 ==========
[2026-04-13 01:55:22] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:55:22] UCB=1.0116 mu=0.8521 sigma=0.0797 params={'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.004996488557393561}
[2026-04-13 01:55:22] UCB=0.8363 mu=0.5159 sigma=0.1602 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.004011273588545727}
[2026-04-13 01:55:22] UCB=0.8099 mu=0.4682 sigma=0.1708 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0037716444061262197}
[2026-04-13 01:55:22] UCB=0.7963 mu=0.4935 sigma=0.1514 params={'n_steer': 7, 'n_throttle': 4, 'learning_rate': 0.0022899351236047275}
[2026-04-13 01:55:22] UCB=0.7543 mu=0.4506 sigma=0.1518 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.000852766933229179}
[2026-04-13 01:55:22] [AutoResearch] Proposed params: {'n_steer': 9, 'n_throttle': 3, 'learning_rate': 0.004996488557393561, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:24] [AutoResearch] Launching job: n_steer=9 n_throttle=3 lr=0.004996
[2026-04-13 01:55:32] [AutoResearch] Job finished in 8.1s, returncode=0
[2026-04-13 01:55:32] [AutoResearch] mean_reward=46.6812
[2026-04-13 01:55:32] [AutoResearch] === Trial 198 Summary ===
[2026-04-13 01:55:32] Total runs in history: 316
[2026-04-13 01:55:32] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:32] Top 5 results:
[2026-04-13 01:55:32] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:32] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:32] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:32] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:32] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:34]
[AutoResearch] ========== Trial 199/200 ==========
[2026-04-13 01:55:34] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:55:34] UCB=1.0010 mu=0.7139 sigma=0.1435 params={'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.433335464045147e-05}
[2026-04-13 01:55:34] UCB=0.8118 mu=0.5424 sigma=0.1347 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0035198921926947488}
[2026-04-13 01:55:34] UCB=0.7905 mu=0.4642 sigma=0.1631 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.002383977858819023}
[2026-04-13 01:55:34] UCB=0.7504 mu=0.6255 sigma=0.0625 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036409099649273493}
[2026-04-13 01:55:34] UCB=0.7323 mu=0.4588 sigma=0.1368 params={'n_steer': 5, 'n_throttle': 2, 'learning_rate': 0.0008342369191574561}
[2026-04-13 01:55:34] [AutoResearch] Proposed params: {'n_steer': 5, 'n_throttle': 4, 'learning_rate': 8.433335464045147e-05, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:36] [AutoResearch] Launching job: n_steer=5 n_throttle=4 lr=0.000084
[2026-04-13 01:55:44] [AutoResearch] Job finished in 8.2s, returncode=0
[2026-04-13 01:55:44] [AutoResearch] mean_reward=47.3571
[2026-04-13 01:55:44] [AutoResearch] === Trial 199 Summary ===
[2026-04-13 01:55:44] Total runs in history: 317
[2026-04-13 01:55:44] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:44] Top 5 results:
[2026-04-13 01:55:44] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:44] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:44] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:44] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:44] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:46]
[AutoResearch] ========== Trial 200/200 ==========
[2026-04-13 01:55:46] [AutoResearch] GP UCB top-5 candidates:
[2026-04-13 01:55:46] UCB=0.7404 mu=0.3997 sigma=0.1703 params={'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0022889855206986213}
[2026-04-13 01:55:46] UCB=0.7171 mu=0.5084 sigma=0.1043 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.000930624307401501}
[2026-04-13 01:55:46] UCB=0.6968 mu=0.3540 sigma=0.1714 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0007154345889676419}
[2026-04-13 01:55:46] UCB=0.6957 mu=0.5438 sigma=0.0759 params={'n_steer': 6, 'n_throttle': 3, 'learning_rate': 0.0008446982553323807}
[2026-04-13 01:55:46] UCB=0.6876 mu=0.4436 sigma=0.1220 params={'n_steer': 8, 'n_throttle': 2, 'learning_rate': 0.0013559749010314895}
[2026-04-13 01:55:46] [AutoResearch] Proposed params: {'n_steer': 8, 'n_throttle': 4, 'learning_rate': 0.0022889855206986213, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:48] [AutoResearch] Launching job: n_steer=8 n_throttle=4 lr=0.002289
[2026-04-13 01:55:57] [AutoResearch] Job finished in 8.4s, returncode=0
[2026-04-13 01:55:57] [AutoResearch] mean_reward=60.3436
[2026-04-13 01:55:57] [AutoResearch] === Trial 200 Summary ===
[2026-04-13 01:55:57] Total runs in history: 318
[2026-04-13 01:55:57] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:57] Top 5 results:
[2026-04-13 01:55:57] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:57] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:57] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:57] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:57] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:59] [AutoResearch] All trials complete!
[2026-04-13 01:55:59] [AutoResearch] === Trial 200 Summary ===
[2026-04-13 01:55:59] Total runs in history: 318
[2026-04-13 01:55:59] Best so far: mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:59] Top 5 results:
[2026-04-13 01:55:59] mean_reward=141.8524 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.0020237415552484734, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:59] mean_reward=125.5734 params={'n_steer': 8, 'n_throttle': 5, 'learning_rate': 0.001997383198130263, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:59] mean_reward=122.2970 params={'n_steer': 6, 'n_throttle': 2, 'learning_rate': 0.0012216452706746085, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:59] mean_reward=120.0185 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0037260182912991057, 'timesteps': 2000, 'eval_episodes': 3}
[2026-04-13 01:55:59] mean_reward=117.3069 params={'n_steer': 8, 'n_throttle': 3, 'learning_rate': 0.0036440082123546827, 'timesteps': 2000, 'eval_episodes': 3}