autoresearch: phase1 trial 20 results

Agent: pi
Tests: N/A
Tests-Added: 0
TypeScript: N/A
Paul Huliganga 2026-04-14 04:35:49 -04:00
parent 5114a95a74
commit cfd1f843a4
1 changed file with 11 additions and 0 deletions

@@ -464,3 +464,14 @@
[2026-04-14 04:35:45] mean_reward=2073.7372 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.0002881292103575585, 'timesteps': 15876, 'agent': 'ppo', 'eval_episodes': 5, 'reward_shaping': True}
[2026-04-14 04:35:45] mean_reward=1382.4461 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0010723485700433605, 'timesteps': 33234, 'agent': 'ppo', 'eval_episodes': 5, 'reward_shaping': True}
[2026-04-14 04:35:45] mean_reward=1097.1248 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.001421177467065464, 'timesteps': 33363, 'agent': 'ppo', 'eval_episodes': 5, 'reward_shaping': True}
[2026-04-14 04:35:47] [AutoResearch] Git push complete after trial 20
[2026-04-14 04:35:49] [AutoResearch] All trials complete!
[2026-04-14 04:35:49] [AutoResearch] === Trial 20 Summary ===
[2026-04-14 04:35:49] Total Phase 1 runs: 21
[2026-04-14 04:35:49] Champion: trial=5 mean_reward=4582.7984 params={'n_steer': 7, 'n_throttle': 3, 'learning_rate': 0.0006801262090358742, 'timesteps': 4787, 'agent': 'ppo', 'eval_episodes': 3, 'reward_shaping': True}
[2026-04-14 04:35:49] Top 5:
[2026-04-14 04:35:49] mean_reward=2469.2835 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.00022474333387549633, 'timesteps': 13328, 'agent': 'ppo', 'eval_episodes': 5, 'reward_shaping': True}
[2026-04-14 04:35:49] mean_reward=2296.1891 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0011680072988353367, 'timesteps': 34177, 'agent': 'ppo', 'eval_episodes': 5, 'reward_shaping': True}
[2026-04-14 04:35:49] mean_reward=2073.7372 params={'n_steer': 3, 'n_throttle': 5, 'learning_rate': 0.0002881292103575585, 'timesteps': 15876, 'agent': 'ppo', 'eval_episodes': 5, 'reward_shaping': True}
[2026-04-14 04:35:49] mean_reward=1382.4461 params={'n_steer': 4, 'n_throttle': 3, 'learning_rate': 0.0010723485700433605, 'timesteps': 33234, 'agent': 'ppo', 'eval_episodes': 5, 'reward_shaping': True}
[2026-04-14 04:35:49] mean_reward=1097.1248 params={'n_steer': 5, 'n_throttle': 3, 'learning_rate': 0.001421177467065464, 'timesteps': 33363, 'agent': 'ppo', 'eval_episodes': 5, 'reward_shaping': True}