Problems fixed: - Timesteps 5k-30k caused all trials to timeout (PPO+CNN+CPU needs ~0.1s/step) - New range: 1000-5000 steps fits well within 480s timeout - PPO random init policy outputs throttle~0 -> car sits still -> fix with ThrottleClampWrapper (min 0.2) - Sim stuck detection: if speed<0.02 for 100 consecutive steps, stop training and report error - Sim frozen detection: if observation unchanged for 30 steps, stop training (connection lost) - eval_episodes reduced to 3 to speed up evaluation phase Agent: pi/claude-sonnet Tests: 37/37 passing Tests-Added: 0 (behaviour change only) TypeScript: N/A |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| test_autoresearch_controller.py | ||
| test_discretize_action.py | ||
| test_reward_wrapper.py | ||
| test_runner_integration.py | ||