Commit Graph

1 Commit

Author SHA1 Message Date
Paul Huliganga 860e3d6610 fix: fresh PPO verbose=0 suppressed all training output — set verbose=1
Without this change, Wave 4 scratch-trained models produce no rollout
stats in the log, making it impossible to monitor training progress or
spot degenerate policies early.

Warm-start models in Wave 3 showed stats because verbose=1 was baked
into the Phase-2 saved model state; fresh models default to verbose=0.
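
The precedence described above can be sketched as a small, library-free
model. This is only an illustration under stated assumptions:
`effective_verbose`, `LIBRARY_DEFAULT_VERBOSE`, and `phase2_state` are
hypothetical names, not part of the training code or of any RL library.

```python
from typing import Any, Dict, Optional

# Assumed library default for a freshly constructed PPO model,
# as stated in the commit message.
LIBRARY_DEFAULT_VERBOSE = 0


def effective_verbose(saved_state: Optional[Dict[str, Any]] = None,
                      explicit: Optional[int] = None) -> int:
    """Return the verbosity a model would end up with.

    Hypothetical helper: it models the precedence described in the
    commit message and is not a real library API.
    """
    if explicit is not None:
        # The fix: pass verbose explicitly so the result does not
        # depend on whether the model was loaded or built fresh.
        return explicit
    if saved_state is not None and "verbose" in saved_state:
        # Warm start: the value stored in the saved model state wins.
        return saved_state["verbose"]
    # Fresh model with nothing specified: fall back to the default.
    return LIBRARY_DEFAULT_VERBOSE


phase2_state = {"verbose": 1}  # hypothetical Phase-2 saved state

warm = effective_verbose(saved_state=phase2_state)  # Wave 3 warm start
fresh_before = effective_verbose()                  # Wave 4 before the fix
fresh_after = effective_verbose(explicit=1)         # Wave 4 after the fix
```

If the training code uses Stable-Baselines3 (an assumption; the commit
does not name the library), the equivalent change would be passing
verbose=1 to the PPO constructor for scratch-trained models.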

Agent: pi
Tests: 96 passed
Tests-Added: 0
TypeScript: N/A
2026-04-14 22:44:22 -04:00