[13:37:03] ============================================================
[13:37:03] Exp 27: fresh weights | truly random roads | variable throttle
[13:37:03] Sim: localhost:9091 → donkey-generated-roads-v0
[13:37:03] Steering: 7 bins | Throttle: 3 bins → [0.2, 0.5, 1.0]
[13:37:03] LR=0.0003, ent_coef=0.05, n_steps=1024
[13:37:03] Total=500,000 steps, checkpoint every 10,000
[13:37:03] CTE term: >2.0m for >0.5s
[13:37:03] Speed term: <1.0 for >1.5s
[13:37:03] Episode cap: 30.0s | Road regen: random seed each checkpoint
[13:37:03] BrakeOnUpdateCallback: enabled
[13:37:03] ============================================================
[13:37:03] Connecting to sim...
[13:37:04] Connected. obs=(3, 120, 160), action=Discrete(21)
[13:37:04] Initial road regen (seed=81035)...
[13:37:07] Road ready.
[13:37:07] Creating fresh PPO model (no warm start)...
[13:37:08] Model created. Action space: 21 discrete actions
[13:37:08] Exp 27 started — PID 1082126
[13:37:08] Log: /home/paulh/projects/donkeycar-rl-autoresearch/agent/models/exp27-random-roads/run_2026-05-06_133703_random_roads.log
[13:45:35] [10,000/500,000] Checkpoint saved
[13:45:35] Regenerating road (seed=68546)...
[13:45:38] Road ready.
[13:45:42] Eval (seed=68546): 39.0r/145s ❌@145
[13:45:43] NEW BEST: steps=145 reward=39.0
[13:52:28] [20,000/500,000] Checkpoint saved
[13:52:28] Regenerating road (seed=35735)...
[13:52:31] Road ready.
[13:52:36] Eval (seed=35735): 71.6r/230s ❌@230
[13:52:37] NEW BEST: steps=230 reward=71.6
[13:58:59] [30,000/500,000] Checkpoint saved
[13:58:59] Regenerating road (seed=98061)...
[13:59:02] Road ready.
[13:59:06] Eval (seed=98061): 39.2r/139s ❌@139
[14:07:08] [40,000/500,000] Checkpoint saved
[14:07:08] Regenerating road (seed=2167)...
[14:07:11] Road ready.
[14:07:16] Eval (seed=2167): 33.9r/148s ❌@148
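
Note: the eval lines above pack reward and step count into one token (for example, 39.0r/145s is reward 39.0 over 145 steps, matching the "NEW BEST: steps=145 reward=39.0" line that follows it). A minimal sketch for pulling those numbers out of the log is given below. The names EVAL_RE and parse_evals are hypothetical helpers and not part of the training script, and the trailing ❌@N marker is deliberately left unparsed because the log does not define it.

```python
import re

# Matches eval lines in the format seen in this log, e.g.
#   [13:45:42] Eval (seed=68546): 39.0r/145s ❌@145
# EVAL_RE and parse_evals are illustrative names, not from the training code.
EVAL_RE = re.compile(
    r"^\[(?P<time>\d{2}:\d{2}:\d{2})\] Eval \(seed=(?P<seed>\d+)\): "
    r"(?P<reward>-?\d+(?:\.\d+)?)r/(?P<steps>\d+)s"
)


def parse_evals(lines):
    """Return one dict per eval line; lines that do not match are skipped."""
    results = []
    for line in lines:
        m = EVAL_RE.match(line.strip())
        if m:
            results.append({
                "time": m["time"],
                "seed": int(m["seed"]),
                "reward": float(m["reward"]),
                "steps": int(m["steps"]),
            })
    return results


# Sample lines copied from the log above.
sample = [
    "[13:45:35] [10,000/500,000] Checkpoint saved",
    "[13:45:42] Eval (seed=68546): 39.0r/145s ❌@145",
    "[13:52:36] Eval (seed=35735): 71.6r/230s ❌@230",
    "[13:59:06] Eval (seed=98061): 39.2r/139s ❌@139",
    "[14:07:16] Eval (seed=2167): 33.9r/148s ❌@148",
]

evals = parse_evals(sample)
best = max(evals, key=lambda e: e["reward"])
print(f"best eval: seed={best['seed']} reward={best['reward']} steps={best['steps']}")
```

Because each eval runs on a freshly seeded road, rewards from different checkpoints are not measured on the same track, so the "best" entry here is only the highest single-road score.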