Three changes:

1. Lower total_timesteps cap: 120k → 90k
   Actual throughput is 16 steps/sec (not 20 as estimated). 120k steps = 126 min training + 9 min overhead = 135 min > 2 hr limit. 90k steps = 94 min training + 8 min overhead = 102 min, safely within the limit.
2. Per-segment checkpoint saves in multitrack_runner
   model.save() is called after every segment so the latest weights are always on disk. If the runner is killed (timeout/crash/Ctrl+C), training progress is never completely lost.
3. Timeout rescue eval in wave4_controller
   If JOB_TIMEOUT fires and a checkpoint exists, immediately run a quick mini_monaco eval on the checkpoint so the trial still produces a GP data point despite the timeout.

Agent: pi
Tests: 102 passed
Tests-Added: 0
TypeScript: N/A
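The per-segment save and timeout-rescue pattern can be sketched as below. This is a minimal illustration, not the actual multitrack_runner/wave4_controller code: `DummyModel`, `run_segments`, and `rescue_eval` are hypothetical names, and the model is assumed to expose a Stable-Baselines3-style `.learn()`/`.save()` interface.

```python
import os
import tempfile

class DummyModel:
    """Hypothetical stand-in for the RL policy; records steps trained."""
    def __init__(self):
        self.steps_trained = 0

    def learn(self, total_timesteps):
        self.steps_trained += total_timesteps

    def save(self, path):
        with open(path, "w") as f:
            f.write(str(self.steps_trained))


def run_segments(model, checkpoint_path, segment_steps, n_segments):
    """Train in segments, saving after each one so a kill
    (timeout/crash/Ctrl+C) loses at most the current segment."""
    for _ in range(n_segments):
        model.learn(total_timesteps=segment_steps)
        model.save(checkpoint_path)  # latest weights always on disk


def rescue_eval(checkpoint_path, eval_fn):
    """Timeout rescue: if a checkpoint exists, evaluate it so the trial
    still yields a data point; otherwise there is nothing to report."""
    if os.path.exists(checkpoint_path):
        return eval_fn(checkpoint_path)
    return None


# Budget sanity check from the commit's measured throughput (16 steps/sec):
# 90k steps ≈ 94 min of training, plus ~8 min overhead, fits the 2 hr cap.
assert 90_000 / 16 / 60 + 8 < 120

ckpt = os.path.join(tempfile.mkdtemp(), "model.ckpt")
model = DummyModel()
run_segments(model, ckpt, segment_steps=10_000, n_segments=9)  # 90k total
result = rescue_eval(ckpt, lambda p: int(open(p).read()))
print(result)  # prints 90000, the checkpointed step count
```

Saving after every segment bounds the worst-case loss to one segment of training, which is what makes the rescue eval worthwhile: the checkpoint on disk is never more than one segment stale when JOB_TIMEOUT fires.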
Repository contents:

- .harness
- agent
- docs
- tests
- .gitignore
- AGENT.md
- DECISIONS.md
- IMPLEMENTATION_PLAN.md
- PROJECT-KICKOFF.md
- PROJECT-SPEC.md
- README.md
- create_gitea_repo.py
- ralph-loop.sh
README.md

# donkeycar-rl-autoresearch

## Purpose

## Status

- Scaffolded with the agent harness
- Spec not filled yet

## Runbook

- Fill PROJECT-SPEC.md
- Create IMPLEMENTATION_PLAN.md from the spec
- Start the implementation loop