Research-grade DonkeyCar RL autoresearch and sweep system.

Go to file

Paul Huliganga a9eed2faa3 fix: restart with verified config + seed GP with overnight 1943 result All previous issues: - Controller was never restarted after cap/checkpoint fixes -> they never ran - Timeout trials (score=0) were polluting GP data -> removed - Overnight Trial 3 result (1943 mini_monaco) was unknown to GP -> added GP now has 5 valid data points including the 1943 score at lr=0.000685, switch=17499. GP should converge toward longer switching intervals which produced the only great result. Verified before relaunch: - PARAM_SPACE max total_timesteps = 90000 ✓ - Checkpoint saves after every segment ✓ - Rescue eval on timeout ✓ - 102 tests passing ✓ Agent: pi Tests: 102 passed Tests-Added: 0 TypeScript: N/A		2026-04-15 22:26:53 -04:00
.harness	feat: Wave 1 complete — real PPO training, model save, GP+UCB autoresearch, 37 tests passing	2026-04-13 10:03:15 -04:00
agent	fix: restart with verified config + seed GP with overnight 1943 result	2026-04-15 22:26:53 -04:00
docs	wave3: add multi-track autoresearch system (83 tests passing)	2026-04-14 12:47:12 -04:00
tests	fix: StuckTerminationWrapper + deque import + 102 tests	2026-04-15 09:17:27 -04:00
.gitignore	feat: Wave 1 complete — real PPO training, model save, GP+UCB autoresearch, 37 tests passing	2026-04-13 10:03:15 -04:00
AGENT.md	feat: Wave 1 complete — real PPO training, model save, GP+UCB autoresearch, 37 tests passing	2026-04-13 10:03:15 -04:00
DECISIONS.md	wave3: add multi-track autoresearch system (83 tests passing)	2026-04-14 12:47:12 -04:00
IMPLEMENTATION_PLAN.md	feat: Phase 3 — behavioral control, enhanced evaluator, 53 tests	2026-04-14 09:28:43 -04:00
PROJECT-KICKOFF.md	feat: Wave 1 complete — real PPO training, model save, GP+UCB autoresearch, 37 tests passing	2026-04-13 10:03:15 -04:00
PROJECT-SPEC.md	feat: Wave 1 complete — real PPO training, model save, GP+UCB autoresearch, 37 tests passing	2026-04-13 10:03:15 -04:00
README.md	feat: Wave 1 complete — real PPO training, model save, GP+UCB autoresearch, 37 tests passing	2026-04-13 10:03:15 -04:00
create_gitea_repo.py	Initial commit	2026-04-12 23:44:36 -04:00
ralph-loop.sh	feat: Wave 1 complete — real PPO training, model save, GP+UCB autoresearch, 37 tests passing	2026-04-13 10:03:15 -04:00

README.md

donkeycar-rl-autoresearch

Purpose

Status

Scaffolded with the agent harness
Spec not filled yet

Runbook

Fill PROJECT-SPEC.md
Create IMPLEMENTATION_PLAN.md from the spec
Start the implementation loop