Research-grade DonkeyCar RL autoresearch and sweep system.
Go to file
Paul Huliganga 1d53bf613f feat(exp29): fine-tune wave4-trial-0009 on generated track (continuous actions)
Warm-starts from wave4-trial-0009/model.zip (best mini-monaco model, completed
laps). Fine-tunes on generated track with continuous Box action space preserved
(no DiscretizedActionWrapper) at LR=0.00005. 50k steps, checkpoint every 5k,
zero-shot mini-monaco eval at end.

Tests whether additional generated-track exposure improves corner handling on
mini-monaco without catastrophic forgetting of driving skill.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-14 15:32:43 -04:00
.harness feat: Wave 1 complete — real PPO training, model save, GP+UCB autoresearch, 37 tests passing 2026-04-13 10:03:15 -04:00
agent feat(exp29): fine-tune wave4-trial-0009 on generated track (continuous actions) 2026-05-14 15:32:43 -04:00
docs feat: add exp17 parallel DummyVecEnv 450k training + strategy docs 2026-04-28 02:42:20 -04:00
tests fix(core): replace exploit bandaids with solid physics barriers + clean reward 2026-05-05 15:56:00 -04:00
.gitignore chore: add CLAUDE.md project instructions + exclude .chat/ from git 2026-05-14 15:32:04 -04:00
AGENT.md feat(exp22): add solid-hit/wedge/high-CTE exploit fixes and generated-pair warm experiments 2026-05-05 14:46:13 -04:00
CLAUDE.md chore: add CLAUDE.md project instructions + exclude .chat/ from git 2026-05-14 15:32:04 -04:00
DECISIONS.md feat: add exp17 parallel DummyVecEnv 450k training + strategy docs 2026-04-28 02:42:20 -04:00
IMPLEMENTATION_PLAN.md feat: Phase 3 — behavioral control, enhanced evaluator, 53 tests 2026-04-14 09:28:43 -04:00
PROJECT-KICKOFF.md feat: Wave 1 complete — real PPO training, model save, GP+UCB autoresearch, 37 tests passing 2026-04-13 10:03:15 -04:00
PROJECT-SPEC.md feat: Wave 1 complete — real PPO training, model save, GP+UCB autoresearch, 37 tests passing 2026-04-13 10:03:15 -04:00
README.md feat: Wave 1 complete — real PPO training, model save, GP+UCB autoresearch, 37 tests passing 2026-04-13 10:03:15 -04:00
create_gitea_repo.py Initial commit 2026-04-12 23:44:36 -04:00
monitor_training.sh fix: exp19 — hard episode time limit to stop minutes-long stuck cars 2026-04-28 09:18:04 -04:00
ralph-loop.sh feat: Wave 1 complete — real PPO training, model save, GP+UCB autoresearch, 37 tests passing 2026-04-13 10:03:15 -04:00

README.md

donkeycar-rl-autoresearch

Purpose

Status

  • Scaffolded with the agent harness
  • Spec not filled yet

Runbook

  • Fill PROJECT-SPEC.md
  • Create IMPLEMENTATION_PLAN.md from the spec
  • Start the implementation loop