Previously circles ran 20+ seconds because the efficiency gate only returned 0 reward without terminating. After 20 consecutive steps of efficiency < 0.15 (~0.7 seconds at 27 steps/sec), episode now terminates with -1.0. Also confirmed from telemetry diagnostic: CTE does report correctly when car goes off-track (rises steadily to 6.2m before tree collision). The grass exploit runs long only when the open grass area has no obstacles. Efficiency gate termination is the most reliable catch for both circles and open-grass driving (straight-line grass = high efficiency, but active_node progress terminator catches that case). |
||
|---|---|---|
| .harness | ||
| agent | ||
| docs | ||
| tests | ||
| .gitignore | ||
| AGENT.md | ||
| DECISIONS.md | ||
| IMPLEMENTATION_PLAN.md | ||
| PROJECT-KICKOFF.md | ||
| PROJECT-SPEC.md | ||
| README.md | ||
| create_gitea_repo.py | ||
| ralph-loop.sh | ||
README.md
donkeycar-rl-autoresearch
Purpose
Status
- Scaffolded with the agent harness
- Spec not filled yet
Runbook
- Fill PROJECT-SPEC.md
- Create IMPLEMENTATION_PLAN.md from the spec
- Start the implementation loop