donkeycar-rl-autoresearch/docs
Paul Huliganga b19dcc8b80 feat: run_eval.py — standard eval runner with persistent logging
Every test run now saves to agent/test-results/YYYY-MM-DD_HH-MM_<model>.log
so results are never lost. Also added 3-set Exp9 eval results to TEST_HISTORY.

Usage:
  python3 agent/run_eval.py --model models/exp9-.../best_model.zip --sets 3

Agent: pi
Tests: 102 passed
Tests-Added: 0
TypeScript: N/A
2026-04-18 15:32:36 -04:00
..
track-screenshots wave3: add multi-track autoresearch system (83 tests passing) 2026-04-14 12:47:12 -04:00
ARCHITECTURE.md docs: ARCHITECTURE.md — complete system architecture guide 2026-04-17 14:06:38 -04:00
RESEARCH_LOG.md wave3: add multi-track autoresearch system (83 tests passing) 2026-04-14 12:47:12 -04:00
STATE.md docs: STATE.md updated with April 16 test results 2026-04-16 20:45:45 -04:00
TEST_HISTORY.md feat: run_eval.py — standard eval runner with persistent logging 2026-04-18 15:32:36 -04:00