| .. |
|
model-000
|
Initial commit: stable RL sweep runner, legacy and new scripts, full docs included
|
2026-04-12 22:57:50 -04:00 |
|
model-001
|
Initial commit: stable RL sweep runner, legacy and new scripts, full docs included
|
2026-04-12 22:57:50 -04:00 |
|
model-002
|
Initial commit: stable RL sweep runner, legacy and new scripts, full docs included
|
2026-04-12 22:57:50 -04:00 |
|
model-003
|
Initial commit: stable RL sweep runner, legacy and new scripts, full docs included
|
2026-04-12 22:57:50 -04:00 |
|
autoresearch_log.txt
|
AUTORESEARCH: 300 total trials complete - best mean_reward=141.85 at n_steer=8, n_throttle=5, lr=0.00202
|
2026-04-13 01:56:06 -04:00 |
|
autoresearch_phase1_log.txt
|
autoresearch: phase1 trial 10 results
|
2026-04-13 13:11:06 -04:00 |
|
autoresearch_phase1_log_CORRUPTED_reward_hacking.txt
|
fix: hack-proof reward shaping + reward hacking detection + research log
|
2026-04-13 12:27:48 -04:00 |
|
autoresearch_results.jsonl
|
AUTORESEARCH: 300 total trials complete - best mean_reward=141.85 at n_steer=8, n_throttle=5, lr=0.00202
|
2026-04-13 01:56:06 -04:00 |
|
autoresearch_results_phase1.jsonl
|
autoresearch: phase1 trial 10 results
|
2026-04-13 13:11:06 -04:00 |
|
autoresearch_results_phase1_CORRUPTED_reward_hacking.jsonl
|
fix: hack-proof reward shaping + reward hacking detection + research log
|
2026-04-13 12:27:48 -04:00 |
|
clean_sweep_results.jsonl
|
AUTORESEARCH: Full Karpathy-style GP+UCB meta-controller, clean base data, fixed all paths, ready to run
|
2026-04-13 00:52:00 -04:00 |
|
nohup_outerloop.log
|
Initial commit: stable RL sweep runner, legacy and new scripts, full docs included
|
2026-04-12 22:57:50 -04:00 |
|
outer_monitor.log
|
Initial commit: stable RL sweep runner, legacy and new scripts, full docs included
|
2026-04-12 22:57:50 -04:00 |
|
sweep_results.jsonl
|
Initial commit: stable RL sweep runner, legacy and new scripts, full docs included
|
2026-04-12 22:57:50 -04:00 |