| .. |
|
ARCHIVED_reward_hacking/champion_hacked
|
fix: hack-proof reward shaping + reward hacking detection + research log
|
2026-04-13 12:27:48 -04:00 |
|
champion
|
feat: Phase 3 — behavioral control, enhanced evaluator, 53 tests
|
2026-04-14 09:28:43 -04:00 |
|
exp14-mountain-v5-finetune
|
docs: capture robust mountain finetune winner at 36k and preserve eval comparison
|
2026-04-20 00:43:27 -04:00 |
|
exp20-parallel-450k-v5
|
feat(exp22): add solid-hit/wedge/high-CTE exploit fixes and generated-pair warm experiments
|
2026-05-05 14:46:13 -04:00 |
|
exp20-parallel-450k-v5_pre-fix_2026-04-28_163923
|
feat(exp22): add solid-hit/wedge/high-CTE exploit fixes and generated-pair warm experiments
|
2026-05-05 14:46:13 -04:00 |
|
exp21-generated-pair-warm-v4
|
feat(exp22): add solid-hit/wedge/high-CTE exploit fixes and generated-pair warm experiments
|
2026-05-05 14:46:13 -04:00 |
|
exp22-generated-pair-warm-v6
|
chore(exp22): update wedgefix run log — training stopped for strategy rethink
|
2026-05-05 15:36:18 -04:00 |
|
exp23-generated-road-clean
|
chore(exp23): launched — clean barriers verified, training started
|
2026-05-05 16:04:21 -04:00 |
|
exp26-warmstart
|
feat(exp26): warm-start training from exp25 best_model (300k steps)
|
2026-05-14 15:32:16 -04:00 |
|
exp27-random-roads
|
feat(exp27): random roads with variable throttle + road regen + self-intersection fix
|
2026-05-14 15:32:32 -04:00 |
|
exp28-gentrack-finetune
|
feat(exp28): fine-tune exp26 best_model on generated-track with variable throttle
|
2026-05-14 15:32:37 -04:00 |
|
wave3-champion
|
wave3: autoresearch trial 5 results
|
2026-04-14 18:22:44 -04:00 |
|
wave4-champion
|
wave3: autoresearch trial 5 results
|
2026-04-15 07:15:57 -04:00 |
|
eval_best_models_20260506_102952.log
|
feat(eval): cross-model evaluation scripts for exp24/25/26 + gentrack→minimonaco
|
2026-05-14 15:32:21 -04:00 |
|
eval_gentrack_minimonaco_20260506_184305.log
|
feat(eval): cross-model evaluation scripts for exp24/25/26 + gentrack→minimonaco
|
2026-05-14 15:32:21 -04:00 |
|
eval_gentrack_minimonaco_20260506_184636.log
|
feat(eval): cross-model evaluation scripts for exp24/25/26 + gentrack→minimonaco
|
2026-05-14 15:32:21 -04:00 |
|
eval_gentrack_minimonaco_20260506_184902.log
|
feat(eval): cross-model evaluation scripts for exp24/25/26 + gentrack→minimonaco
|
2026-05-14 15:32:21 -04:00 |
|
eval_gentrack_minimonaco_20260506_211519.log
|
feat(eval): cross-model evaluation scripts for exp24/25/26 + gentrack→minimonaco
|
2026-05-14 15:32:21 -04:00 |
|
eval_gentrack_minimonaco_20260506_212714.log
|
feat(eval): cross-model evaluation scripts for exp24/25/26 + gentrack→minimonaco
|
2026-05-14 15:32:21 -04:00 |