nanochat/scripts
2026-02-17 20:19:21 -04:00
..
base_eval.py CORE eval: batched forwarding by default, per-example mode for verification 2026-02-13 08:42:45 +00:00
base_train.py Merge 4f79e750e7 into 4800c62f6e 2026-02-17 20:19:21 -04:00
bench_core_eval.py CORE eval: batched forwarding by default, per-example mode for verification 2026-02-13 08:42:45 +00:00
chat_cli.py remove leftover mid references (#491) 2026-02-02 08:33:46 -08:00
chat_eval.py remove leftover mid references (#491) 2026-02-02 08:33:46 -08:00
chat_rl.py remove leftover mid references (#491) 2026-02-02 08:33:46 -08:00
chat_sft.py tune the data mixture a bit, load optimizer by default when SFT. These were confirmed to be best settings from sweeps of sft 2026-02-16 20:23:04 +00:00
chat_web.py remove leftover mid references (#491) 2026-02-02 08:33:46 -08:00
tok_eval.py initial commit 2025-10-13 06:49:24 -07:00
tok_train.py quick fix to not OOM main speedrun script 2026-01-26 22:31:42 +00:00