nanochat/scripts
2026-02-03 13:57:19 +01:00
base_eval.py merge two files base_loss and base_eval into a single file, it's nicer this way, and unify the huggingface code associated with both 2026-02-01 02:36:43 +00:00
base_train.py manually control the over-active garbage collector, save a small few minutes from a typical run 2026-02-02 01:44:30 +00:00
chat_cli.py remove leftover mid references (#491) 2026-02-02 08:33:46 -08:00
chat_eval.py remove leftover mid references (#491) 2026-02-02 08:33:46 -08:00
chat_rl.py remove leftover mid references (#491) 2026-02-02 08:33:46 -08:00
chat_sft.py fix bug in chat_sft, the attention window must be preserved sigh 2026-02-01 20:58:44 +00:00
chat_web.py remove leftover mid references (#491) 2026-02-02 08:33:46 -08:00
tok_eval.py Fix relative difference sign in scripts/tok_eval.py 2025-11-23 17:51:18 +01:00
tok_train.py quick fix to not OOM main speedrun script 2026-01-26 22:31:42 +00:00