nanochat/runs
2026-02-08 18:26:34 +00:00
..
miniseries.sh at 28 and above we start to need batch size 8 2026-02-08 18:26:34 +00:00
runcpu.sh merge two files base_loss and base_eval into a single file, it's nicer this way, and unify the huggingface code associated with both 2026-02-01 02:36:43 +00:00
scaling_laws.sh add engram-lite, add log, tune scaling laws analysis scripts 2026-01-27 22:31:17 +00:00
speedrun.sh new optimal ratio for d26 training 2026-02-06 19:21:27 +00:00