nanochat/dev
2026-01-15 19:11:28 +01:00
..
estimate_gpt3_core.ipynb add notebook on deriving the CORE estimates for the GPT-3 miniseries. 2026-01-05 18:40:28 +00:00
gen_synthetic_data.py sane secrets management 2026-01-04 19:29:22 +00:00
generate_logo.html
LOG.md add negative result about not allowing attention across BOS tokens. A lot more code complexity for basically no gain in performance 2026-01-13 21:33:54 +00:00
nanochat.png
repackage_data_reference.py
runcpu.sh Merge branch 'master' into fix/shard_count 2026-01-15 19:11:28 +01:00
scaling_analysis.ipynb add notebook used for scaling laws analysis 2026-01-07 22:28:53 +00:00