nanochat/dev
2026-01-18 03:01:17 +00:00
estimate_gpt3_core.ipynb
gen_synthetic_data.py
generate_logo.html
LOG.md: log for jan 17 (2026-01-18 03:01:17 +00:00)
nanochat.png
repackage_data_reference.py
runcpu.sh: update the CPU/MPS script to give reasonable results. The model can at least answer that Paris is the capital of France and knows that the sky is blue, for about 40 minutes of training on my macbook. Also fixed a bug that existed due to KVCache bfloat16 dtype assumption (2026-01-17 12:27:30 -08:00)
scaling_analysis.ipynb