mirror of
https://github.com/karpathy/nanochat.git
synced 2026-04-02 05:35:19 +00:00
Resolve conflicts in gpt.py and base_train.py: - gpt.py: SP path uses Andrej's exact code, muP path layers our changes on top - base_train.py: adopt Andrej's new defaults (warmup-steps, warmdown-ratio, etc.), keep --use-mup/--base-width args Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> |
||
|---|---|---|
| .. | ||
| base_eval.py | ||
| base_train.py | ||
| chat_cli.py | ||
| chat_eval.py | ||
| chat_rl.py | ||
| chat_sft.py | ||
| chat_web.py | ||
| mup_coord_check.py | ||
| mup_transfer_check.py | ||
| tok_eval.py | ||
| tok_train.py | ||