mirror of
https://github.com/karpathy/nanochat.git
synced 2026-04-02 05:35:19 +00:00
Without --model-tag, chat_sft/chat_cli/chat_web/base_eval can pick the wrong model when multiple models exist in the cache. Add explicit --model-tag=d6 (runcpu) and --model-tag=d24 (speedrun) matching the depth used in each script's base_train call. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> |
||
|---|---|---|
| .. | ||
| miniseries.sh | ||
| runcpu.sh | ||
| scaling_laws.sh | ||
| speedrun.sh | ||