nanochat/README.md
2026-03-05 12:36:07 +08:00

615 B

sh

  • python -m nanochat.report reset
  • python -m scripts.tok_train --max_chars=2000000000
  • python -m scripts.tok_eval
  • torchrun --standalone --nproc_per_node=1 -m scripts.base_train -- --depth=18 --device-batch-size=1
  • torchrun --standalone --nproc_per_node=1 -m scripts.base_eval -- --device-batch-size=1
  • torchrun --standalone --nproc_per_node=1 -m scripts.chat_sft -- --device-batch-size=1
  • torchrun --standalone --nproc_per_node=1 -m scripts.chat_eval -- -i sft
  • python -m scripts.chat_cli -p "Why is the sky blue?"
  • python -m scripts.chat_web
  • python -m nanochat.report generate