nanochat

mirror of https://github.com/karpathy/nanochat.git synced 2026-03-21 12:23:13 +00:00

William Thurston 9550053cc1 Enhance model tagging support in training and evaluation scripts	2025-11-10 19:45:02 -08:00
- Added model tagging functionality to `runmps.sh`, allowing for dynamic model tagging based on the W&B run name.
- Updated `base_train.py`, `mid_train.py`, and `chat_sft.py` to utilize model tags for checkpoint management.
- Enhanced `base_eval.py` to accept model tags for loading models during evaluation.
- Improved handling of model tags to ensure proper checkpoint directory naming and logging.
base_eval.py	Enhance model tagging support in training and evaluation scripts	2025-11-10 19:45:02 -08:00
base_loss.py	Add scripts for running evaluations and training with W&B integration	2025-11-05 11:49:50 -08:00
base_train.py	Enhance model tagging support in training and evaluation scripts	2025-11-10 19:45:02 -08:00
chat_cli.py	upgrading all other files to be able to use cpu/mps as well as cuda. various other minor changes, e.g. changing max_iterations to num_iterations in sft script for consistency in naming	2025-10-20 10:15:17 -07:00
chat_eval.py	Add scripts for running evaluations and training with W&B integration	2025-11-05 11:49:50 -08:00
chat_rl.py	initial commit	2025-10-13 06:49:24 -07:00
chat_sft.py	Enhance model tagging support in training and evaluation scripts	2025-11-10 19:45:02 -08:00
chat_web.py	upgrading all other files to be able to use cpu/mps as well as cuda. various other minor changes, e.g. changing max_iterations to num_iterations in sft script for consistency in naming	2025-10-20 10:15:17 -07:00
mid_train.py	Enhance model tagging support in training and evaluation scripts	2025-11-10 19:45:02 -08:00
tok_eval.py	initial commit	2025-10-13 06:49:24 -07:00
tok_train.py	initial commit	2025-10-13 06:49:24 -07:00