nanochat

mirror of https://github.com/karpathy/nanochat.git synced 2026-03-21 12:23:13 +00:00

Author	SHA1	Message	Date
William Thurston	9550053cc1	Enhance model tagging support in training and evaluation scripts - Added model tagging functionality to `runmps.sh`, allowing for dynamic model tagging based on the W&B run name. - Updated `base_train.py`, `mid_train.py`, and `chat_sft.py` to utilize model tags for checkpoint management. - Enhanced `base_eval.py` to accept model tags for loading models during evaluation. - Improved handling of model tags to ensure proper checkpoint directory naming and logging.	2025-11-10 19:45:02 -08:00
William Thurston	b1d49aade5	Add scripts for running evaluations and training with W&B integration - Added `dev/runmps_evals.sh` for evaluating checkpoints and logging results to W&B. - Introduced `dev/runmps.sh` for orchestrating training stages with W&B support. - Updated `.gitignore` to include `wandb/` and `.runmps_wandb_ids`. - Changed permissions for `dev/runcpu.sh` and added executable flag. - Enhanced existing scripts to log metrics to W&B during training and evaluation processes.	2025-11-05 11:49:50 -08:00
karpathy	df600b6ed5	many small tweaks. base, eval, core work now i think	2025-10-16 15:46:18 -07:00
karpathy	786119d593	add autodetect of device and related stuff. getting weird warnings/errors still, so wip	2025-10-16 10:26:19 -07:00
karpathy	3a5e0bc50b	initial commit	2025-10-13 06:49:24 -07:00