nanochat/dev
William Thurston 9550053cc1 Enhance model tagging support in training and evaluation scripts
- Added model tagging functionality to `runmps.sh`, allowing for dynamic model tagging based on the W&B run name.
- Updated `base_train.py`, `mid_train.py`, and `chat_sft.py` to utilize model tags for checkpoint management.
- Enhanced `base_eval.py` to accept model tags for loading models during evaluation.
- Improved handling of model tags to ensure proper checkpoint directory naming and logging.
2025-11-10 19:45:02 -08:00
..
gen_synthetic_data.py add personality to nanochat. breaks previous code on git pull and requires download of a new file from s3, but there is a helpful error message so hopefully its ok 2025-10-21 15:04:58 +00:00
generate_logo.html initial commit 2025-10-13 06:49:24 -07:00
nanochat.png add nanochat logo png 2025-10-13 06:59:59 -07:00
repackage_data_reference.py initial commit 2025-10-13 06:49:24 -07:00
runcpu.sh Add scripts for running evaluations and training with W&B integration 2025-11-05 11:49:50 -08:00
runmps_evals.sh Add scripts for running evaluations and training with W&B integration 2025-11-05 11:49:50 -08:00
runmps.sh Enhance model tagging support in training and evaluation scripts 2025-11-10 19:45:02 -08:00