nanochat/scripts
2026-01-07 22:11:59 +00:00
..
base_eval.py
base_loss.py delete the configurator in favor of argparse and clean up a lot of kwarg details to make them more consistent across all scripts 2026-01-04 19:14:23 +00:00
base_train.py nudge hyperparameters of the base script with the results of the sweeps and miniseries. vocab size down to 32K. D:N ratio from 20 to 8. add miniseries script 2026-01-07 22:11:59 +00:00
chat_cli.py
chat_eval.py
chat_rl.py Fix undefined variable in chat_rl after recent refactor 2026-01-07 09:08:57 -08:00
chat_sft.py delete the configurator in favor of argparse and clean up a lot of kwarg details to make them more consistent across all scripts 2026-01-04 19:14:23 +00:00
chat_web.py
mid_train.py delete the configurator in favor of argparse and clean up a lot of kwarg details to make them more consistent across all scripts 2026-01-04 19:14:23 +00:00
tok_eval.py
tok_train.py nudge hyperparameters of the base script with the results of the sweeps and miniseries. vocab size down to 32K. D:N ratio from 20 to 8. add miniseries script 2026-01-07 22:11:59 +00:00