nanochat/scripts
2026-01-18 15:31:47 +00:00
..
base_eval.py add explicit UTF-8 encoding 2025-11-03 21:27:12 +01:00
base_loss.py many small tweaks. base, eval, core work now i think 2025-10-16 15:46:18 -07:00
base_train.py global bs and d_model configurable 2026-01-10 02:30:44 +00:00
chat_cli.py upgrading all other files to be able to use cpu/mps as well as cuda. various minor other changes ,e.g. changing max_iterations to num_iterations in sft script for consistency in naming 2025-10-20 10:15:17 -07:00
chat_eval.py midtraining, sft, rl scripts and the final version of the nanochat-Mo 2026-01-18 15:31:47 +00:00
chat_rl.py midtraining, sft, rl scripts and the final version of the nanochat-Mo 2026-01-18 15:31:47 +00:00
chat_sft.py midtraining, sft, rl scripts and the final version of the nanochat-Mo 2026-01-18 15:31:47 +00:00
chat_web.py ensure consistency of quotes within each statement 2025-11-03 21:52:02 +01:00
mid_train.py midtraining, sft, rl scripts and the final version of the nanochat-Mo 2026-01-18 15:31:47 +00:00
tok_eval.py initial commit 2025-10-13 06:49:24 -07:00
tok_train.py initial commit 2025-10-13 06:49:24 -07:00