nanochat/scripts
aarushisingh04 8f6e616d12 Merge branch 'master' into topk-validation-fix
# Please enter a commit message to explain why this merge is necessary,
# especially if it merges an updated upstream into a topic branch.
#
# Lines starting with '#' will be ignored, and an empty message aborts
# the commit.
2026-01-23 18:18:41 +05:30
..
base_eval.py bugfix 2025-12-26 19:02:12 +08:00
base_loss.py update the CPU/MPS script to give reasonable results. The model can at least answer that Paris is the capital of France and knows that the sky is blue, for about 40 minutes of training on my macbook. Also fixed a bug that existed due to KVCache bfloat16 dtype assumption 2026-01-17 12:27:30 -08:00
base_train.py Merge branch 've' 2026-01-18 15:14:39 +00:00
chat_cli.py upgrading all other files to be able to use cpu/mps as well as cuda. various minor other changes ,e.g. changing max_iterations to num_iterations in sft script for consistency in naming 2025-10-20 10:15:17 -07:00
chat_eval.py Fix args in readme (#438) 2026-01-15 16:26:38 -08:00
chat_rl.py typo in comments: change "GAPO" to "DAPO" 2026-01-15 22:03:42 -08:00
chat_sft.py fix buggy midtrain and update all kwargs to be idiomatic. that is, argparse uses dashes variables use underscores. the underscores are just a remnant of the previous Configurator object. This is the right way 2026-01-13 22:45:27 +00:00
chat_web.py allow top_k=0 in web api to disable filtering 2026-01-08 20:22:16 +05:30
mid_train.py fix condition to perform bpb evaluation (#324) 2026-01-16 18:56:43 -08:00
tok_eval.py initial commit 2025-10-13 06:49:24 -07:00
tok_train.py fix buggy midtrain and update all kwargs to be idiomatic. that is, argparse uses dashes variables use underscores. the underscores are just a remnant of the previous Configurator object. This is the right way 2026-01-13 22:45:27 +00:00