nanochat/scripts
karaage0703 e1e836763e Add Japanese language support for nanochat
- Add NANOCHAT_LANG environment variable to switch languages
- Implement JapaneseInstructTask and JCommonsenseQA tasks
- Update dataset.py to support Japanese prompts and data loading
- Add Japanese evaluation in chat_eval.py and tok_eval.py
- Include speedrun_spark_ja.sh for Japanese training runs
- Add comprehensive test suite for Japanese support
- Include Kiro specification documents (requirements, design, tasks)

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-12-01 21:29:45 +09:00
base_eval.py many small tweaks. base, eval, core work now i think 2025-10-16 15:46:18 -07:00
base_loss.py many small tweaks. base, eval, core work now i think 2025-10-16 15:46:18 -07:00
base_train.py move the lr schedule args up in base_train so they are tunable in configurator 2025-10-24 13:27:31 +00:00
chat_cli.py upgrading all other files to be able to use cpu/mps as well as cuda. various minor other changes ,e.g. changing max_iterations to num_iterations in sft script for consistency in naming 2025-10-20 10:15:17 -07:00
chat_eval.py Add Japanese language support for nanochat 2025-12-01 21:29:45 +09:00
chat_rl.py initial commit 2025-10-13 06:49:24 -07:00
chat_sft.py Add Japanese language support for nanochat 2025-12-01 21:29:45 +09:00
chat_web.py upgrading all other files to be able to use cpu/mps as well as cuda. various minor other changes ,e.g. changing max_iterations to num_iterations in sft script for consistency in naming 2025-10-20 10:15:17 -07:00
mid_train.py add the SpellingBee task so that nanochat can count r in strawberry etc. along the way we had to add a bunch of new functionality, e.g. extend the calculator to support the count function of python. possibly the current TaskMixture uses way too many synthetic examples of SpellingBee because the eval gives us exactly 100% performance on spelling. We can tune this later to reclaim some wall clock time here I think 2025-10-24 14:02:48 +00:00
tok_eval.py Add Japanese language support for nanochat 2025-12-01 21:29:45 +09:00
tok_train.py initial commit 2025-10-13 06:49:24 -07:00