nanochat/nanochat
2026-01-04 19:14:23 +00:00
..
__init__.py initial commit 2025-10-13 06:49:24 -07:00
adamw.py slightly nicer error message 2025-12-09 12:46:48 +01:00
checkpoint_manager.py rename checkpoint_dir to checkpoints_dir for consistency. 2025-12-08 18:32:12 -08:00
common.py fix: safe DDP cleanup (check initialized PG, not just env) (#256) 2025-12-27 20:27:40 -08:00
core_eval.py initial commit 2025-10-13 06:49:24 -07:00
dataloader.py feat: pad vocab size to 64 for DDP optimizers and efficiency 2025-12-09 12:38:18 +01:00
dataset.py initial commit 2025-10-13 06:49:24 -07:00
engine.py delete the configurator in favor of argparse and clean up a lot of kwarg details to make them more consistent across all scripts 2026-01-04 19:14:23 +00:00
execution.py nit delete redundant catch/raise in execute 2025-10-29 08:10:03 -07:00
gpt.py simplify, clarify and slightly tune model initialization. should be very slightly better possibly, but certainly a lot clearer 2026-01-01 21:15:09 +00:00
logo.svg initial commit 2025-10-13 06:49:24 -07:00
loss_eval.py fix typos 2025-11-14 11:20:25 +01:00
muon.py initial commit 2025-10-13 06:49:24 -07:00
report.py fix small bug where this would break if git stage has deleted files 2026-01-04 19:11:43 +00:00
tokenizer.py allow the tokenizer visualize_tokenization to also print the exact token id. you can never be paranoid enough 2025-10-24 13:27:05 +00:00
ui.html Fix conversation scroll to bottom on some browsers + remove duplicated padding (#348) 2025-12-31 13:03:22 -08:00