Hossein-Lakzaei
22f8d02345
Enhance error handling in dataset and training scripts
...
- Update file removal logic in dataset.py to log warnings for OSError and PermissionError.
- Improve assertion messages in gpt.py, base_train.py, mid_train.py, chat_rl.py, tok_eval.py, tok_train.py, and test_rustbpe.py to provide clearer context on assertion failures.
2025-10-15 18:34:05 +03:30
Hossein-Lakzaei
5777e51288
Update architecture diagram in nanochat_architecture.jpg
2025-10-15 18:34:04 +03:30
Zach Mueller
2724255f2e
Update speedrun.sh
2025-10-15 18:34:04 +03:30
Hossein-Lakzaei
71c6f47215
Update README.md
2025-10-15 18:34:04 +03:30
Hossein-Lakzaei
d2ddd9bf58
Replace architecture diagram in README with a new JPG format image and remove the old PNG file.
2025-10-15 18:34:04 +03:30
Hossein-Lakzaei
adc34e86b7
Enhance README by consolidating LLM implementation description and removing redundancy
2025-10-15 18:34:04 +03:30
Hossein-Lakzaei
b86d86df17
Add architecture overview section to README and include architecture diagram
2025-10-15 18:34:04 +03:30
Mirza-Samad-Ahmed-Baig
fa6521f7d0
Fix: Handle missing d<number> model tags in find_largest_model
2025-10-15 18:34:03 +03:30
Enes Poyraz
6a795baf27
Update README.md
...
fix typos
2025-10-13 18:40:12 +02:00
Andrej
626bd3e260
Add image of the WebUI to readme
2025-10-13 08:03:00 -07:00
karpathy
da96b46565
update link to the new discussion
2025-10-13 07:42:09 -07:00
karpathy
a53833d04f
add nanochat logo png
2025-10-13 06:59:59 -07:00
karpathy
3a5e0bc50b
initial commit
2025-10-13 06:49:24 -07:00