Hossein-Lakzaei
f9dd11fefe
Enhance error handling in dataset and training scripts
...
- Update file removal logic in dataset.py to log warnings for OSError and PermissionError.
- Improve assertion messages in gpt.py, base_train.py, mid_train.py, chat_rl.py, tok_eval.py, tok_train.py, and test_rustbpe.py to provide clearer context on assertion failures.
2025-10-15 17:15:59 +03:30
Hossein-Lakzaei
bfd8d21313
Update architecture diagram in nanochat_architecture.jpg
2025-10-15 13:45:21 +03:30
Hossein-Lakzaei
2167425dba
Merge branch 'karpathy:master' into master
2025-10-15 09:44:03 +03:30
Hossein-Lakzaei
f4efbe2825
Merge branch 'master' of https://github.com/HosLak/nanochat
2025-10-15 09:00:44 +03:30
Hossein-Lakzaei
9ff69c99b9
Replace architecture diagram in README with a new JPG format image and remove the old PNG file.
2025-10-15 09:00:38 +03:30
Andrej
67aaca98f5
export NANOCHAT_BASE_DIR so child processes get it too
...
Export the cache directory so that users can use their own cache location
2025-10-14 16:01:28 -07:00
Hossein-Lakzaei
267676fb7e
Update README.md
2025-10-15 01:07:08 +03:30
Hossein-Lakzaei
03398ce70a
Enhance README by consolidating LLM implementation description and removing redundancy
2025-10-15 00:46:25 +03:30
Hossein-Lakzaei
d7a9e5d3b2
Add architecture overview section to README and include architecture diagram
2025-10-15 00:42:52 +03:30
Zach Mueller
f0855cbcc7
Update speedrun.sh
2025-10-14 14:12:01 -04:00
Andrej
dd6ff9a1cc
fix bug in fallback case of find_largest_model
...
Fix: Handle missing d<number> model tags in find_largest_model
ty
2025-10-13 14:38:34 -07:00
Mirza-Samad-Ahmed-Baig
afaa5b4c90
Fix: Handle missing d<number> model tags in find_largest_model
2025-10-14 00:24:07 +03:00
Andrej
5fd0b13886
Merge pull request #2 from epoyraz/patch-1
...
Update README.md
2025-10-13 10:10:15 -07:00
Enes Poyraz
6a795baf27
Update README.md
...
fix typos
2025-10-13 18:40:12 +02:00
Andrej
626bd3e260
Add image of the WebUI to readme
2025-10-13 08:03:00 -07:00
karpathy
da96b46565
update link to the new discussion
2025-10-13 07:42:09 -07:00
karpathy
a53833d04f
add nanochat logo png
2025-10-13 06:59:59 -07:00
karpathy
3a5e0bc50b
initial commit
2025-10-13 06:49:24 -07:00