Commit Graph

13 Commits

Author SHA1 Message Date
Azekowka
557b2d5840 feat(engine.py): Sample unique tokens per row in generation stream
Previously, when a batch generation started, the first sampled token was broadcast to all rows. This change samples a separate token for each row, which improves the diversity and independence of the generated sequences within a batch.
2025-10-14 17:33:47 +05:00
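The difference described in this commit can be illustrated with a small standard-library sketch. It is not the code from engine.py: the function names, the toy probability vector, and the use of `random.Random` are all assumptions made for illustration. Note that "unique" is read here as "sampled independently per row", so two rows may still happen to draw the same token.

```python
import random


def broadcast_first_token(probs, num_rows, rng):
    """Old behaviour as described: sample once, copy the token to every row."""
    token_ids = list(range(len(probs)))
    token = rng.choices(token_ids, weights=probs, k=1)[0]
    return [token] * num_rows


def sample_first_tokens(probs, num_rows, rng):
    """New behaviour as described: draw one token per row, independently."""
    token_ids = list(range(len(probs)))
    return rng.choices(token_ids, weights=probs, k=num_rows)


# Hypothetical 4-token vocabulary with a uniform distribution.
rng = random.Random(0)
probs = [0.25, 0.25, 0.25, 0.25]
shared = broadcast_first_token(probs, 8, rng)
independent = sample_first_tokens(probs, 8, rng)
```

With the broadcast variant every row starts from the same token, so the rows of a batch are correlated from the first step; with per-row sampling each row starts from its own draw.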
Azekowka
9a08bb4edb Reapply "Refactor: Improve DDP detection in common.py"
This reverts commit 1a45e5b78a.
2025-10-14 17:09:04 +05:00
Azekowka
1a45e5b78a Revert "Refactor: Improve DDP detection in common.py"
This reverts commit b724190f2a.
2025-10-14 17:04:41 +05:00
Azekowka
b724190f2a Refactor: Improve DDP detection in common.py
2025-10-14 16:59:57 +05:00
Azekowka
dd6812c83e Refactor: Remove pandas dependency from base_eval.py
This commit addresses the TODO item in scripts/base_eval.py by removing the dependency on the pandas library. Specifically, the pd.read_csv call used to load eval_meta_data.csv has been replaced with Python's built-in csv module.
2025-10-14 16:52:26 +05:00
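A minimal sketch of the kind of replacement this commit describes, reading a CSV with the built-in csv module instead of `pd.read_csv`. The helper name and the column names `task` and `score` are hypothetical; the actual layout of eval_meta_data.csv is not shown in this log.

```python
import csv
import io


def read_csv_rows(fileobj):
    """Return one dict per CSV row, keyed by the header names."""
    return list(csv.DictReader(fileobj))


# In-memory stand-in for a CSV file; the columns are assumptions.
sample = io.StringIO("task,score\nfoo,0.25\nbar,0.5\n")
rows = read_csv_rows(sample)

# csv yields strings, so numeric columns must be converted explicitly,
# which pandas would otherwise do automatically.
scores = {row["task"]: float(row["score"]) for row in rows}
```

For a real file, the same helper could be called inside `with open(path, newline="", encoding="utf-8") as f:`, which is the opening mode the csv module documentation recommends.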
Andrej
dd6ff9a1cc fix bug in fallback case of find_largest_model
Fix: Handle missing d<number> model tags in find_largest_model
ty
2025-10-13 14:38:34 -07:00
Mirza-Samad-Ahmed-Baig
afaa5b4c90 Fix: Handle missing d<number> model tags in find_largest_model
2025-10-14 00:24:07 +03:00
Andrej
5fd0b13886 Merge pull request #2 from epoyraz/patch-1
Update README.md
2025-10-13 10:10:15 -07:00
Enes Poyraz
6a795baf27 Update README.md
fix typos
2025-10-13 18:40:12 +02:00
Andrej
626bd3e260 Add image of the WebUI to readme
2025-10-13 08:03:00 -07:00
karpathy
da96b46565 update link to the new discussion
2025-10-13 07:42:09 -07:00
karpathy
a53833d04f add nanochat logo png
2025-10-13 06:59:59 -07:00
karpathy
3a5e0bc50b initial commit
2025-10-13 06:49:24 -07:00