Mirror of https://github.com/karpathy/nanochat.git, synced 2026-05-07 16:30:11 +00:00
Token-id sequence matching failed because BPE admits multiple valid tokenizations of the same text, so the greedy encoder's output did not match the path the model sampled. Instead, decode gen_ids directly and search the decoded text for the marker. Batch-decoding produces complete text even when single-token decodes return empty strings.

| Name |
|---|
| _model.py |
| _tokenizer.py |
| _tools.py |
| serve.py |
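
The fix described in the commit message can be illustrated with a small sketch. Everything here is hypothetical and chosen only for illustration: the toy byte-level `VOCAB`, the `<tool>` marker, and the helpers `decode`, `greedy_encode`, and `find_subsequence` are not taken from nanochat's `_tokenizer.py` or `serve.py`. Only the name `gen_ids` comes from the commit message. The sketch assumes a decoder that silently drops incomplete UTF-8 sequences, which is one way a single-token decode can come back empty.

```python
# Toy byte-level vocabulary (hypothetical; not nanochat's real tokenizer).
VOCAB = {
    0: b"<", 1: b"tool", 2: b">", 3: b"<tool>",
    4: b"hi ", 5: b"\xc3", 6: b"\xa9",  # ids 5 and 6 together spell "é"
}

def decode(ids):
    """Join the token bytes first, then decode once; partial UTF-8 is dropped."""
    return b"".join(VOCAB[i] for i in ids).decode("utf-8", errors="ignore")

def greedy_encode(text):
    """Longest-match-first encoder, standing in for a greedy BPE encoder."""
    data, ids = text.encode("utf-8"), []
    by_length = sorted(VOCAB.items(), key=lambda kv: -len(kv[1]))
    while data:
        tok_id, tok = next((i, b) for i, b in by_length if data.startswith(b))
        ids.append(tok_id)
        data = data[len(tok):]
    return ids

def find_subsequence(haystack, needle):
    """Return the start index of needle inside haystack, or -1 if absent."""
    n = len(needle)
    for start in range(len(haystack) - n + 1):
        if haystack[start:start + n] == needle:
            return start
    return -1

MARKER = "<tool>"
# The model sampled the marker as three tokens: "<", "tool", ">".
gen_ids = [4, 0, 1, 2, 5, 6]

# Old approach: encode the marker and look for that id sequence.
marker_ids = greedy_encode(MARKER)                 # single merged token
id_match = find_subsequence(gen_ids, marker_ids)   # misses the marker

# New approach: decode everything once and search the text.
text = decode(gen_ids)
text_match = text.find(MARKER)

# Decoding ids one at a time loses the split multi-byte character.
per_token = [decode([i]) for i in gen_ids]

print(marker_ids, id_match)
print(repr(text), text_match)
print(per_token)
```

In this toy setup the id-based search misses the marker because the encoder merges it into one token while the sampled path spells it in three, whereas the text search finds it. The last two per-token decodes come back empty, yet the joint decode recovers the full character, which is why the sketch decodes all of `gen_ids` in one call.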