This website requires JavaScript.
Explore
Help
Sign In
tacit
/
nanochat
Watch
1
Star
0
Fork
0
You've already forked nanochat
mirror of
https://github.com/karpathy/nanochat.git
synced
2026-02-17 00:50:23 +00:00
Code
Issues
Actions
Packages
Projects
Releases
Wiki
Activity
181e7f1c15
nanochat
/
tests
History
Kartik Vashishta
181e7f1c15
Fix SDPA KV-cache for per-row cache_seqlens
2026-02-01 18:36:07 +07:00
..
test_attention_fallback.py
Fix SDPA KV-cache for per-row cache_seqlens
2026-02-01 18:36:07 +07:00
test_engine.py
update the CPU/MPS script to give reasonable results. The model can at least answer that Paris is the capital of France and knows that the sky is blue, for about 40 minutes of training on my macbook. Also fixed a bug that existed due to KVCache bfloat16 dtype assumption
2026-01-17 12:27:30 -08:00