• Joined on 2024-05-31
tacit synced and deleted reference refs/tags/refs/pull/396/merge at tacit/nanochat from mirror 2026-01-17 08:54:02 +00:00
tacit synced and deleted reference refs/tags/refs/pull/324/merge at tacit/nanochat from mirror 2026-01-17 08:54:02 +00:00
tacit synced and deleted reference refs/tags/refs/pull/253/merge at tacit/nanochat from mirror 2026-01-17 08:54:02 +00:00
tacit synced and deleted reference refs/tags/refs/pull/147/merge at tacit/nanochat from mirror 2026-01-17 08:54:02 +00:00
tacit synced and deleted reference refs/tags/refs/pull/407/merge at tacit/nanochat from mirror 2026-01-17 08:54:02 +00:00
tacit synced commits to refs/pull/437/merge at tacit/nanochat from mirror 2026-01-17 00:43:51 +00:00
184d4c12b1 also add to log about the FA3 changes
b62a5bc44a naturally i failed to include the actual code in the previous commit facepalm
8203efa919 implement flash attention 3 fallback to pytorch sdpa by touching as few lines of code as possible in main files and keeping all implementation to a single file. add tests. add helpful warning messages for the user.
Compare 4 commits »
tacit synced commits to refs/pull/429/merge at tacit/nanochat from mirror 2026-01-17 00:43:51 +00:00
184d4c12b1 also add to log about the FA3 changes
b62a5bc44a naturally i failed to include the actual code in the previous commit facepalm
8203efa919 implement flash attention 3 fallback to pytorch sdpa by touching as few lines of code as possible in main files and keeping all implementation to a single file. add tests. add helpful warning messages for the user.
Compare 4 commits »
tacit synced commits to refs/pull/311/merge at tacit/nanochat from mirror 2026-01-17 00:43:51 +00:00
184d4c12b1 also add to log about the FA3 changes
b62a5bc44a naturally i failed to include the actual code in the previous commit facepalm
8203efa919 implement flash attention 3 fallback to pytorch sdpa by touching as few lines of code as possible in main files and keeping all implementation to a single file. add tests. add helpful warning messages for the user.
Compare 4 commits »
tacit synced commits to refs/pull/204/merge at tacit/nanochat from mirror 2026-01-17 00:43:50 +00:00
00f1a3219d speedrun
2f7841cd50 remove all uv venv
184d4c12b1 also add to log about the FA3 changes
b62a5bc44a naturally i failed to include the actual code in the previous commit facepalm
Compare 6 commits »
tacit synced and deleted reference refs/tags/refs/pull/431/merge at tacit/nanochat from mirror 2026-01-17 00:43:50 +00:00
tacit synced and deleted reference refs/tags/refs/pull/436/merge at tacit/nanochat from mirror 2026-01-17 00:43:50 +00:00
tacit synced commits to refs/pull/204/head at tacit/nanochat from mirror 2026-01-17 00:43:50 +00:00
00f1a3219d speedrun
2f7841cd50 remove all uv venv
184d4c12b1 also add to log about the FA3 changes
b62a5bc44a naturally i failed to include the actual code in the previous commit facepalm
8203efa919 implement flash attention 3 fallback to pytorch sdpa by touching as few lines of code as possible in main files and keeping all implementation to a single file. add tests. add helpful warning messages for the user.
Compare 155 commits »
tacit synced commits to master at tacit/nanochat from mirror 2026-01-17 00:43:50 +00:00
1933e85046 brief update to log
184d4c12b1 also add to log about the FA3 changes
b62a5bc44a naturally i failed to include the actual code in the previous commit facepalm
8203efa919 implement flash attention 3 fallback to pytorch sdpa by touching as few lines of code as possible in main files and keeping all implementation to a single file. add tests. add helpful warning messages for the user.
Compare 4 commits »
tacit synced commits to refs/pull/434/merge at tacit/nanochat from mirror 2026-01-16 16:33:45 +00:00
50413d2d67 typo in comments: change "GAPO" to "DAPO"
fbf2bbea25 update log with a bunch of attempts
747ed4491f add negative result on olmo3 pretraining mix
7d1700c521 add zstd lib
Compare 9 commits »
tacit synced commits to refs/pull/436/merge at tacit/nanochat from mirror 2026-01-16 16:33:45 +00:00
3e5fccdfa4 feat: attempt fa3 load on sm < 9.0 (ampere/ada)
38e4e0dd7b Merge branch 'master' into fix/fa3-fallback-mps
Compare 3 commits »
tacit synced commits to refs/pull/436/head at tacit/nanochat from mirror 2026-01-16 16:33:45 +00:00
3e5fccdfa4 feat: attempt fa3 load on sm < 9.0 (ampere/ada)
38e4e0dd7b Merge branch 'master' into fix/fa3-fallback-mps
50413d2d67 typo in comments: change "GAPO" to "DAPO"
fbf2bbea25 update log with a bunch of attempts
747ed4491f add negative result on olmo3 pretraining mix
Compare 12 commits »
tacit synced commits to refs/pull/433/merge at tacit/nanochat from mirror 2026-01-16 16:33:44 +00:00
50413d2d67 typo in comments: change "GAPO" to "DAPO"
fbf2bbea25 update log with a bunch of attempts
747ed4491f add negative result on olmo3 pretraining mix
7d1700c521 add zstd lib
Compare 9 commits »
tacit synced commits to refs/pull/370/merge at tacit/nanochat from mirror 2026-01-16 16:33:44 +00:00
50413d2d67 typo in comments: change "GAPO" to "DAPO"
fbf2bbea25 update log with a bunch of attempts
747ed4491f add negative result on olmo3 pretraining mix
7d1700c521 add zstd lib
Compare 9 commits »
tacit synced commits to refs/pull/396/merge at tacit/nanochat from mirror 2026-01-16 16:33:44 +00:00
50413d2d67 typo in comments: change "GAPO" to "DAPO"
fbf2bbea25 update log with a bunch of attempts
747ed4491f add negative result on olmo3 pretraining mix
7d1700c521 add zstd lib
Compare 9 commits »
tacit synced commits to refs/pull/324/merge at tacit/nanochat from mirror 2026-01-16 16:33:44 +00:00
50413d2d67 typo in comments: change "GAPO" to "DAPO"
fbf2bbea25 update log with a bunch of attempts
747ed4491f add negative result on olmo3 pretraining mix
7d1700c521 add zstd lib
Compare 9 commits »