• Joined on 2024-05-31
tacit synced commits to refs/pull/442/merge at tacit/nanochat from mirror 2026-01-17 08:54:04 +00:00
f5425245f9 more GPU types from PR 147 thanks @Qubitium
2955650327 add detection of device to report more correct mfu for bf16
77a46902e4 Fix WANDB_RUN parameter passing in runcpu.sh (#407)
bbc4413c58 Add high value engine tests for core invariants (33 LoC) (#396)
Compare 12 commits »
tacit synced commits to refs/pull/312/merge at tacit/nanochat from mirror 2026-01-17 08:54:03 +00:00
77a46902e4 Fix WANDB_RUN parameter passing in runcpu.sh (#407)
bbc4413c58 Add high value engine tests for core invariants (33 LoC) (#396)
f42ae9e901 fix condition to perform bpb evaluation (#324)
e1dafc510f Reduce token waste in BOS bestfit by cropping shortest doc (#445)
Compare 10 commits »
tacit synced and deleted reference refs/tags/refs/pull/433/merge at tacit/nanochat from mirror 2026-01-17 08:54:03 +00:00
tacit synced and deleted reference refs/tags/refs/pull/445/merge at tacit/nanochat from mirror 2026-01-17 08:54:03 +00:00
tacit synced commits to master at tacit/nanochat from mirror 2026-01-17 08:54:03 +00:00
f5425245f9 more GPU types from PR 147 thanks @Qubitium
2955650327 add detection of device to report more correct mfu for bf16
77a46902e4 Fix WANDB_RUN parameter passing in runcpu.sh (#407)
bbc4413c58 Add high value engine tests for core invariants (33 LoC) (#396)
f42ae9e901 fix condition to perform bpb evaluation (#324)
Compare 7 commits »
tacit synced commits to refs/pull/204/merge at tacit/nanochat from mirror 2026-01-17 08:54:03 +00:00
f5425245f9 more GPU types from PR 147 thanks @Qubitium
2955650327 add detection of device to report more correct mfu for bf16
77a46902e4 Fix WANDB_RUN parameter passing in runcpu.sh (#407)
bbc4413c58 Add high value engine tests for core invariants (33 LoC) (#396)
Compare 9 commits »
tacit synced and deleted reference refs/tags/refs/pull/407/merge at tacit/nanochat from mirror 2026-01-17 08:54:02 +00:00
tacit synced and deleted reference refs/tags/refs/pull/405/merge at tacit/nanochat from mirror 2026-01-17 08:54:02 +00:00
tacit synced and deleted reference refs/tags/refs/pull/396/merge at tacit/nanochat from mirror 2026-01-17 08:54:02 +00:00
tacit synced and deleted reference refs/tags/refs/pull/147/merge at tacit/nanochat from mirror 2026-01-17 08:54:02 +00:00
tacit synced and deleted reference refs/tags/refs/pull/253/merge at tacit/nanochat from mirror 2026-01-17 08:54:02 +00:00
tacit synced and deleted reference refs/tags/refs/pull/324/merge at tacit/nanochat from mirror 2026-01-17 08:54:02 +00:00
tacit synced commits to refs/pull/429/merge at tacit/nanochat from mirror 2026-01-17 00:43:51 +00:00
184d4c12b1 also add to log about the FA3 changes
b62a5bc44a naturally i failed to include the actual code in the previous commit facepalm
8203efa919 implement flash attention 3 fallback to pytorch sdpa by touching as few lines of code as possible in main files and keeping all implementation to a single file. add tests. add helpful warning messages for the user.
Compare 4 commits »
tacit synced commits to refs/pull/311/merge at tacit/nanochat from mirror 2026-01-17 00:43:51 +00:00
184d4c12b1 also add to log about the FA3 changes
b62a5bc44a naturally i failed to include the actual code in the previous commit facepalm
8203efa919 implement flash attention 3 fallback to pytorch sdpa by touching as few lines of code as possible in main files and keeping all implementation to a single file. add tests. add helpful warning messages for the user.
Compare 4 commits »
tacit synced commits to refs/pull/437/merge at tacit/nanochat from mirror 2026-01-17 00:43:51 +00:00
184d4c12b1 also add to log about the FA3 changes
b62a5bc44a naturally i failed to include the actual code in the previous commit facepalm
8203efa919 implement flash attention 3 fallback to pytorch sdpa by touching as few lines of code as possible in main files and keeping all implementation to a single file. add tests. add helpful warning messages for the user.
Compare 4 commits »
tacit synced and deleted reference refs/tags/refs/pull/431/merge at tacit/nanochat from mirror 2026-01-17 00:43:50 +00:00
tacit synced commits to master at tacit/nanochat from mirror 2026-01-17 00:43:50 +00:00
1933e85046 brief update to log
184d4c12b1 also add to log about the FA3 changes
b62a5bc44a naturally i failed to include the actual code in the previous commit facepalm
8203efa919 implement flash attention 3 fallback to pytorch sdpa by touching as few lines of code as possible in main files and keeping all implementation to a single file. add tests. add helpful warning messages for the user.
Compare 4 commits »
tacit synced commits to refs/pull/204/merge at tacit/nanochat from mirror 2026-01-17 00:43:50 +00:00
00f1a3219d speedrun
2f7841cd50 remove all uv venv
184d4c12b1 also add to log about the FA3 changes
b62a5bc44a naturally i failed to include the actual code in the previous commit facepalm
Compare 6 commits »
tacit synced and deleted reference refs/tags/refs/pull/436/merge at tacit/nanochat from mirror 2026-01-17 00:43:50 +00:00
tacit synced commits to refs/pull/204/head at tacit/nanochat from mirror 2026-01-17 00:43:50 +00:00
00f1a3219d speedrun
2f7841cd50 remove all uv venv
184d4c12b1 also add to log about the FA3 changes
b62a5bc44a naturally i failed to include the actual code in the previous commit facepalm
8203efa919 implement flash attention 3 fallback to pytorch sdpa by touching as few lines of code as possible in main files and keeping all implementation to a single file. add tests. add helpful warning messages for the user.
Compare 155 commits »