• Joined on 2024-05-31
tacit synced commits to refs/pull/328/merge at tacit/nanochat from mirror 2026-02-11 13:09:54 +00:00
2f09686724 clarify that this is bf16 mfu we're talking about
e569b59f92 delete torchao dependency, create our own exact API-matched version of Float8Linear, document it very well. for some poorly understood reason, the performance is not only ~identical but actually runs 3% faster. despite of it being significantly simpler and much less code. i don't fully understand why/how atm
1ec0a34779 at 28 and above we start to need batch size 8
ff46300720 tune miniseries just a bit, fairly cosmetic, keep to even depths where the math works out nicely in model sizing
tacit synced commits to refs/pull/442/merge at tacit/nanochat from mirror 2026-02-11 13:09:54 +00:00
2f09686724 clarify that this is bf16 mfu we're talking about
tacit synced commits to refs/pull/151/merge at tacit/nanochat from mirror 2026-02-11 13:09:53 +00:00
2f09686724 clarify that this is bf16 mfu we're talking about
e569b59f92 delete torchao dependency, create our own exact API-matched version of Float8Linear, document it very well. for some poorly understood reason, the performance is not only ~identical but actually runs 3% faster. despite of it being significantly simpler and much less code. i don't fully understand why/how atm
1ec0a34779 at 28 and above we start to need batch size 8
ff46300720 tune miniseries just a bit, fairly cosmetic, keep to even depths where the math works out nicely in model sizing
tacit synced commits to refs/pull/141/merge at tacit/nanochat from mirror 2026-02-11 13:09:52 +00:00
2f09686724 clarify that this is bf16 mfu we're talking about
e569b59f92 delete torchao dependency, create our own exact API-matched version of Float8Linear, document it very well. for some poorly understood reason, the performance is not only ~identical but actually runs 3% faster. despite of it being significantly simpler and much less code. i don't fully understand why/how atm
tacit synced commits to refs/pull/492/merge at tacit/nanochat from mirror 2026-02-11 04:59:51 +00:00
2f09686724 clarify that this is bf16 mfu we're talking about
tacit synced commits to refs/pull/501/merge at tacit/nanochat from mirror 2026-02-11 04:59:51 +00:00
2f09686724 clarify that this is bf16 mfu we're talking about
tacit synced commits to refs/pull/516/merge at tacit/nanochat from mirror 2026-02-11 04:59:51 +00:00
2f09686724 clarify that this is bf16 mfu we're talking about
tacit synced commits to master at tacit/nanochat from mirror 2026-02-11 04:59:51 +00:00
2f09686724 clarify that this is bf16 mfu we're talking about
tacit synced commits to refs/pull/512/merge at tacit/nanochat from mirror 2026-02-11 04:59:51 +00:00
2f09686724 clarify that this is bf16 mfu we're talking about
tacit synced commits to refs/pull/519/merge at tacit/nanochat from mirror 2026-02-11 04:59:51 +00:00
2f09686724 clarify that this is bf16 mfu we're talking about
tacit synced commits to refs/pull/437/merge at tacit/nanochat from mirror 2026-02-11 04:59:51 +00:00
2f09686724 clarify that this is bf16 mfu we're talking about
tacit synced commits to refs/pull/513/merge at tacit/nanochat from mirror 2026-02-11 04:59:51 +00:00
2f09686724 clarify that this is bf16 mfu we're talking about
tacit synced commits to refs/pull/520/merge at tacit/nanochat from mirror 2026-02-11 04:59:51 +00:00
2f09686724 clarify that this is bf16 mfu we're talking about
tacit synced commits to refs/pull/519/merge at tacit/nanochat from mirror 2026-02-10 20:50:00 +00:00
e569b59f92 delete torchao dependency, create our own exact API-matched version of Float8Linear, document it very well. for some poorly understood reason, the performance is not only ~identical but actually runs 3% faster. despite of it being significantly simpler and much less code. i don't fully understand why/how atm
tacit synced commits to refs/pull/520/merge at tacit/nanochat from mirror 2026-02-10 20:50:00 +00:00
e569b59f92 delete torchao dependency, create our own exact API-matched version of Float8Linear, document it very well. for some poorly understood reason, the performance is not only ~identical but actually runs 3% faster. despite of it being significantly simpler and much less code. i don't fully understand why/how atm
tacit synced commits to refs/pull/513/merge at tacit/nanochat from mirror 2026-02-10 20:49:59 +00:00
e569b59f92 delete torchao dependency, create our own exact API-matched version of Float8Linear, document it very well. for some poorly understood reason, the performance is not only ~identical but actually runs 3% faster. despite of it being significantly simpler and much less code. i don't fully understand why/how atm
tacit synced commits to refs/pull/511/merge at tacit/nanochat from mirror 2026-02-10 20:49:59 +00:00
e569b59f92 delete torchao dependency, create our own exact API-matched version of Float8Linear, document it very well. for some poorly understood reason, the performance is not only ~identical but actually runs 3% faster. despite of it being significantly simpler and much less code. i don't fully understand why/how atm
tacit synced commits to refs/pull/515/merge at tacit/nanochat from mirror 2026-02-10 20:49:59 +00:00
e569b59f92 delete torchao dependency, create our own exact API-matched version of Float8Linear, document it very well. for some poorly understood reason, the performance is not only ~identical but actually runs 3% faster. despite of it being significantly simpler and much less code. i don't fully understand why/how atm
tacit synced commits to refs/pull/512/merge at tacit/nanochat from mirror 2026-02-10 20:49:59 +00:00
e569b59f92 delete torchao dependency, create our own exact API-matched version of Float8Linear, document it very well. for some poorly understood reason, the performance is not only ~identical but actually runs 3% faster. despite of it being significantly simpler and much less code. i don't fully understand why/how atm
tacit synced commits to refs/pull/516/merge at tacit/nanochat from mirror 2026-02-10 20:49:59 +00:00
e569b59f92 delete torchao dependency, create our own exact API-matched version of Float8Linear, document it very well. for some poorly understood reason, the performance is not only ~identical but actually runs 3% faster. despite of it being significantly simpler and much less code. i don't fully understand why/how atm