• Joined on 2024-05-31
tacit synced commits to refs/pull/533/merge at tacit/nanochat from mirror 2026-02-19 01:50:30 +00:00
0a23f87643 Fix bug in setting precision (#538)
Compare 2 commits »
tacit synced commits to refs/pull/536/merge at tacit/nanochat from mirror 2026-02-19 01:50:30 +00:00
0a23f87643 Fix bug in setting precision (#538)
Compare 2 commits »
tacit synced commits to refs/pull/526/merge at tacit/nanochat from mirror 2026-02-19 01:50:29 +00:00
48804bff3a report negative result on fineweb dataset
bb5137860e fix comment
458555117b Merge branch 'Chetter2-patch-1'
bac5a35dd7 fix minor bug in fp8 application to skip tiny matmuls
Compare 13 commits »
tacit synced commits to refs/pull/520/merge at tacit/nanochat from mirror 2026-02-19 01:50:28 +00:00
0a23f87643 Fix bug in setting precision (#538)
Compare 2 commits »
tacit synced commits to refs/pull/501/merge at tacit/nanochat from mirror 2026-02-19 01:50:27 +00:00
0a23f87643 Fix bug in setting precision (#538)
4800c62f6e Fix MockModel's device definition (#535)
Compare 3 commits »
tacit synced commits to refs/pull/509/merge at tacit/nanochat from mirror 2026-02-19 01:50:27 +00:00
48804bff3a report negative result on fineweb dataset
bb5137860e fix comment
458555117b Merge branch 'Chetter2-patch-1'
bac5a35dd7 fix minor bug in fp8 application to skip tiny matmuls
Compare 15 commits »
tacit synced commits to refs/pull/486/merge at tacit/nanochat from mirror 2026-02-19 01:50:27 +00:00
0a23f87643 Fix bug in setting precision (#538)
4800c62f6e Fix MockModel's device definition (#535)
4a6e47b0c6 update dev log with recent
Compare 4 commits »
tacit synced commits to refs/pull/483/merge at tacit/nanochat from mirror 2026-02-19 01:50:26 +00:00
0a23f87643 Fix bug in setting precision (#538)
4800c62f6e Fix MockModel's device definition (#535)
4a6e47b0c6 update dev log with recent
Compare 4 commits »
tacit synced commits to master at tacit/nanochat from mirror 2026-02-19 01:50:26 +00:00
48804bff3a report negative result on fineweb dataset
bb5137860e fix comment
458555117b Merge branch 'Chetter2-patch-1'
bac5a35dd7 fix minor bug in fp8 application to skip tiny matmuls
ad55575326 Fix bug in setting precision (#538)
Compare 11 commits »
tacit synced and deleted reference refs/tags/refs/pull/531/merge at tacit/nanochat from mirror 2026-02-19 01:50:25 +00:00
tacit synced and deleted reference refs/tags/refs/pull/516/merge at tacit/nanochat from mirror 2026-02-19 01:50:25 +00:00
tacit synced and deleted reference refs/tags/refs/pull/510/merge at tacit/nanochat from mirror 2026-02-19 01:50:25 +00:00
tacit synced commits to refs/pull/531/merge at tacit/nanochat from mirror 2026-02-18 17:40:24 +00:00
0a23f87643 Fix bug in setting precision (#538)
Compare 2 commits »
tacit synced commits to refs/pull/522/merge at tacit/nanochat from mirror 2026-02-18 17:40:23 +00:00
4800c62f6e Fix MockModel's device definition (#535)
4a6e47b0c6 update dev log with recent
8180e1d8c1 tune the data mixture a bit, load optimizer by default when SFT. These were confirmed to be best settings from sweeps of sft
788dadeb88 a number of upgrades to SFT script to bring it up to date w.r.t. pretraining and tuning some of its kwargs based on sweeps
Compare 5 commits »
tacit synced commits to refs/pull/511/merge at tacit/nanochat from mirror 2026-02-18 17:40:23 +00:00
4800c62f6e Fix MockModel's device definition (#535)
Compare 2 commits »
tacit synced commits to refs/pull/498/merge at tacit/nanochat from mirror 2026-02-18 17:40:23 +00:00
0a23f87643 Fix bug in setting precision (#538)
Compare 2 commits »
tacit synced commits to refs/pull/520/merge at tacit/nanochat from mirror 2026-02-18 17:40:23 +00:00
4800c62f6e Fix MockModel's device definition (#535)
4a6e47b0c6 update dev log with recent
8180e1d8c1 tune the data mixture a bit, load optimizer by default when SFT. These were confirmed to be best settings from sweeps of sft
Compare 4 commits »
tacit synced commits to master at tacit/nanochat from mirror 2026-02-18 17:40:23 +00:00
0a23f87643 Fix bug in setting precision (#538)
tacit synced commits to refs/pull/521/merge at tacit/nanochat from mirror 2026-02-18 17:40:23 +00:00
4800c62f6e Fix MockModel's device definition (#535)
4a6e47b0c6 update dev log with recent
8180e1d8c1 tune the data mixture a bit, load optimizer by default when SFT. These were confirmed to be best settings from sweeps of sft
Compare 4 commits »
tacit synced commits to refs/pull/498/merge at tacit/nanochat from mirror 2026-02-18 09:30:24 +00:00
4800c62f6e Fix MockModel's device definition (#535)
Compare 2 commits »