• Joined on 2024-05-31
tacit synced commits to refs/pull/498/merge at tacit/nanochat from mirror 2026-02-17 17:10:23 +00:00
4a6e47b0c6 update dev log with recent
Compare 2 commits »
tacit synced commits to master at tacit/nanochat from mirror 2026-02-17 17:10:23 +00:00
4a6e47b0c6 update dev log with recent
tacit synced commits to refs/pull/526/merge at tacit/nanochat from mirror 2026-02-17 09:00:23 +00:00
8180e1d8c1 tune the data mixture a bit, load optimizer by default when SFT. These were confirmed to be best settings from sweeps of sft
Compare 2 commits »
tacit synced commits to refs/pull/483/merge at tacit/nanochat from mirror 2026-02-17 09:00:22 +00:00
8180e1d8c1 tune the data mixture a bit, load optimizer by default when SFT. These were confirmed to be best settings from sweeps of sft
788dadeb88 a number of upgrades to SFT script to bring it up to date w.r.t. pretraining and tuning some of its kwargs based on sweeps
Compare 3 commits »
tacit synced commits to refs/pull/486/merge at tacit/nanochat from mirror 2026-02-17 09:00:22 +00:00
8180e1d8c1 tune the data mixture a bit, load optimizer by default when SFT. These were confirmed to be best settings from sweeps of sft
Compare 2 commits »
tacit synced commits to refs/pull/536/merge at tacit/nanochat from mirror 2026-02-17 00:50:25 +00:00
8180e1d8c1 tune the data mixture a bit, load optimizer by default when SFT. These were confirmed to be best settings from sweeps of sft
Compare 2 commits »
tacit synced commits to refs/pull/535/merge at tacit/nanochat from mirror 2026-02-17 00:50:25 +00:00
8180e1d8c1 tune the data mixture a bit, load optimizer by default when SFT. These were confirmed to be best settings from sweeps of sft
Compare 2 commits »
tacit synced commits to refs/pull/510/merge at tacit/nanochat from mirror 2026-02-17 00:50:24 +00:00
788dadeb88 a number of upgrades to SFT script to bring it up to date w.r.t. pretraining and tuning some of its kwargs based on sweeps
Compare 2 commits »
tacit synced commits to refs/pull/521/merge at tacit/nanochat from mirror 2026-02-17 00:50:24 +00:00
788dadeb88 a number of upgrades to SFT script to bring it up to date w.r.t. pretraining and tuning some of its kwargs based on sweeps
Compare 2 commits »
tacit synced commits to refs/pull/516/merge at tacit/nanochat from mirror 2026-02-17 00:50:24 +00:00
8180e1d8c1 tune the data mixture a bit, load optimizer by default when SFT. These were confirmed to be best settings from sweeps of sft
Compare 2 commits »
tacit synced commits to refs/pull/509/merge at tacit/nanochat from mirror 2026-02-17 00:50:24 +00:00
8180e1d8c1 tune the data mixture a bit, load optimizer by default when SFT. These were confirmed to be best settings from sweeps of sft
788dadeb88 a number of upgrades to SFT script to bring it up to date w.r.t. pretraining and tuning some of its kwargs based on sweeps
Compare 3 commits »
tacit synced commits to refs/pull/531/merge at tacit/nanochat from mirror 2026-02-17 00:50:24 +00:00
8180e1d8c1 tune the data mixture a bit, load optimizer by default when SFT. These were confirmed to be best settings from sweeps of sft
788dadeb88 a number of upgrades to SFT script to bring it up to date w.r.t. pretraining and tuning some of its kwargs based on sweeps
Compare 3 commits »
tacit synced commits to refs/pull/533/merge at tacit/nanochat from mirror 2026-02-17 00:50:24 +00:00
8180e1d8c1 tune the data mixture a bit, load optimizer by default when SFT. These were confirmed to be best settings from sweeps of sft
Compare 2 commits »
tacit synced commits to refs/pull/486/merge at tacit/nanochat from mirror 2026-02-17 00:50:24 +00:00
788dadeb88 a number of upgrades to SFT script to bring it up to date w.r.t. pretraining and tuning some of its kwargs based on sweeps
Compare 2 commits »
tacit synced commits to refs/pull/485/merge at tacit/nanochat from mirror 2026-02-17 00:50:24 +00:00
788dadeb88 a number of upgrades to SFT script to bring it up to date w.r.t. pretraining and tuning some of its kwargs based on sweeps
Compare 2 commits »
tacit synced commits to refs/pull/489/merge at tacit/nanochat from mirror 2026-02-17 00:50:24 +00:00
788dadeb88 a number of upgrades to SFT script to bring it up to date w.r.t. pretraining and tuning some of its kwargs based on sweeps
Compare 2 commits »
tacit synced commits to refs/pull/498/merge at tacit/nanochat from mirror 2026-02-17 00:50:24 +00:00
8180e1d8c1 tune the data mixture a bit, load optimizer by default when SFT. These were confirmed to be best settings from sweeps of sft
Compare 2 commits »
tacit synced commits to refs/pull/501/merge at tacit/nanochat from mirror 2026-02-17 00:50:24 +00:00
8180e1d8c1 tune the data mixture a bit, load optimizer by default when SFT. These were confirmed to be best settings from sweeps of sft
788dadeb88 a number of upgrades to SFT script to bring it up to date w.r.t. pretraining and tuning some of its kwargs based on sweeps
Compare 3 commits »
tacit synced commits to master at tacit/nanochat from mirror 2026-02-17 00:50:23 +00:00
8180e1d8c1 tune the data mixture a bit, load optimizer by default when SFT. These were confirmed to be best settings from sweeps of sft
tacit synced commits to refs/pull/141/merge at tacit/nanochat from mirror 2026-02-17 00:50:23 +00:00
788dadeb88 a number of upgrades to SFT script to bring it up to date w.r.t. pretraining and tuning some of its kwargs based on sweeps
Compare 2 commits »