• Joined on 2024-05-31
tacit synced commits to refs/pull/201/merge at tacit/nanochat from mirror 2025-11-13 17:12:13 +00:00
9a71d13688 typo oops
7b7fd0fe71 thank you Sophie for your help with nanochat
c6abcdfe3a big change: add pretraining resumption logic so that checkpoints can now be approximately resumed and training can continue. this is useful for very long runs when you don't want the anxiety of your run crashing for some reason. alternatively, it's a way to recover training in the event of loss spikes. i mean, this should have been there in v0 but it's ok. the resumption is approximate to control complexity and bloat, but it's possible we want to change that in the future. to use, set --save_every to a step interval to write checkpoints with, and then use --resume_from_step to resume optimization from a given step. only base model training (pretraining) supports this atm, but it's ok because midtraining is comparably quite a bit faster.
91f09ccd0d minor fix comment in engine
Compare 6 commits »
tacit synced commits to master at tacit/nanochat from mirror 2025-11-13 17:12:13 +00:00
9a71d13688 typo oops
7b7fd0fe71 thank you Sophie for your help with nanochat
c6abcdfe3a big change: add pretraining resumption logic so that checkpoints can now be approximately resumed and training can continue. this is useful for very long runs when you don't want the anxiety of your run crashing for some reason. alternatively, it's a way to recover training in the event of loss spikes. i mean, this should have been there in v0 but it's ok. the resumption is approximate to control complexity and bloat, but it's possible we want to change that in the future. to use, set --save_every to a step interval to write checkpoints with, and then use --resume_from_step to resume optimization from a given step. only base model training (pretraining) supports this atm, but it's ok because midtraining is comparably quite a bit faster.
91f09ccd0d minor fix comment in engine
adb5d4a16c uv lock has to change when we removed numpy the other commit
Compare 5 commits »
tacit synced and deleted reference refs/tags/refs/pull/279/merge at tacit/nanochat from mirror 2025-11-12 16:42:13 +00:00
tacit synced and deleted reference refs/tags/refs/pull/282/merge at tacit/nanochat from mirror 2025-11-12 16:42:13 +00:00
tacit synced and deleted reference refs/tags/refs/pull/281/merge at tacit/nanochat from mirror 2025-11-12 16:42:13 +00:00
tacit synced and deleted reference refs/tags/refs/pull/262/merge at tacit/nanochat from mirror 2025-11-12 16:42:13 +00:00
tacit synced commits to refs/pull/262/head at tacit/nanochat from mirror 2025-11-10 15:42:13 +00:00
9f3b8680ee cosmetics, description tweaks
tacit synced and deleted reference refs/tags/refs/pull/267/merge at tacit/nanochat from mirror 2025-11-10 15:42:13 +00:00
tacit synced and deleted reference refs/tags/refs/pull/265/merge at tacit/nanochat from mirror 2025-11-10 15:42:13 +00:00
tacit synced commits to refs/pull/262/merge at tacit/nanochat from mirror 2025-11-10 15:42:13 +00:00
9f3b8680ee cosmetics, description tweaks
Compare 2 commits »
tacit synced commits to refs/pull/262/merge at tacit/nanochat from mirror 2025-11-10 07:32:16 +00:00
f6b1d4f139 correct default value
Compare 2 commits »
tacit synced commits to refs/pull/262/head at tacit/nanochat from mirror 2025-11-10 07:32:16 +00:00
f6b1d4f139 correct default value
tacit synced and deleted reference refs/tags/refs/pull/259/merge at tacit/nanochat from mirror 2025-11-09 23:22:12 +00:00
tacit synced commits to refs/pull/262/head at tacit/nanochat from mirror 2025-11-08 22:52:13 +00:00
7d62478d78 cosmetic changes
tacit synced commits to refs/pull/262/merge at tacit/nanochat from mirror 2025-11-08 22:52:13 +00:00
7d62478d78 cosmetic changes
Compare 2 commits »
tacit synced commits to refs/pull/258/merge at tacit/nanochat from mirror 2025-11-08 14:42:13 +00:00
c2740d3a82 improve dataset downloader logging and add progress bar
2801dc341b add resusable file logger helper method
d4cc96d749 add get_logs_dir() to resolve log output path
8788ffb3db add helper to locate project root dynamically
Compare 5 commits »
tacit synced commits to refs/pull/258/head at tacit/nanochat from mirror 2025-11-08 14:42:13 +00:00
c2740d3a82 improve dataset downloader logging and add progress bar
2801dc341b add resusable file logger helper method
d4cc96d749 add get_logs_dir() to resolve log output path
8788ffb3db add helper to locate project root dynamically
c6b7ab7440 grad clip logging and printing and cosmetics
Compare 16 commits »
tacit synced commits to refs/pull/159/merge at tacit/nanochat from mirror 2025-11-08 06:32:14 +00:00
c6b7ab7440 grad clip logging and printing and cosmetics
885a4f25e7 Replace fcntl with filelock for Windows compatibility
3a2ae631c4 Merge branch 'master' into master
12d995f58c Add NPROC_PER_NODE var to speedrun.sh and run1000.sh
Compare 13 commits »
tacit synced commits to refs/pull/141/merge at tacit/nanochat from mirror 2025-11-08 06:32:13 +00:00
c6b7ab7440 grad clip logging and printing and cosmetics
Compare 2 commits »
tacit synced commits to refs/pull/93/merge at tacit/nanochat from mirror 2025-11-07 06:02:18 +00:00
c6b7ab7440 grad clip logging and printing and cosmetics
Compare 2 commits »