• Joined on 2024-05-31
tacit synced commits to refs/pull/568/merge at tacit/nanochat from mirror 2026-03-10 19:25:31 +00:00
f068604948 new leaderboard entry coming from improvements of autoresearch round 1, time to gpt-2 from 2.02 hours to 1.80 hours
6ed7d1d82c All of these improvements were developed by Claude running autonomously over ~2 days using autoresearch. I didn't touch anything - incredible. All tuning was done on d12 but generalized easily to larger models (e.g. d24 in particular). This means we will also get a new "Time to GPT-2" Leaderboard entry, which I will push separately.
tacit synced commits to refs/pull/614/merge at tacit/nanochat from mirror 2026-03-10 11:15:32 +00:00
f068604948 new leaderboard entry coming from improvements of autoresearch round 1, time to gpt-2 from 2.02 hours to 1.80 hours
6ed7d1d82c All of these improvements were developed by Claude running autonomously over ~2 days using autoresearch. I didn't touch anything - incredible. All tuning was done on d12 but generalized easily to larger models (e.g. d24 in particular). This means we will also get a new "Time to GPT-2" Leaderboard entry, which I will push separately.
tacit synced commits to refs/pull/595/merge at tacit/nanochat from mirror 2026-03-10 11:15:31 +00:00
d96558bcb0 fix heading, cf #622
f068604948 new leaderboard entry coming from improvements of autoresearch round 1, time to gpt-2 from 2.02 hours to 1.80 hours
tacit synced commits to refs/pull/544/merge at tacit/nanochat from mirror 2026-03-10 11:15:31 +00:00
f068604948 new leaderboard entry coming from improvements of autoresearch round 1, time to gpt-2 from 2.02 hours to 1.80 hours
6ed7d1d82c All of these improvements were developed by Claude running autonomously over ~2 days using autoresearch. I didn't touch anything - incredible. All tuning was done on d12 but generalized easily to larger models (e.g. d24 in particular). This means we will also get a new "Time to GPT-2" Leaderboard entry, which I will push separately.
tacit synced commits to refs/pull/595/head at tacit/nanochat from mirror 2026-03-10 11:15:31 +00:00
d96558bcb0 fix heading, cf #622
tacit synced commits to refs/pull/612/merge at tacit/nanochat from mirror 2026-03-10 11:15:31 +00:00
f068604948 new leaderboard entry coming from improvements of autoresearch round 1, time to gpt-2 from 2.02 hours to 1.80 hours
6ed7d1d82c All of these improvements were developed by Claude running autonomously over ~2 days using autoresearch. I didn't touch anything - incredible. All tuning was done on d12 but generalized easily to larger models (e.g. d24 in particular). This means we will also get a new "Time to GPT-2" Leaderboard entry, which I will push separately.
tacit synced commits to master at tacit/nanochat from mirror 2026-03-10 11:15:30 +00:00
f068604948 new leaderboard entry coming from improvements of autoresearch round 1, time to gpt-2 from 2.02 hours to 1.80 hours
tacit synced and deleted reference refs/tags/refs/pull/615/merge at tacit/nanochat from mirror 2026-03-10 11:15:30 +00:00
tacit synced commits to refs/pull/522/merge at tacit/nanochat from mirror 2026-03-10 11:15:30 +00:00
f068604948 new leaderboard entry coming from improvements of autoresearch round 1, time to gpt-2 from 2.02 hours to 1.80 hours
6ed7d1d82c All of these improvements were developed by Claude running autonomously over ~2 days using autoresearch. I didn't touch anything - incredible. All tuning was done on d12 but generalized easily to larger models (e.g. d24 in particular). This means we will also get a new "Time to GPT-2" Leaderboard entry, which I will push separately.
tacit synced commits to refs/pull/615/head at tacit/nanochat from mirror 2026-03-10 03:05:33 +00:00
258796b04f fix: added interactive training prompt and fixed path issues
tacit synced commits to refs/pull/615/merge at tacit/nanochat from mirror 2026-03-10 03:05:33 +00:00
258796b04f fix: added interactive training prompt and fixed path issues
tacit synced commits to refs/pull/611/merge at tacit/nanochat from mirror 2026-03-10 03:05:33 +00:00
6ed7d1d82c All of these improvements were developed by Claude running autonomously over ~2 days using autoresearch. I didn't touch anything - incredible. All tuning was done on d12 but generalized easily to larger models (e.g. d24 in particular). This means we will also get a new "Time to GPT-2" Leaderboard entry, which I will push separately.
tacit synced commits to refs/pull/595/merge at tacit/nanochat from mirror 2026-03-10 03:05:32 +00:00
6ed7d1d82c All of these improvements were developed by Claude running autonomously over ~2 days using autoresearch. I didn't touch anything - incredible. All tuning was done on d12 but generalized easily to larger models (e.g. d24 in particular). This means we will also get a new "Time to GPT-2" Leaderboard entry, which I will push separately.
tacit synced commits to refs/pull/602/merge at tacit/nanochat from mirror 2026-03-10 03:05:32 +00:00
6ed7d1d82c All of these improvements were developed by Claude running autonomously over ~2 days using autoresearch. I didn't touch anything - incredible. All tuning was done on d12 but generalized easily to larger models (e.g. d24 in particular). This means we will also get a new "Time to GPT-2" Leaderboard entry, which I will push separately.
tacit synced commits to refs/pull/604/merge at tacit/nanochat from mirror 2026-03-10 03:05:32 +00:00
6ed7d1d82c All of these improvements were developed by Claude running autonomously over ~2 days using autoresearch. I didn't touch anything - incredible. All tuning was done on d12 but generalized easily to larger models (e.g. d24 in particular). This means we will also get a new "Time to GPT-2" Leaderboard entry, which I will push separately.
tacit synced commits to refs/pull/609/merge at tacit/nanochat from mirror 2026-03-10 03:05:32 +00:00
6ed7d1d82c All of these improvements were developed by Claude running autonomously over ~2 days using autoresearch. I didn't touch anything - incredible. All tuning was done on d12 but generalized easily to larger models (e.g. d24 in particular). This means we will also get a new "Time to GPT-2" Leaderboard entry, which I will push separately.
tacit synced commits to refs/pull/608/merge at tacit/nanochat from mirror 2026-03-10 03:05:32 +00:00
6ed7d1d82c All of these improvements were developed by Claude running autonomously over ~2 days using autoresearch. I didn't touch anything - incredible. All tuning was done on d12 but generalized easily to larger models (e.g. d24 in particular). This means we will also get a new "Time to GPT-2" Leaderboard entry, which I will push separately.
tacit synced commits to refs/pull/598/merge at tacit/nanochat from mirror 2026-03-10 03:05:32 +00:00
6ed7d1d82c All of these improvements were developed by Claude running autonomously over ~2 days using autoresearch. I didn't touch anything - incredible. All tuning was done on d12 but generalized easily to larger models (e.g. d24 in particular). This means we will also get a new "Time to GPT-2" Leaderboard entry, which I will push separately.
tacit synced commits to refs/pull/600/merge at tacit/nanochat from mirror 2026-03-10 03:05:32 +00:00
6ed7d1d82c All of these improvements were developed by Claude running autonomously over ~2 days using autoresearch. I didn't touch anything - incredible. All tuning was done on d12 but generalized easily to larger models (e.g. d24 in particular). This means we will also get a new "Time to GPT-2" Leaderboard entry, which I will push separately.
tacit synced commits to refs/pull/610/merge at tacit/nanochat from mirror 2026-03-10 03:05:32 +00:00
6ed7d1d82c All of these improvements were developed by Claude running autonomously over ~2 days using autoresearch. I didn't touch anything - incredible. All tuning was done on d12 but generalized easily to larger models (e.g. d24 in particular). This means we will also get a new "Time to GPT-2" Leaderboard entry, which I will push separately.