• Joined on 2024-05-31
tacit synced commits to refs/pull/162/merge at tacit/nanochat from mirror 2025-10-24 23:32:13 +00:00
05a051dbe9 fix tokenization bug, there should be no space before first letter. sigh
Compare 2 commits »
tacit synced commits to refs/pull/161/merge at tacit/nanochat from mirror 2025-10-24 23:32:13 +00:00
0a1059d571 add into rustbpe
851810c7d5 remove string allocations
05a051dbe9 fix tokenization bug, there should be no space before first letter. sigh
8892470f29 add the SpellingBee task so that nanochat can count r in strawberry etc. along the way we had to add a bunch of new functionality, e.g. extend the calculator to support the count function of python. possibly the current TaskMixture uses way too many synthetic examples of SpellingBee because the eval gives us exactly 100% performance on spelling. We can tune this later to reclaim some wall clock time here I think
Compare 7 commits »
tacit synced commits to refs/pull/161/head at tacit/nanochat from mirror 2025-10-24 23:32:13 +00:00
0a1059d571 add into rustbpe
851810c7d5 remove string allocations
Compare 2 commits »
tacit synced commits to refs/pull/159/merge at tacit/nanochat from mirror 2025-10-24 23:32:13 +00:00
05a051dbe9 fix tokenization bug, there should be no space before first letter. sigh
8892470f29 add the SpellingBee task so that nanochat can count r in strawberry etc. along the way we had to add a bunch of new functionality, e.g. extend the calculator to support the count function of python. possibly the current TaskMixture uses way too many synthetic examples of SpellingBee because the eval gives us exactly 100% performance on spelling. We can tune this later to reclaim some wall clock time here I think
81597cd616 move the lr schedule args up in base_train so they are tunable in configurator
cc3636b01c allow the tokenizer visualize_tokenization to also print the exact token id. you can never be paranoid enough
Compare 5 commits »
tacit synced commits to refs/pull/156/merge at tacit/nanochat from mirror 2025-10-24 23:32:13 +00:00
05a051dbe9 fix tokenization bug, there should be no space before first letter. sigh
8892470f29 add the SpellingBee task so that nanochat can count r in strawberry etc. along the way we had to add a bunch of new functionality, e.g. extend the calculator to support the count function of python. possibly the current TaskMixture uses way too many synthetic examples of SpellingBee because the eval gives us exactly 100% performance on spelling. We can tune this later to reclaim some wall clock time here I think
81597cd616 move the lr schedule args up in base_train so they are tunable in configurator
cc3636b01c allow the tokenizer visualize_tokenization to also print the exact token id. you can never be paranoid enough
Compare 5 commits »
tacit synced commits to refs/pull/18/merge at tacit/nanochat from mirror 2025-10-24 23:32:13 +00:00
05a051dbe9 fix tokenization bug, there should be no space before first letter. sigh
Compare 2 commits »
tacit synced commits to refs/pull/151/merge at tacit/nanochat from mirror 2025-10-24 23:32:13 +00:00
05a051dbe9 fix tokenization bug, there should be no space before first letter. sigh
8892470f29 add the SpellingBee task so that nanochat can count r in strawberry etc. along the way we had to add a bunch of new functionality, e.g. extend the calculator to support the count function of python. possibly the current TaskMixture uses way too many synthetic examples of SpellingBee because the eval gives us exactly 100% performance on spelling. We can tune this later to reclaim some wall clock time here I think
81597cd616 move the lr schedule args up in base_train so they are tunable in configurator
cc3636b01c allow the tokenizer visualize_tokenization to also print the exact token id. you can never be paranoid enough
Compare 5 commits »
tacit synced commits to refs/pull/15/merge at tacit/nanochat from mirror 2025-10-24 23:32:13 +00:00
05a051dbe9 fix tokenization bug, there should be no space before first letter. sigh
8892470f29 add the SpellingBee task so that nanochat can count r in strawberry etc. along the way we had to add a bunch of new functionality, e.g. extend the calculator to support the count function of python. possibly the current TaskMixture uses way too many synthetic examples of SpellingBee because the eval gives us exactly 100% performance on spelling. We can tune this later to reclaim some wall clock time here I think
81597cd616 move the lr schedule args up in base_train so they are tunable in configurator
cc3636b01c allow the tokenizer visualize_tokenization to also print the exact token id. you can never be paranoid enough
Compare 5 commits »
tacit synced commits to refs/pull/147/merge at tacit/nanochat from mirror 2025-10-24 23:32:13 +00:00
05a051dbe9 fix tokenization bug, there should be no space before first letter. sigh
8892470f29 add the SpellingBee task so that nanochat can count r in strawberry etc. along the way we had to add a bunch of new functionality, e.g. extend the calculator to support the count function of python. possibly the current TaskMixture uses way too many synthetic examples of SpellingBee because the eval gives us exactly 100% performance on spelling. We can tune this later to reclaim some wall clock time here I think
81597cd616 move the lr schedule args up in base_train so they are tunable in configurator
cc3636b01c allow the tokenizer visualize_tokenization to also print the exact token id. you can never be paranoid enough
Compare 5 commits »
tacit synced commits to refs/pull/110/merge at tacit/nanochat from mirror 2025-10-24 23:32:13 +00:00
05a051dbe9 fix tokenization bug, there should be no space before first letter. sigh
8892470f29 add the SpellingBee task so that nanochat can count r in strawberry etc. along the way we had to add a bunch of new functionality, e.g. extend the calculator to support the count function of python. possibly the current TaskMixture uses way too many synthetic examples of SpellingBee because the eval gives us exactly 100% performance on spelling. We can tune this later to reclaim some wall clock time here I think
81597cd616 move the lr schedule args up in base_train so they are tunable in configurator
cc3636b01c allow the tokenizer visualize_tokenization to also print the exact token id. you can never be paranoid enough
Compare 5 commits »
tacit synced commits to refs/pull/89/merge at tacit/nanochat from mirror 2025-10-24 15:22:14 +00:00
05a051dbe9 fix tokenization bug, there should be no space before first letter. sigh
8892470f29 add the SpellingBee task so that nanochat can count r in strawberry etc. along the way we had to add a bunch of new functionality, e.g. extend the calculator to support the count function of python. possibly the current TaskMixture uses way too many synthetic examples of SpellingBee because the eval gives us exactly 100% performance on spelling. We can tune this later to reclaim some wall clock time here I think
81597cd616 move the lr schedule args up in base_train so they are tunable in configurator
cc3636b01c allow the tokenizer visualize_tokenization to also print the exact token id. you can never be paranoid enough
Compare 5 commits »
tacit synced commits to refs/pull/75/merge at tacit/nanochat from mirror 2025-10-24 15:22:14 +00:00
8892470f29 add the SpellingBee task so that nanochat can count r in strawberry etc. along the way we had to add a bunch of new functionality, e.g. extend the calculator to support the count function of python. possibly the current TaskMixture uses way too many synthetic examples of SpellingBee because the eval gives us exactly 100% performance on spelling. We can tune this later to reclaim some wall clock time here I think
81597cd616 move the lr schedule args up in base_train so they are tunable in configurator
cc3636b01c allow the tokenizer visualize_tokenization to also print the exact token id. you can never be paranoid enough
Compare 4 commits »
tacit synced commits to refs/pull/63/merge at tacit/nanochat from mirror 2025-10-24 15:22:14 +00:00
5eeb2b6ef9 experiment: looking to 'hire' a nanochat repo czar to help the repo, mentioning in readme
2dda5c4c8d Merge branch 'ulanch-fix/ios-safari-input-overlap'
80b203ea59 also bump run1000.sh to new uv sync
917c858136 Updates lockfile with CPU package support without overwriting other architectures
Compare 15 commits »
tacit synced commits to refs/pull/53/merge at tacit/nanochat from mirror 2025-10-24 15:22:14 +00:00
5eeb2b6ef9 experiment: looking to 'hire' a nanochat repo czar to help the repo, mentioning in readme
2dda5c4c8d Merge branch 'ulanch-fix/ios-safari-input-overlap'
80b203ea59 also bump run1000.sh to new uv sync
917c858136 Updates lockfile with CPU package support without overwriting other architectures
Compare 15 commits »
tacit synced commits to refs/pull/40/merge at tacit/nanochat from mirror 2025-10-24 15:22:14 +00:00
5eeb2b6ef9 experiment: looking to 'hire' a nanochat repo czar to help the repo, mentioning in readme
2dda5c4c8d Merge branch 'ulanch-fix/ios-safari-input-overlap'
80b203ea59 also bump run1000.sh to new uv sync
917c858136 Updates lockfile with CPU package support without overwriting other architectures
Compare 15 commits »
tacit synced commits to refs/pull/39/merge at tacit/nanochat from mirror 2025-10-24 15:22:13 +00:00
5eeb2b6ef9 experiment: looking to 'hire' a nanochat repo czar to help the repo, mentioning in readme
2dda5c4c8d Merge branch 'ulanch-fix/ios-safari-input-overlap'
80b203ea59 also bump run1000.sh to new uv sync
917c858136 Updates lockfile with CPU package support without overwriting other architectures
Compare 15 commits »
tacit synced commits to refs/pull/38/merge at tacit/nanochat from mirror 2025-10-24 15:22:13 +00:00
5eeb2b6ef9 experiment: looking to 'hire' a nanochat repo czar to help the repo, mentioning in readme
2dda5c4c8d Merge branch 'ulanch-fix/ios-safari-input-overlap'
80b203ea59 also bump run1000.sh to new uv sync
917c858136 Updates lockfile with CPU package support without overwriting other architectures
Compare 15 commits »
tacit synced commits to refs/pull/35/merge at tacit/nanochat from mirror 2025-10-24 15:22:13 +00:00
5eeb2b6ef9 experiment: looking to 'hire' a nanochat repo czar to help the repo, mentioning in readme
2dda5c4c8d Merge branch 'ulanch-fix/ios-safari-input-overlap'
80b203ea59 also bump run1000.sh to new uv sync
917c858136 Updates lockfile with CPU package support without overwriting other architectures
Compare 15 commits »
tacit synced commits to refs/pull/18/merge at tacit/nanochat from mirror 2025-10-24 15:22:13 +00:00
8892470f29 add the SpellingBee task so that nanochat can count r in strawberry etc. along the way we had to add a bunch of new functionality, e.g. extend the calculator to support the count function of python. possibly the current TaskMixture uses way too many synthetic examples of SpellingBee because the eval gives us exactly 100% performance on spelling. We can tune this later to reclaim some wall clock time here I think
81597cd616 move the lr schedule args up in base_train so they are tunable in configurator
cc3636b01c allow the tokenizer visualize_tokenization to also print the exact token id. you can never be paranoid enough
Compare 4 commits »
tacit synced commits to refs/pull/141/merge at tacit/nanochat from mirror 2025-10-24 15:22:13 +00:00
05a051dbe9 fix tokenization bug, there should be no space before first letter. sigh
8892470f29 add the SpellingBee task so that nanochat can count r in strawberry etc. along the way we had to add a bunch of new functionality, e.g. extend the calculator to support the count function of python. possibly the current TaskMixture uses way too many synthetic examples of SpellingBee because the eval gives us exactly 100% performance on spelling. We can tune this later to reclaim some wall clock time here I think
81597cd616 move the lr schedule args up in base_train so they are tunable in configurator
cc3636b01c allow the tokenizer visualize_tokenization to also print the exact token id. you can never be paranoid enough
Compare 5 commits »