• Joined on 2024-05-31
tacit synced and deleted reference refs/tags/refs/pull/394/merge at tacit/nanochat from mirror 2026-01-05 11:12:45 +00:00
tacit synced commits to refs/pull/93/merge at tacit/nanochat from mirror 2026-01-05 11:12:45 +00:00
9d4c9b786d many small fixes to base_train: reporting ETA, allowing some additional kwarg flexibility, making sure we don't crash when e.g. depth = 11 - we now calculate the closest num_heads that works
Compare 2 commits »
tacit synced commits to refs/pull/93/merge at tacit/nanochat from mirror 2026-01-05 03:02:46 +00:00
962b6bfba3 alright add transformers as a dep of the repo because it should be easy to evaluate the CORE score of HF models. Not super happy about it but i tried it and the uv.lock doesn't get bloated as much as i expected
ed2082fbc4 sane secrets management
eb7bbc1b66 delete the configurator in favor of argparse and clean up a lot of kwarg details to make them more consistent across all scripts
507d54224a fix small bug where this would break if git stage has deleted files
Compare 10 commits »
tacit synced commits to refs/pull/407/merge at tacit/nanochat from mirror 2026-01-05 03:02:46 +00:00
a9e3ccfbb8 Merge 1e04f9846e44fd602ac2232db056fe95c891adb8 into 9d4c9b786d
9d4c9b786d many small fixes to base_train: reporting ETA, allowing some additional kwarg flexibility, making sure we don't crash when e.g. depth = 11 - we now calculate the closest num_heads that works
962b6bfba3 alright add transformers as a dep of the repo because it should be easy to evaluate the CORE score of HF models. Not super happy about it but i tried it and the uv.lock doesn't get bloated as much as i expected
ed2082fbc4 sane secrets management
eb7bbc1b66 delete the configurator in favor of argparse and clean up a lot of kwarg details to make them more consistent across all scripts
Compare 7 commits »
tacit synced commits to refs/pull/405/merge at tacit/nanochat from mirror 2026-01-05 03:02:45 +00:00
9d4c9b786d many small fixes to base_train: reporting ETA, allowing some additional kwarg flexibility, making sure we don't crash when e.g. depth = 11 - we now calculate the closest num_heads that works
962b6bfba3 alright add transformers as a dep of the repo because it should be easy to evaluate the CORE score of HF models. Not super happy about it but i tried it and the uv.lock doesn't get bloated as much as i expected
ed2082fbc4 sane secrets management
eb7bbc1b66 delete the configurator in favor of argparse and clean up a lot of kwarg details to make them more consistent across all scripts
Compare 7 commits »
tacit synced commits to refs/pull/400/merge at tacit/nanochat from mirror 2026-01-05 03:02:45 +00:00
9d4c9b786d many small fixes to base_train: reporting ETA, allowing some additional kwarg flexibility, making sure we don't crash when e.g. depth = 11 - we now calculate the closest num_heads that works
962b6bfba3 alright add transformers as a dep of the repo because it should be easy to evaluate the CORE score of HF models. Not super happy about it but i tried it and the uv.lock doesn't get bloated as much as i expected
ed2082fbc4 sane secrets management
eb7bbc1b66 delete the configurator in favor of argparse and clean up a lot of kwarg details to make them more consistent across all scripts
Compare 7 commits »
tacit synced commits to refs/pull/399/merge at tacit/nanochat from mirror 2026-01-05 03:02:45 +00:00
9d4c9b786d many small fixes to base_train: reporting ETA, allowing some additional kwarg flexibility, making sure we don't crash when e.g. depth = 11 - we now calculate the closest num_heads that works
962b6bfba3 alright add transformers as a dep of the repo because it should be easy to evaluate the CORE score of HF models. Not super happy about it but i tried it and the uv.lock doesn't get bloated as much as i expected
ed2082fbc4 sane secrets management
eb7bbc1b66 delete the configurator in favor of argparse and clean up a lot of kwarg details to make them more consistent across all scripts
Compare 7 commits »
tacit synced commits to refs/pull/370/merge at tacit/nanochat from mirror 2026-01-05 03:02:44 +00:00
962b6bfba3 alright add transformers as a dep of the repo because it should be easy to evaluate the CORE score of HF models. Not super happy about it but i tried it and the uv.lock doesn't get bloated as much as i expected
ed2082fbc4 sane secrets management
eb7bbc1b66 delete the configurator in favor of argparse and clean up a lot of kwarg details to make them more consistent across all scripts
507d54224a fix small bug where this would break if git stage has deleted files
Compare 6 commits »
tacit synced commits to refs/pull/396/merge at tacit/nanochat from mirror 2026-01-05 03:02:44 +00:00
962b6bfba3 alright add transformers as a dep of the repo because it should be easy to evaluate the CORE score of HF models. Not super happy about it but i tried it and the uv.lock doesn't get bloated as much as i expected
ed2082fbc4 sane secrets management
eb7bbc1b66 delete the configurator in favor of argparse and clean up a lot of kwarg details to make them more consistent across all scripts
507d54224a fix small bug where this would break if git stage has deleted files
Compare 6 commits »
tacit synced commits to refs/pull/296/merge at tacit/nanochat from mirror 2026-01-05 03:02:43 +00:00
962b6bfba3 alright add transformers as a dep of the repo because it should be easy to evaluate the CORE score of HF models. Not super happy about it but i tried it and the uv.lock doesn't get bloated as much as i expected
ed2082fbc4 sane secrets management
eb7bbc1b66 delete the configurator in favor of argparse and clean up a lot of kwarg details to make them more consistent across all scripts
507d54224a fix small bug where this would break if git stage has deleted files
Compare 13 commits »
tacit synced commits to refs/pull/312/merge at tacit/nanochat from mirror 2026-01-05 03:02:43 +00:00
ed2082fbc4 sane secrets management
eb7bbc1b66 delete the configurator in favor of argparse and clean up a lot of kwarg details to make them more consistent across all scripts
507d54224a fix small bug where this would break if git stage has deleted files
9c60dfb64c bump nanochat to use the latest stable pytorch that is 2.9.1 . Run e.g. to re-update your local environment if you git pull
Compare 9 commits »
tacit synced commits to refs/pull/151/merge at tacit/nanochat from mirror 2026-01-05 03:02:42 +00:00
962b6bfba3 alright add transformers as a dep of the repo because it should be easy to evaluate the CORE score of HF models. Not super happy about it but i tried it and the uv.lock doesn't get bloated as much as i expected
ed2082fbc4 sane secrets management
eb7bbc1b66 delete the configurator in favor of argparse and clean up a lot of kwarg details to make them more consistent across all scripts
507d54224a fix small bug where this would break if git stage has deleted files
Compare 11 commits »
tacit synced commits to refs/pull/141/merge at tacit/nanochat from mirror 2026-01-05 03:02:39 +00:00
962b6bfba3 alright add transformers as a dep of the repo because it should be easy to evaluate the CORE score of HF models. Not super happy about it but i tried it and the uv.lock doesn't get bloated as much as i expected
ed2082fbc4 sane secrets management
eb7bbc1b66 delete the configurator in favor of argparse and clean up a lot of kwarg details to make them more consistent across all scripts
507d54224a fix small bug where this would break if git stage has deleted files
Compare 6 commits »
tacit synced commits to master at tacit/nanochat from mirror 2026-01-05 03:02:37 +00:00
9d4c9b786d many small fixes to base_train: reporting ETA, allowing some additional kwarg flexibility, making sure we don't crash when e.g. depth = 11 - we now calculate the closest num_heads that works
962b6bfba3 alright add transformers as a dep of the repo because it should be easy to evaluate the CORE score of HF models. Not super happy about it but i tried it and the uv.lock doesn't get bloated as much as i expected
ed2082fbc4 sane secrets management
eb7bbc1b66 delete the configurator in favor of argparse and clean up a lot of kwarg details to make them more consistent across all scripts
507d54224a fix small bug where this would break if git stage has deleted files
Compare 6 commits »
tacit synced commits to refs/pull/59/merge at tacit/nanochat from mirror 2026-01-04 18:52:40 +00:00
be56d29b87 simplify redundant if/elif in bloat metrics
ee79f29fbd replace files-to-prompt with git ls-files for bloat metrics
da8b7ea4cb also delete the rustbpe test code, this now lives in rustbpe repo that is separate
aa42f40e66 delete the inline rustbpe project. it was ugly to have a project within project and rustbpe is now nicely a separate repo on my github karpathy/rustbpe and it's on pypi etc., so we just add it as a depedency to uv. i think it is appropriate that this is a separate repo because 1) it doesn't have too many knobs, other than the ones that are exposed - the regex pattern and vocab size and 2) all of its complexity is not algorithmic (it's equivalent to minbpe), instead it is efficiency-related, so it is ok to hide relatively speaking
Compare 5 commits »
tacit synced commits to refs/pull/370/merge at tacit/nanochat from mirror 2026-01-04 18:52:37 +00:00
be56d29b87 simplify redundant if/elif in bloat metrics
ee79f29fbd replace files-to-prompt with git ls-files for bloat metrics
da8b7ea4cb also delete the rustbpe test code, this now lives in rustbpe repo that is separate
aa42f40e66 delete the inline rustbpe project. it was ugly to have a project within project and rustbpe is now nicely a separate repo on my github karpathy/rustbpe and it's on pypi etc., so we just add it as a depedency to uv. i think it is appropriate that this is a separate repo because 1) it doesn't have too many knobs, other than the ones that are exposed - the regex pattern and vocab size and 2) all of its complexity is not algorithmic (it's equivalent to minbpe), instead it is efficiency-related, so it is ok to hide relatively speaking
Compare 5 commits »
tacit synced commits to refs/pull/393/merge at tacit/nanochat from mirror 2026-01-04 18:52:37 +00:00
be56d29b87 simplify redundant if/elif in bloat metrics
ee79f29fbd replace files-to-prompt with git ls-files for bloat metrics
da8b7ea4cb also delete the rustbpe test code, this now lives in rustbpe repo that is separate
aa42f40e66 delete the inline rustbpe project. it was ugly to have a project within project and rustbpe is now nicely a separate repo on my github karpathy/rustbpe and it's on pypi etc., so we just add it as a depedency to uv. i think it is appropriate that this is a separate repo because 1) it doesn't have too many knobs, other than the ones that are exposed - the regex pattern and vocab size and 2) all of its complexity is not algorithmic (it's equivalent to minbpe), instead it is efficiency-related, so it is ok to hide relatively speaking
Compare 5 commits »
tacit synced commits to refs/pull/328/merge at tacit/nanochat from mirror 2026-01-04 18:52:37 +00:00
be56d29b87 simplify redundant if/elif in bloat metrics
ee79f29fbd replace files-to-prompt with git ls-files for bloat metrics
da8b7ea4cb also delete the rustbpe test code, this now lives in rustbpe repo that is separate
aa42f40e66 delete the inline rustbpe project. it was ugly to have a project within project and rustbpe is now nicely a separate repo on my github karpathy/rustbpe and it's on pypi etc., so we just add it as a depedency to uv. i think it is appropriate that this is a separate repo because 1) it doesn't have too many knobs, other than the ones that are exposed - the regex pattern and vocab size and 2) all of its complexity is not algorithmic (it's equivalent to minbpe), instead it is efficiency-related, so it is ok to hide relatively speaking
Compare 7 commits »
tacit synced commits to refs/pull/399/merge at tacit/nanochat from mirror 2026-01-04 18:52:37 +00:00
be56d29b87 simplify redundant if/elif in bloat metrics
ee79f29fbd replace files-to-prompt with git ls-files for bloat metrics
da8b7ea4cb also delete the rustbpe test code, this now lives in rustbpe repo that is separate
aa42f40e66 delete the inline rustbpe project. it was ugly to have a project within project and rustbpe is now nicely a separate repo on my github karpathy/rustbpe and it's on pypi etc., so we just add it as a depedency to uv. i think it is appropriate that this is a separate repo because 1) it doesn't have too many knobs, other than the ones that are exposed - the regex pattern and vocab size and 2) all of its complexity is not algorithmic (it's equivalent to minbpe), instead it is efficiency-related, so it is ok to hide relatively speaking
Compare 5 commits »
tacit synced commits to refs/pull/32/merge at tacit/nanochat from mirror 2026-01-04 18:52:36 +00:00
be56d29b87 simplify redundant if/elif in bloat metrics
ee79f29fbd replace files-to-prompt with git ls-files for bloat metrics
da8b7ea4cb also delete the rustbpe test code, this now lives in rustbpe repo that is separate
aa42f40e66 delete the inline rustbpe project. it was ugly to have a project within project and rustbpe is now nicely a separate repo on my github karpathy/rustbpe and it's on pypi etc., so we just add it as a depedency to uv. i think it is appropriate that this is a separate repo because 1) it doesn't have too many knobs, other than the ones that are exposed - the regex pattern and vocab size and 2) all of its complexity is not algorithmic (it's equivalent to minbpe), instead it is efficiency-related, so it is ok to hide relatively speaking
Compare 5 commits »