• Joined on 2024-05-31
tacit synced commits to refs/pull/86/merge at tacit/nanochat from mirror 2025-10-21 22:02:19 +00:00
03cddd9878 actually let's not brick code on git pull. change error to warning
fe5aed940b add personality to nanochat. breaks previous code on git pull and requires download of a new file from s3, but there is a helpful error message so hopefully its ok
tacit synced commits to refs/pull/91/merge at tacit/nanochat from mirror 2025-10-21 22:02:19 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
tacit synced commits to refs/pull/85/merge at tacit/nanochat from mirror 2025-10-21 22:02:19 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
tacit synced commits to refs/pull/6/merge at tacit/nanochat from mirror 2025-10-21 22:02:19 +00:00
03cddd9878 actually let's not brick code on git pull. change error to warning
fe5aed940b add personality to nanochat. breaks previous code on git pull and requires download of a new file from s3, but there is a helpful error message so hopefully its ok
tacit synced commits to refs/pull/89/merge at tacit/nanochat from mirror 2025-10-21 22:02:19 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
tacit synced commits to refs/pull/88/head at tacit/nanochat from mirror 2025-10-21 22:02:19 +00:00
50bea28ef9 also add readme mention of the cpu mps changes
5bdc99abfb merge and resolve conflict
dfcb1c16f1 Merge branch 'master' into cpu-mps-dev
bb71c64579 fix silly issue in dataloader, this version is much faster and more portable to mps too
bb786c5560 i shouldnt have committed the lock file, i missed that. revert to the flagship build which is linux. sorry to pollute the repo history...
tacit synced commits to refs/pull/75/merge at tacit/nanochat from mirror 2025-10-21 22:02:19 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
tacit synced commits to refs/pull/36/merge at tacit/nanochat from mirror 2025-10-21 22:02:18 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
tacit synced commits to refs/pull/54/merge at tacit/nanochat from mirror 2025-10-21 22:02:18 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
tacit synced commits to refs/pull/53/merge at tacit/nanochat from mirror 2025-10-21 22:02:18 +00:00
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
50bea28ef9 also add readme mention of the cpu mps changes
tacit synced commits to refs/pull/49/merge at tacit/nanochat from mirror 2025-10-21 22:02:18 +00:00
c9ea7a91e2 Add customization instructions to README
03cddd9878 actually let's not brick code on git pull. change error to warning
fe5aed940b add personality to nanochat. breaks previous code on git pull and requires download of a new file from s3, but there is a helpful error message so hopefully its ok
tacit synced commits to refs/pull/43/merge at tacit/nanochat from mirror 2025-10-21 22:02:18 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
tacit synced commits to refs/pull/31/merge at tacit/nanochat from mirror 2025-10-21 22:02:18 +00:00
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
50bea28ef9 also add readme mention of the cpu mps changes
tacit synced commits to refs/pull/59/merge at tacit/nanochat from mirror 2025-10-21 22:02:18 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
tacit synced commits to refs/pull/4/merge at tacit/nanochat from mirror 2025-10-21 22:02:18 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
tacit synced commits to refs/pull/30/merge at tacit/nanochat from mirror 2025-10-21 22:02:18 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
tacit synced commits to refs/pull/23/merge at tacit/nanochat from mirror 2025-10-21 22:02:17 +00:00
bb71c64579 fix silly issue in dataloader, this version is much faster and more portable to mps too
c9ea7a91e2 Add customization instructions to README
03cddd9878 actually let's not brick code on git pull. change error to warning
fe5aed940b add personality to nanochat. breaks previous code on git pull and requires download of a new file from s3, but there is a helpful error message so hopefully its ok
tacit synced commits to refs/pull/30/head at tacit/nanochat from mirror 2025-10-21 22:02:17 +00:00
c5ef68cea2 Add comprehensive educational guide for nanochat
tacit synced commits to refs/pull/27/merge at tacit/nanochat from mirror 2025-10-21 22:02:17 +00:00
e2f3f58fa7 Merge 8b2b78ce43b9f441478dd1512982cc84e0fd8f08 into c9ea7a91e2
c9ea7a91e2 Add customization instructions to README
03cddd9878 actually let's not brick code on git pull. change error to warning
fe5aed940b add personality to nanochat. breaks previous code on git pull and requires download of a new file from s3, but there is a helpful error message so hopefully its ok
tacit synced commits to refs/pull/24/merge at tacit/nanochat from mirror 2025-10-21 22:02:17 +00:00
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
50bea28ef9 also add readme mention of the cpu mps changes
5bdc99abfb merge and resolve conflict
dfcb1c16f1 Merge branch 'master' into cpu-mps-dev