• Joined on 2024-05-31
tacit synced commits to refs/pull/141/merge at tacit/nanochat from mirror 2025-10-22 22:32:13 +00:00
5eeb2b6ef9 experiment: looking to 'hire' a nanochat repo czar to help the repo, mentioning in readme
2dda5c4c8d Merge branch 'ulanch-fix/ios-safari-input-overlap'
80b203ea59 also bump run1000.sh to new uv sync
917c858136 Updates lockfile with CPU package support without overwriting other architectures
Compare 15 commits »
tacit synced commits to master at tacit/nanochat from mirror 2025-10-22 22:32:13 +00:00
5eeb2b6ef9 experiment: looking to 'hire' a nanochat repo czar to help the repo, mentioning in readme
2dda5c4c8d Merge branch 'ulanch-fix/ios-safari-input-overlap'
80b203ea59 also bump run1000.sh to new uv sync
917c858136 Updates lockfile with CPU package support without overwriting other architectures
db1d5b595d Git ignore eval_bundle
Compare 14 commits »
tacit synced and deleted reference refs/tags/refs/pull/154/merge at tacit/nanochat from mirror 2025-10-22 22:32:13 +00:00
tacit synced and deleted reference refs/tags/refs/pull/153/merge at tacit/nanochat from mirror 2025-10-22 22:32:13 +00:00
tacit synced and deleted reference refs/tags/refs/pull/149/merge at tacit/nanochat from mirror 2025-10-22 22:32:13 +00:00
tacit synced and deleted reference refs/tags/refs/pull/146/merge at tacit/nanochat from mirror 2025-10-22 22:32:13 +00:00
tacit synced and deleted reference refs/tags/refs/pull/142/merge at tacit/nanochat from mirror 2025-10-22 22:32:13 +00:00
tacit synced and deleted reference refs/tags/refs/pull/122/merge at tacit/nanochat from mirror 2025-10-22 22:32:13 +00:00
tacit synced commits to refs/pull/56/merge at tacit/nanochat from mirror 2025-10-22 14:22:15 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
Compare 28 commits »
tacit synced commits to refs/pull/93/merge at tacit/nanochat from mirror 2025-10-22 14:22:15 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
Compare 28 commits »
tacit synced commits to refs/pull/86/merge at tacit/nanochat from mirror 2025-10-22 14:22:15 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
Compare 26 commits »
tacit synced commits to refs/pull/53/merge at tacit/nanochat from mirror 2025-10-22 14:22:15 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
49cd02f283 fix: remove unnecessary tensor allocation in DistAdamW optimizer
Compare 3 commits »
tacit synced commits to refs/pull/63/merge at tacit/nanochat from mirror 2025-10-22 14:22:15 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
Compare 28 commits »
tacit synced commits to refs/pull/39/merge at tacit/nanochat from mirror 2025-10-22 14:22:14 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
Compare 28 commits »
tacit synced commits to refs/pull/40/merge at tacit/nanochat from mirror 2025-10-22 14:22:14 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
Compare 28 commits »
tacit synced commits to refs/pull/38/merge at tacit/nanochat from mirror 2025-10-22 14:22:14 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
Compare 28 commits »
tacit synced commits to refs/pull/35/merge at tacit/nanochat from mirror 2025-10-22 14:22:14 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
Compare 34 commits »
tacit synced commits to refs/pull/34/merge at tacit/nanochat from mirror 2025-10-22 14:22:14 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
Compare 34 commits »
tacit synced commits to refs/pull/32/merge at tacit/nanochat from mirror 2025-10-22 14:22:14 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
a088b7a6ec use enable_gqa of pytorch sdpa, allows us to delete some code, didnt realize it's available
94ee507054 quick fix base eval due to fewshot requirement
33e8a27f91 Merge karpathy/cpu-mps-dev , adding the ability to run on CPU, on MPS, or on CUDA, with autodetect. Gnarly PR, nonzero chance I broke something.
Compare 28 commits »
tacit synced commits to refs/pull/31/merge at tacit/nanochat from mirror 2025-10-22 14:22:14 +00:00
2e938530ce delete spurious torch.empty allocation in adamw
49cd02f283 fix: remove unnecessary tensor allocation in DistAdamW optimizer
Compare 3 commits »