Andrej Karpathy
|
0307997f9b
|
merge two files base_loss and base_eval into a single file, it's nicer this way, and unify the huggingface code associated with both
|
2026-02-01 02:36:43 +00:00 |
|
Andrej Karpathy
|
1ddaad1c1c
|
nuke midtraining from orbit, it's not as needed now that we have a BOS-aligned dataloader. Also change the README a lot. midtrianing is not yet fully properly erased across the board, but good enough for step 1
|
2026-01-31 19:12:25 +00:00 |
|
Andrej Karpathy
|
02baa15405
|
i am feeling in a delete mood today. i need to delete a lot of code. there is too much code and surface area and complexity. ew
|
2026-01-30 17:08:53 +00:00 |
|
Andrej Karpathy
|
067daa7758
|
small fix cpu script ty PR #474
|
2026-01-30 02:11:25 +00:00 |
|
Andrej Karpathy
|
c88bbf8133
|
Merge branch 'engram'
|
2026-01-27 22:33:16 +00:00 |
|
Andrej Karpathy
|
c8d93beed2
|
add engram-lite, add log, tune scaling laws analysis scripts
|
2026-01-27 22:31:17 +00:00 |
|
Andrej Karpathy
|
8630d32be4
|
quick fix to not OOM main speedrun script
|
2026-01-26 22:31:42 +00:00 |
|
Andrej Karpathy
|
63bb5831e2
|
something i've wanted to do for a while - move all .sh runs to their own directory so they don't pollute root dir
|
2026-01-18 15:27:41 +00:00 |
|