Commit Graph

4 Commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Artemis Git Integration | 5cd79225c4 | feat(train): enable torch.compile for chat_sft with fixed shapes for 30-50% speedup | 2025-11-05 16:07:54 +00:00 |
| Artemis Git Integration | d8be015b20 | feat(chat_sft): add fixed-length padding for torch.compile compatibility. Replace variable-length padding with fixed 2048-token padding to create constant batch shapes, enabling efficient torch.compile in subsequent training steps | 2025-11-05 16:04:26 +00:00 |
| Andrej Karpathy | 190d9515d0 | dont evaluate the sampling evals during SFT they are too slow. keep the multiple choice evals. delete unused imports | 2025-10-15 16:42:23 +00:00 |
| karpathy | 3a5e0bc50b | initial commit | 2025-10-13 06:49:24 -07:00 |
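
Commit d8be015b20 describes replacing variable-length padding with fixed 2048-token padding so that every batch has the same shape. The following is a minimal plain-Python sketch of that idea, not the repository's actual code: the function names, the pad id of 0, and the truncation of over-long sequences are assumptions made for illustration.

```python
FIXED_LEN = 2048  # target length stated in the commit message
PAD_ID = 0        # hypothetical pad token id (assumption)


def pad_variable(batch, pad_id=PAD_ID):
    """Pad each sequence to the longest one in this batch.

    The resulting width changes from batch to batch, which is the
    behavior the commit message says it replaces.
    """
    max_len = max(len(seq) for seq in batch)
    return [seq + [pad_id] * (max_len - len(seq)) for seq in batch]


def pad_fixed(batch, fixed_len=FIXED_LEN, pad_id=PAD_ID):
    """Pad every sequence to the same fixed length.

    Every batch comes out with width `fixed_len`, so the shape is constant.
    Truncating longer sequences is an assumption of this sketch.
    """
    padded = []
    for seq in batch:
        seq = seq[:fixed_len]
        padded.append(seq + [pad_id] * (fixed_len - len(seq)))
    return padded


batch_a = [[1, 2, 3], [4]]
batch_b = [[5, 6, 7, 8, 9], [10, 11]]

# Variable padding: the width depends on the batch contents.
widths_variable = [len(pad_variable(b)[0]) for b in (batch_a, batch_b)]

# Fixed padding: the width is the same for every batch.
widths_fixed = [len(pad_fixed(b)[0]) for b in (batch_a, batch_b)]
```

With variable padding the two example batches end up with different widths, while fixed padding gives both the same width. Constant shapes are what the commit message cites as enabling efficient torch.compile; the trade-off is extra computation spent on padding tokens for short sequences.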