Artemis Git Integration
09f5420fab
feat: add auto-batch-size discovery to base_train, mid_train, and chat_sft with fallback defaults and manual override support
2025-11-05 16:50:27 +00:00
Artemis Git Integration
a8aad26041
feat(train): add batch sample functions for memory testing in auto-discovery
...
Add create_batch_sample_fn closures to base_train.py, mid_train.py, and chat_sft.py that generate realistic test batches matching training data formats for accurate memory
2025-11-05 16:48:55 +00:00
Artemis Git Integration
cba76ef8ef
feat(config): add auto batch size discovery with configurable parameters and CLI overrides
...
Replace hardcoded device_batch_size with auto_batch_size, batch_size_margin, batch_size_cache, and device_batch_size variables across training scripts
2025-11-05 16:47:32 +00:00
Andrej Karpathy
b8076dd367
fix bug in learning rate multiplier, it was ramping up instead of ramping down. see more in Issue #68 . also add --dry_run option useful for experimentation
2025-10-15 16:35:04 +00:00
karpathy
3a5e0bc50b
initial commit
2025-10-13 06:49:24 -07:00