Commit Graph

2 Commits

Author SHA1 Message Date
Amrit Bulusu
5c92dd02cb changing mu_transfer check to float32 and updating .md 2026-03-16 21:03:12 -04:00
Amrit Bulusu
641e8a6dd3 muP implementation: coord check, transfer check, and code quality fixes
- Fix output logit hook in coord check to apply muP scaling (base/width)
- Replace config mutation side effect with assertion in setup_optimizer
- Set mup_base_width at GPTConfig construction in base_train.py
- Remove dead code (_transfer_check_output_mult)
- Tune base LRs to center optimal multiplier near 1.0 (0.12, 6.0, 0.12)
- Use log scale on all loss plots for better low-loss detail
- Add automated muP tests (coord check + transfer check)
- Update muP_changes.md verification commands

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-14 16:28:50 -04:00