Merge origin/master into muonh

Resolved conflicts: - nanochat/fp8.py: Kept _Float8MatmulND class from muonh - scripts/base_train.py: Kept dual lrm logging from muonh
2026-06-16 02:59:10 +00:00 · 2026-02-12 21:30:17 -08:00 · 2026-02-12 21:30:17 -08:00 · 330fa1188c
commit 330fa1188c
parent 25ec1e6c43 2f09686724
1 changed files with 1 additions and 0 deletions
--- a/nanochat/fp8.py
+++ b/nanochat/fp8.py
@ -271,6 +271,7 @@ class _Float8MatmulND(torch.autograd.Function):
        return grad_input, grad_weight


+
 class Float8Linear(nn.Linear):
    """Drop-in nn.Linear replacement that does FP8 compute.