From 8ff4dae67c38c2ca726fe8ca83362cbe15a7e48c Mon Sep 17 00:00:00 2001 From: CharlesCNorton <135471798+CharlesCNorton@users.noreply.github.com> Date: Tue, 14 Oct 2025 07:47:02 -0400 Subject: [PATCH] correct subject-verb agreement in README.md Change "the scripts automatically compensates" to "the scripts automatically compensate" to match plural subject with plural verb form. --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index bc01055..549d5f9 100644 --- a/README.md +++ b/README.md @@ -80,7 +80,7 @@ torchrun --standalone --nproc_per_node=8 -m scripts.base_train -- --depth=26 --d torchrun --standalone --nproc_per_node=8 -m scripts.mid_train -- --device_batch_size=16 ``` -That's it! The biggest thing to pay attention to is making sure you have enough data shards to train on (the code will loop and do more epochs over the same training set otherwise, decreasing learning speed a bit), and managing your memory/VRAM, primarily by decreasing the `device_batch_size` until things fit (the scripts automatically compensates by increasing the number of gradient accumulation loops, simply turning parallel compute to sequential compute). +That's it! The biggest thing to pay attention to is making sure you have enough data shards to train on (the code will loop and do more epochs over the same training set otherwise, decreasing learning speed a bit), and managing your memory/VRAM, primarily by decreasing the `device_batch_size` until things fit (the scripts automatically compensate by increasing the number of gradient accumulation loops, simply turning parallel compute to sequential compute). And a bit more about computing environments that will run nanochat: