From 03398ce70a560def4b8d46e5a2240f7cd350303d Mon Sep 17 00:00:00 2001
From: Hossein-Lakzaei
Date: Wed, 15 Oct 2025 00:46:25 +0330
Subject: [PATCH] Enhance README by consolidating LLM implementation description and removing redundancy

---
 README.md | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index 39da38c..70c307d 100644
--- a/README.md
+++ b/README.md
@@ -4,6 +4,9 @@
 
 > The best ChatGPT that $100 can buy.
 
+
+This repo is a full-stack implementation of an LLM like ChatGPT in a single, clean, minimal, hackable, dependency-lite codebase. nanochat is designed to run on a single 8XH100 node via scripts like [speedrun.sh](speedrun.sh), that run the entire pipeline start to end. This includes tokenization, pretraining, finetuning, evaluation, inference, and web serving over a simple UI so that you can talk to your own LLM just like ChatGPT. nanochat will become the capstone project of the course LLM101n being developed by Eureka Labs.
+
 ---
 
 ### Architecture Overview
@@ -14,8 +17,6 @@ Here’s an overview of the nanochat architecture:
 
 ---
 
-This repo is a full-stack implementation of an LLM like ChatGPT in a single, clean, minimal, hackable, dependency-lite codebase. nanochat is designed to run on a single 8XH100 node via scripts like [speedrun.sh](speedrun.sh), that run the entire pipeline start to end. This includes tokenization, pretraining, finetuning, evaluation, inference, and web serving over a simple UI so that you can talk to your own LLM just like ChatGPT. nanochat will become the capstone project of the course LLM101n being developed by Eureka Labs.
-
 ## Quick start
 
 The fastest way to feel the magic is to run the speedrun script [speedrun.sh](speedrun.sh), which trains and inferences the $100 tier of nanochat. On an 8XH100 node at $24/hr, this gives a total run time of about 4 hours. Boot up a new 8XH100 GPU box from your favorite provider (e.g. I use and like [Lambda](https://lambda.ai/service/gpu-cloud)), and kick off the training script:
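
For context, the Quick start paragraph kept as context by the second hunk ends with "kick off the training script:". Below is a minimal sketch of what that launch might look like on a fresh 8XH100 box; the clone URL and the use of `screen` are assumptions for illustration and are not part of this patch.

```bash
# Hypothetical launch sequence; adjust the clone URL to your own fork if needed.
git clone https://github.com/karpathy/nanochat.git
cd nanochat

# Run inside screen so the roughly 4-hour speedrun survives an SSH disconnect.
# Per the README, speedrun.sh drives the whole pipeline end to end
# (tokenization, pretraining, finetuning, evaluation, inference).
screen -L -Logfile speedrun.log -S speedrun bash speedrun.sh
```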