mirror of
https://github.com/karpathy/nanochat.git
synced 2026-04-02 13:45:21 +00:00
Add architecture overview section to README and include architecture diagram
This commit is contained in:
parent
fa6521f7d0
commit
b86d86df17
10
README.md
10
README.md
|
|
@ -4,6 +4,16 @@
|
|||
|
||||
> The best ChatGPT that $100 can buy.
|
||||
|
||||
---
|
||||
|
||||
### Architecture Overview
|
||||
|
||||
Here’s an overview of the nanochat architecture:
|
||||
|
||||

|
||||
|
||||
---
|
||||
|
||||
This repo is a full-stack implementation of an LLM like ChatGPT in a single, clean, minimal, hackable, dependency-lite codebase. nanochat is designed to run on a single 8XH100 node via scripts like [speedrun.sh](speedrun.sh), that run the entire pipeline start to end. This includes tokenization, pretraining, finetuning, evaluation, inference, and web serving over a simple UI so that you can talk to your own LLM just like ChatGPT. nanochat will become the capstone project of the course LLM101n being developed by Eureka Labs.
|
||||
|
||||
## Quick start
|
||||
|
|
|
|||
BIN
dev/nanochat_architecture.png
Normal file
BIN
dev/nanochat_architecture.png
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 374 KiB |
Loading…
Reference in New Issue
Block a user