nanochat

mirror of https://github.com/karpathy/nanochat.git synced 2026-03-31 09:05:14 +00:00

Author	SHA1	Message	Date
google-labs-jules[bot]	2781d216c6	feat: Refactor nanochat to run on Vertex AI Pipelines. This refactoring enables the nanochat project to be executed as a scalable and robust pipeline on Vertex AI. The monolithic `speedrun.sh` script has been decomposed into a series of containerized components orchestrated by a Kubeflow pipeline. The codebase has been updated to use Google Cloud Storage for artifact management, allowing for seamless data sharing between pipeline steps. A `Dockerfile` and Python wrappers for each pipeline step have been added to the `vertex_pipelines` directory. The `README.md` has been updated with instructions on how to build the Docker image and run the Vertex AI pipeline.	2025-11-04 01:26:51 +00:00
Luke Stanley	7a52f9bfbb	Updates lockfile with CPU package support without overwriting other architectures	2025-10-21 23:14:34 +00:00
karpathy	bb786c5560	i shouldn't have committed the lock file, i missed that. revert to the flagship build which is linux. sorry to pollute the repo history...	2025-10-21 10:07:40 -07:00
karpathy	2e9669e03a	upgrading all other files to be able to use cpu/mps as well as cuda. various minor other changes, e.g. changing max_iterations to num_iterations in sft script for consistency in naming	2025-10-20 10:15:17 -07:00
karpathy	306bc380ab	add support for CPU and for MPS. I had to change a few cosmetic things. I also discovered I think a bit of a bug, where I was casting wte to bfloat16 in the wrong place (the model init) instead of in init_weights	2025-10-16 10:04:43 -07:00
karpathy	3a5e0bc50b	initial commit	2025-10-13 06:49:24 -07:00

6 Commits