This refactoring enables the nanochat project to be executed as a scalable and robust pipeline on Vertex AI.
The monolithic `speedrun.sh` script has been decomposed into a series of containerized components orchestrated by a Kubeflow pipeline.
The codebase has been updated to use Google Cloud Storage for artifact management, allowing for seamless data sharing between pipeline steps.
A `Dockerfile` and Python wrappers for each pipeline step have been added to the `vertex_pipelines` directory.
The `README.md` has been updated with instructions on how to build the Docker image and run the Vertex AI pipeline.