Commit Graph

3 Commits

Author SHA1 Message Date
google-labs-jules[bot]
a88e7ec21f fix: Correct Docker build for rustbpe tokenizer
This commit fixes a build failure in the Docker image by building the `rustbpe` tokenizer inside an explicitly configured `uv` virtual environment.

The `Dockerfile` now explicitly creates a `uv` virtual environment, adds its `bin` directory to the `PATH`, installs `maturin` into the environment, and then runs the `maturin develop` command. This ensures that the build command executes within a fully configured environment with all necessary tools available on the `PATH`, resolving the "No such file or directory" error.
2025-11-04 02:24:08 +00:00
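
The build sequence described in commit a88e7ec21f can be sketched in Python as a command plan. This is only an illustration of the ordering and of the `PATH` handling, not the contents of the actual `Dockerfile`. The function names, the default `.venv` location, and the use of `VIRTUAL_ENV` to point `uv` and `maturin` at the environment are assumptions.

```python
import os
import subprocess
from typing import Dict, List, Optional, Tuple


def plan_rustbpe_build(
    venv_dir: str = ".venv",  # assumed location; the commit does not name one
    base_path: Optional[str] = None,
) -> Tuple[List[List[str]], Dict[str, str]]:
    """Return the commands and environment overrides for the build.

    Mirrors the steps in the commit message: create a uv virtual
    environment, put its bin directory first on PATH, install maturin
    into it, then run `maturin develop`.
    """
    venv_bin = os.path.join(venv_dir, "bin")
    if base_path is None:
        base_path = os.environ.get("PATH", "")
    # Prepend the venv's bin directory so tools installed there are found first.
    path = venv_bin + (os.pathsep + base_path if base_path else "")
    env = {"PATH": path, "VIRTUAL_ENV": venv_dir}
    commands = [
        ["uv", "venv", venv_dir],             # 1. create the environment
        ["uv", "pip", "install", "maturin"],  # 2. install maturin into it
        ["maturin", "develop"],               # 3. build and install the extension
    ]
    return commands, env


def run_plan(commands: List[List[str]], env: Dict[str, str]) -> None:
    """Execute each command in order, stopping at the first failure."""
    full_env = {**os.environ, **env}
    for cmd in commands:
        subprocess.run(cmd, env=full_env, check=True)
```

Calling `plan_rustbpe_build()` only builds the plan; `run_plan` is what would execute it, and it requires `uv` and a Rust toolchain to be installed.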
google-labs-jules[bot]
fa04262889 fix: Correct Docker build for rustbpe tokenizer
This commit fixes a build failure in the Docker image by adding the `--uv` flag to the `maturin develop` command.

The `maturin` build process was failing because it could not find `pip` within the `uv` environment. The `--uv` flag ensures that `maturin` correctly uses the `uv` environment to build the `rustbpe` tokenizer.
2025-11-04 02:05:34 +00:00
google-labs-jules[bot]
2781d216c6 feat: Refactor nanochat to run on Vertex AI Pipelines
This refactoring lets the nanochat project run as a pipeline on Vertex AI.

The monolithic `speedrun.sh` script has been decomposed into a series of containerized components orchestrated by a Kubeflow pipeline.

The codebase now uses Google Cloud Storage for artifact management, so pipeline steps can share data.

A `Dockerfile` and Python wrappers for each pipeline step have been added to the `vertex_pipelines` directory.

The `README.md` has been updated with instructions on how to build the Docker image and run the Vertex AI pipeline.
2025-11-04 01:26:51 +00:00
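
The decomposition described in commit 2781d216c6 can be illustrated with a standard-library sketch: each step declares the artifacts it reads and writes, artifacts live under a Cloud Storage prefix, and the run order follows from those dependencies. This is not the Kubeflow Pipelines API and not the code in `vertex_pipelines`. The step names, artifact names, bucket, and run ID below are all hypothetical.

```python
from dataclasses import dataclass
from graphlib import TopologicalSorter
from typing import List, Tuple


@dataclass(frozen=True)
class Step:
    """One containerized pipeline step and the artifacts it exchanges."""
    name: str
    inputs: Tuple[str, ...] = ()
    outputs: Tuple[str, ...] = ()


def artifact_uri(bucket: str, run_id: str, artifact: str) -> str:
    """Build the gs:// location where a step reads or writes an artifact."""
    return f"gs://{bucket}/{run_id}/{artifact}"


def execution_order(steps: List[Step]) -> List[str]:
    """Order steps so every artifact is produced before it is consumed."""
    producer = {art: s.name for s in steps for art in s.outputs}
    # Map each step to the set of steps that produce its inputs.
    graph = {s.name: {producer[a] for a in s.inputs} for s in steps}
    return list(TopologicalSorter(graph).static_order())


# Hypothetical steps, deliberately listed out of order.
STEPS = [
    Step("train_model", inputs=("tokenizer",), outputs=("checkpoint",)),
    Step("train_tokenizer", outputs=("tokenizer",)),
    Step("evaluate", inputs=("checkpoint",), outputs=("report",)),
]

if __name__ == "__main__":
    for name in execution_order(STEPS):
        print(name)
    print(artifact_uri("example-bucket", "run-001", "tokenizer"))
```

Because steps communicate only through artifact locations, each one can run in its own container without sharing a local filesystem, which is the property the commit message attributes to the Cloud Storage change.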