nanochat

tacit/nanochat

Fork 0

mirror of https://github.com/karpathy/nanochat.git synced 2026-01-09 21:22:41 +00:00

Commit Graph

Author	SHA1	Message	Date
google-labs-jules[bot]	08c628cb83	feat: Add ROCm and device-agnostic support This change adds support for ROCm and makes the codebase device-agnostic, allowing it to run on different hardware backends including ROCm, CUDA, and CPU. The key changes are: - Modified `pyproject.toml` to use ROCm-compatible PyTorch wheels and added the `pytorch-triton-rocm` dependency. - Refactored `nanochat/common.py` to dynamically detect the available hardware and set the device and distributed backend accordingly. - Updated all training, evaluation, and inference scripts to be device-agnostic, removing hardcoded CUDA references. - Adapted `speedrun.sh` for single-device execution by replacing `torchrun` with `python`. - Updated `nanochat/report.py` to provide more generic GPU information.	2025-10-14 05:07:30 +00:00
karpathy	3a5e0bc50b	initial commit	2025-10-13 06:49:24 -07:00

Author

SHA1

Message

Date

google-labs-jules[bot]

08c628cb83

feat: Add ROCm and device-agnostic support

This change adds support for ROCm and makes the codebase device-agnostic, allowing it to run on different hardware backends including ROCm, CUDA, and CPU.

The key changes are:
- Modified `pyproject.toml` to use ROCm-compatible PyTorch wheels and added the `pytorch-triton-rocm` dependency.
- Refactored `nanochat/common.py` to dynamically detect the available hardware and set the device and distributed backend accordingly.
- Updated all training, evaluation, and inference scripts to be device-agnostic, removing hardcoded CUDA references.
- Adapted `speedrun.sh` for single-device execution by replacing `torchrun` with `python`.
- Updated `nanochat/report.py` to provide more generic GPU information.

2025-10-14 05:07:30 +00:00

karpathy

3a5e0bc50b

initial commit

2025-10-13 06:49:24 -07:00

2 Commits