nanochat/services/inference/src/engine
2026-04-16 11:19:18 -07:00
..
__init__.py extract standalone inference service 2026-04-16 11:19:18 -07:00
checkpoint_manager.py extract standalone inference service 2026-04-16 11:19:18 -07:00
common.py extract standalone inference service 2026-04-16 11:19:18 -07:00
engine.py extract standalone inference service 2026-04-16 11:19:18 -07:00
execution.py extract standalone inference service 2026-04-16 11:19:18 -07:00
flash_attention.py extract standalone inference service 2026-04-16 11:19:18 -07:00
fp8.py extract standalone inference service 2026-04-16 11:19:18 -07:00
gpt.py extract standalone inference service 2026-04-16 11:19:18 -07:00
optim.py extract standalone inference service 2026-04-16 11:19:18 -07:00
tokenizer.py extract standalone inference service 2026-04-16 11:19:18 -07:00
tools.py extract standalone inference service 2026-04-16 11:19:18 -07:00