nanochat/modal
Manmohan Sharma 544ab89c04
fix(serve): stop turn when model emits second output block after injection
Training data taught the model to echo another <|output_start|>…<|output_end|> after our injected real tool result. Detect that second sequence and break the turn; the grounded answer has already streamed to the client.
2026-04-22 14:44:56 -07:00
..
_model.py feat: deploy d24 SFT + polished UI redesign with dark mode (#39) 2026-04-16 19:55:16 -04:00
_tokenizer.py feat: deploy d24 SFT + polished UI redesign with dark mode (#39) 2026-04-16 19:55:16 -04:00
_tools.py fix(tools): enable Tavily include_answer and fix UI overflow 2026-04-22 14:20:47 -07:00
serve.py fix(serve): stop turn when model emits second output block after injection 2026-04-22 14:44:56 -07:00