-
Your Name authored
- Added 'raw' to valid force-reasoning options (chat, stop, inject, prompt, twopass, mock, raw) - Implemented raw mode handler that: - Uses tokenizer.apply_chat_template() with add_generation_prompt=True - Seeds reasoning tag + commitment sentence - Uses two-pass generation: first captures reasoning, then gets final answer - Supports both streaming and non-streaming responses - Falls back gracefully if tokenizer not available This enables using the model's native tokenizer for prompt seeding, bypassing double-templating issues with chat APIs.
ceb4ae88
| Name |
Last commit
|
Last update |
|---|---|---|
| .vscode | ||
| codai | ||
| .gitignore | ||
| LICENSE.md | ||
| README.md | ||
| build.sh | ||
| coder | ||
| coderai | ||
| requirements-nvidia.txt | ||
| requirements-vulkan.txt | ||
| requirements.txt |