Fix force-reasoning bugs: duplicate tools, reasoning duplication, tool extraction

- Bug 1: Skip format_tools_for_prompt in raw mode (already had condition)
- Bug 2: Use final_text (after reasoning) instead of generated_text for formatter
- Bug 3: Pass final_text to ModelParserAdapter instead of generated_text

This prevents reasoning from appearing in both content AND reasoning fields, and allows the tool parser to properly extract tool calls without being confused by reasoning tags.
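
The effect of Bugs 2 and 3 can be sketched in isolation. This is a minimal illustration, not the code from this commit: the `<think>` and `<tool_call>` tag formats, the regexes, and the helper functions `split_reasoning`, `extract_tool_calls`, and `strip_tool_calls` are all assumptions standing in for the project's reasoning handling and `ModelParserAdapter`. Only the names `generated_text` and `final_text` come from the change itself.

```python
import json
import re

# Assumed tag formats, for illustration only; the real model may use others.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)
TOOL_CALL_RE = re.compile(r"<tool_call>(.*?)</tool_call>", re.DOTALL)


def split_reasoning(generated_text):
    """Return (reasoning, final_text) from the full model output."""
    reasoning = "\n".join(m.strip() for m in THINK_RE.findall(generated_text))
    final_text = THINK_RE.sub("", generated_text).strip()
    return reasoning, final_text


def extract_tool_calls(text):
    """Hypothetical stand-in for ModelParserAdapter.extract_tool_calls."""
    calls = []
    for raw in TOOL_CALL_RE.findall(text):
        try:
            calls.append(json.loads(raw))
        except json.JSONDecodeError:
            continue  # skip malformed payloads
    return calls


def strip_tool_calls(text):
    """Hypothetical stand-in for strip_tool_calls_from_content."""
    return TOOL_CALL_RE.sub("", text).strip()


# The reasoning block mentions a tool call that the model then rejects.
generated_text = (
    '<think>Maybe I should emit <tool_call>{"name": "draft"}</tool_call> '
    "... no, use search.</think>"
    'Looking that up. <tool_call>{"name": "search", "arguments": {"q": "x"}}</tool_call>'
)

reasoning, final_text = split_reasoning(generated_text)

# Parsing generated_text would also pick up the rejected "draft" call.
calls_from_full_output = extract_tool_calls(generated_text)

# Parsing final_text sees only what follows the reasoning.
tool_calls = extract_tool_calls(final_text)
content = strip_tool_calls(final_text)
```

In this sketch, parsing `generated_text` yields two tool calls, one of which exists only inside the reasoning, while parsing `final_text` yields the single intended call and leaves `content` free of reasoning text.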

bcd9bc55 · Your Name · 017c0399 · bcd9bc55
Commit bcd9bc55 authored Mar 17, 2026 by Your Name

Showing with 8 additions and 5 deletions

coderai +8 -5

--- a/coderai
+++ b/coderai
@@ -2429,18 +2429,21 @@ async def chat_completions(request: ChatCompletionRequest, http_request: Request
                    print(f"DEBUG: Error converting tool in raw mode: {e}, tool type: {type(t)}")
                    continue
-        # Step 1: Use ModelParserAdapter to extract tool calls from generated text
+        # Step 1: Use ModelParserAdapter to extract tool calls from final_text (NOT generated_text which includes reasoning)
+        # This fixes Bug 2 and Bug 3: reasoning was appearing in both content AND reasoning fields
+        # because the parser was receiving the full generated_text including reasoning
        extracted_tool_calls = None
-        clean_text = generated_text
+        clean_text = final_text  # Use final_text (after reasoning) instead of generated_text (which includes reasoning)
        if tools_list:
            adapter = ModelParserAdapter(model_name=response_model_name)
-            extracted_tool_calls = adapter.extract_tool_calls(generated_text, tools_list)
+            # Extract tool calls from final_text only (after reasoning is done)
+            extracted_tool_calls = adapter.extract_tool_calls(final_text, tools_list)
            if extracted_tool_calls:
                # Strip tool calls from the text
-                clean_text = adapter.strip_tool_calls_from_content(generated_text)
+                clean_text = adapter.strip_tool_calls_from_content(final_text)
                if global_debug:
-                    print(f"RAW: Extracted {len(extracted_tool_calls)} tool calls from generated text")
+                    print(f"RAW: Extracted {len(extracted_tool_calls)} tool calls from final_text (after reasoning)")
        # Estimate token counts
        prompt_tokens = len(raw_prompt_for_generation.split())