feat: Complete --force-reasoning implementation with raw mode tool extraction

- Add force_reasoning_prompt function with Big 10 family prefixes - Add inject_system and force_reasoning parameters - Update --force-reasoning CLI with comma-separated options - Add --dump option to show raw output, parsed output, and litellm debug - Fix stop tokens to include ]]> when prompt is selected - Add mock strategy for fake reasoning stats - Chain --system-prompt at start of existing system message - Add 'raw' option to --force-reasoning - Fix format_tools_for_prompt to skip in raw mode - Pass tools to format_for_raw_completion in raw mode - Add parse_and_format method to OpenAIFormatter for tool extraction - Use parse_and_format in raw mode for correct tool extraction pipeline Pipeline: Model output -> Extract reasoning (raw mode) -> ModelParserAdapter (extract tools) -> OpenAIFormatter (final format)

feat: Complete --force-reasoning implementation with raw mode tool extraction
- Add force_reasoning_prompt function with Big 10 family prefixes - Add inject_system and force_reasoning parameters - Update --force-reasoning CLI with comma-separated options - Add --dump option to show raw output, parsed output, and litellm debug - Fix stop tokens to include ]]> when prompt is selected - Add mock strategy for fake reasoning stats - Chain --system-prompt at start of existing system message - Add 'raw' option to --force-reasoning - Fix format_tools_for_prompt to skip in raw mode - Pass tools to format_for_raw_completion in raw mode - Add parse_and_format method to OpenAIFormatter for tool extraction - Use parse_and_format in raw mode for correct tool extraction pipeline Pipeline: Model output -> Extract reasoning (raw mode) -> ModelParserAdapter (extract tools) -> OpenAIFormatter (final format)
207ac71f · Your Name · 329a0042 · 207ac71f
Commit 207ac71f authored Mar 17, 2026 by Your Name
Hide whitespace changes
Inline Side-by-side

Showing with 48 additions and 8 deletions

coderai coderai +48 -8

No files found.
--- a/coderai
+++ b/coderai
@@ -2392,20 +2392,60 @@ async def chat_completions(request: ChatCompletionRequest, http_request: Request
            print(f"RAW: Generated text after cleanup: {generated_text[:100]}...")
        
        # Pass through the formatter/parser (same as regular mode)
-        # This handles tool extraction and OpenAI compatibility
-        from codai.models.parser import OpenAIFormatter
-        formatter = OpenAIFormatter(response_model_name)
+        # Pipeline: Model output -> Extract reasoning (if raw mode) -> ModelParserAdapter (extract tools) -> OpenAIFormatter (final format)
+        from codai.models.parser import OpenAIFormatter, ModelParserAdapter
+        
+        # Convert request tools for ModelParserAdapter
+        tools_list = None
+        if request.tools:
+            from codai.pydantic.textrequest import Tool, ToolFunction
+            tools_list = []
+            for t in request.tools:
+                try:
+                    # Handle both dict and pydantic model formats
+                    if isinstance(t, dict):
+                        func_data = t.get("function", {})
+                        tool_func = ToolFunction(
+                            name=func_data.get("name", ""),
+                            description=func_data.get("description"),
+                            parameters=func_data.get("parameters")
+                        )
+                    else:
+                        # Pydantic model
+                        tool_func = ToolFunction(
+                            name=t.function.name if hasattr(t.function, 'name') else str(t.function),
+                            description=t.function.description if hasattr(t.function, 'description') else None,
+                            parameters=t.function.parameters if hasattr(t.function, 'parameters') else None
+                        )
+                    tools_list.append(Tool(type=t.get("type", "function") if isinstance(t, dict) else t.type, function=tool_func))
+                except Exception as e:
+                    print(f"DEBUG: Error converting tool in raw mode: {e}, tool type: {type(t)}")
+                    continue
+        
+        # Step 1: Use ModelParserAdapter to extract tool calls from generated text
+        extracted_tool_calls = None
+        clean_text = generated_text
+        if tools_list:
+            adapter = ModelParserAdapter(model_name=response_model_name)
+            extracted_tool_calls = adapter.extract_tool_calls(generated_text, tools_list)
+            
+            if extracted_tool_calls:
+                # Strip tool calls from the text
+                clean_text = adapter.strip_tool_calls_from_content(generated_text)
+                if global_debug:
+                    print(f"RAW: Extracted {len(extracted_tool_calls)} tool calls from generated text")
        
        # Estimate token counts
        prompt_tokens = len(raw_prompt_for_generation.split())
-        completion_tokens = len(generated_text.split()) if generated_text else 0
+        completion_tokens = len(clean_text.split()) if clean_text else 0
        
-        # Format through the parser
-        formatted_response = formatter.format_litellm_full(
-            text=generated_text,
+        # Step 2: Use OpenAIFormatter for final formatting
+        formatter = OpenAIFormatter(response_model_name)
+        formatted_response = formatter.format_full(
+            text=clean_text,
            prompt_tokens=prompt_tokens,
            completion_tokens=completion_tokens,
-            tool_calls=None  # Raw mode doesn't have tool calls from generation
+            tool_calls=extracted_tool_calls
        )
        
        if global_debug: