- 22 Mar, 2026 4 commits
-
-
Your Name authored
- Add max_context field to CondensationConfig
- Support 'internal' keyword for local HuggingFace model in condensation
- Add internal model initialization with temperature=0.3, top_p=0.8, repeat_penalty=1.1
- Create condensation system prompts (conversational, semantic)
- Add aisbf.json for server configuration (host, port, dashboard auth)
- Update main.py to read server config from aisbf.json
- Update providers.json with max_context example for condensation
-
Your Name authored
- Changed autoselect prompt from single user message to system+user split
- System message contains the skill instructions
- User message contains the prompt and model list
- Set temperature to 0 for deterministic selection
- Added stop parameter: </aisbf_model_autoselection>
- Updated internal model handling to combine messages
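The split described above can be sketched as a request payload. The message contents below are illustrative placeholders, not the real skill prompt or model list:

```python
# Sketch of the autoselect request after the system+user split.
# Placeholder contents; only temperature and stop come from the commit.
autoselect_request = {
    "messages": [
        # System message carries the skill instructions
        {"role": "system", "content": "<autoselect skill instructions>"},
        # User message carries the prompt and the candidate model list
        {"role": "user", "content": "Prompt: ...\nModels: rotation/provider/model, ..."},
    ],
    "temperature": 0,  # deterministic selection
    "stop": ["</aisbf_model_autoselection>"],
}
```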
-
Your Name authored
- Centralized API key storage in providers.json only
- Added support for provider-only rotation entries (auto-selects random model)
- Added default settings hierarchy at provider and rotation levels
- Limited autoselect selection context to 10 messages or 8000 tokens
- Added support for direct provider models in autoselect (rotation/provider/model)
- Added 'internal' keyword for local HuggingFace model selection
- Updated requirements.txt with torch and transformers
-
Your Name authored
- Integrated complete kiro-gateway conversion pipeline (1,522 lines)
- Added multi-source authentication (Kiro IDE, kiro-cli, env vars)
- Implemented full OpenAI <-> Kiro format conversion
- Added support for tools/function calling
- Added support for images/multimodal content
- Implemented message merging, validation, and role normalization
- Added KiroAuthManager for automatic token refresh
- Created comprehensive conversion modules:
  - aisbf/kiro_converters.py (core conversion logic)
  - aisbf/kiro_converters_openai.py (OpenAI adapter)
  - aisbf/kiro_models.py (data models)
  - aisbf/kiro_auth.py (authentication)
  - aisbf/kiro_utils.py (utilities)
- Updated KiroProviderHandler with full conversion pipeline
- Added kiro_config support to ProviderConfig
- Updated providers.json with clean Kiro examples
- Added comprehensive documentation (KIRO_INTEGRATION.md)
- Implemented model name prefixing across all providers (provider_id/model)

No external kiro-gateway server needed - all functionality built-in.
-
- 09 Feb, 2026 7 commits
-
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
Fix Google provider tool formatting - pass function declarations directly instead of wrapping in dict
-
- 08 Feb, 2026 29 commits
-
-
Stefy Lanza (nextime / spora ) authored
Changed line 511 to check list length instead of boolean evaluation. This fixes the bug where tool calls extracted from Google chunks were not being sent to clients because empty lists were being treated as falsy.

Before: 'tool_calls': delta_tool_calls if delta_tool_calls else None
After: 'tool_calls': delta_tool_calls if len(delta_tool_calls) > 0 else None
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- Added extraction of function_call from Google chunk parts
- Convert Google function_call to OpenAI-compatible tool_calls format
- Track accumulated_tool_calls to calculate deltas
- Include tool_calls in the delta when present
- Send chunks when there are new tool calls, not just text
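A minimal sketch of the conversion step described above, assuming a Google chunk part carries a function_call dict with name and args fields (the field names here are assumptions, not verified against either SDK):

```python
import json

def google_function_call_to_tool_call(part, index=0):
    """Convert a Google-style function_call part into an
    OpenAI-compatible tool_calls entry (hypothetical shapes)."""
    fc = part.get("function_call", {})
    return {
        "index": index,
        "id": f"call_{index}",
        "type": "function",
        "function": {
            "name": fc.get("name", ""),
            # OpenAI expects arguments as a JSON-encoded string
            "arguments": json.dumps(fc.get("args", {})),
        },
    }
```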
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- Removed stray 'd' identifier on line 494
- Fixed indentation issues on lines 494 and 537
- File now compiles without errors
-
Stefy Lanza (nextime / spora ) authored
- Fixed tool conversion to create single function_declarations array
- This prevents Google from detecting malformed tool use
- Tools are now properly converted from OpenAI to Google format
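The fix amounts to collapsing all OpenAI tool definitions into one wrapper with a single function_declarations array, rather than wrapping each tool separately. A sketch, with field names assumed from the two APIs:

```python
def openai_tools_to_google(tools):
    """Collapse OpenAI tool definitions into one Google-style tools
    entry with a single function_declarations array (sketch)."""
    declarations = []
    for tool in tools:
        fn = tool.get("function", {})
        declarations.append({
            "name": fn.get("name"),
            "description": fn.get("description", ""),
            "parameters": fn.get("parameters", {}),
        })
    # One wrapper dict holding every declaration -- a list of
    # separately wrapped declarations is what Google rejects.
    return [{"function_declarations": declarations}]
```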
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
Improve error message formatting by replacing semicolon separators with newlines for better readability
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- Detect and unwrap responses wrapped in 'assistant: [{"type": "text", "text": "..."}]' format
- Use extracted text for response content instead of raw accumulated text
- Fix variable scoping issue with tool_match variable
- Update token counting to use final_text when available
-
Stefy Lanza (nextime / spora ) authored
Instead of collecting all chunks and sending a modified response:
- Stream chunks normally as they come (with deltas like before)
- Only at the END, if tool call pattern detected, send additional chunk with tool_calls
- Then send final chunk with usage statistics

This preserves the original streaming behavior while adding tool call detection.
-
Stefy Lanza (nextime / spora ) authored
When the model returns a response in the format assistant: [{'type': 'text', 'text': '...'}] but without a tool call, extract just the text content instead of returning the raw wrapper format.
-
Stefy Lanza (nextime / spora ) authored
The model returns literal \n (backslash-n) instead of actual newlines. This breaks JSON parsing because {\n is not valid JSON syntax. Use codecs.decode with 'unicode_escape' to convert escape sequences to actual characters before parsing.
-
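The repair described above can be sketched as a strict-first parse with an unescaping fallback (the function name is illustrative, not from the codebase):

```python
import codecs
import json

def unescape_model_json(raw):
    """Parse model output as JSON; if it fails because the model
    emitted literal backslash-n sequences instead of real newlines,
    decode escape sequences with 'unicode_escape' and retry (sketch)."""
    try:
        return json.loads(raw)
    except json.JSONDecodeError:
        # '{\n ...' with a literal backslash-n is invalid JSON;
        # unicode_escape turns the two-character \n into a newline.
        return json.loads(codecs.decode(raw, "unicode_escape"))
```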
Stefy Lanza (nextime / spora ) authored
- Log accumulated response text (first 500 and last 200 chars)
- Log extracted tool JSON with length and byte details
- Log ASCII codes for first 20 chars to detect encoding issues
- Log JSON parse errors with position details
- Log success/failure of JSON parsing attempts
-
Stefy Lanza (nextime / spora ) authored
- Use brace counting for robust JSON extraction
- Try JSON first, then fix common issues (single quotes, trailing commas)
- Extract final assistant text using regex after tool JSON
- Remove complex nested parsing that was failing with escaped quotes
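Brace counting here means scanning for the first balanced {...} span while tracking string and escape state, which survives nested objects and braces inside string values where a regex fails. A sketch of the approach (not the codebase's actual function):

```python
def extract_first_json_object(text):
    """Return the first balanced {...} substring of text, or None.
    Tracks string/escape state so braces inside strings don't
    confuse the depth counter (sketch of brace counting)."""
    start = text.find("{")
    if start == -1:
        return None
    depth = 0
    in_string = False
    escape = False
    for i, ch in enumerate(text[start:], start):
        if escape:
            escape = False
            continue
        if ch == "\\":
            escape = True
        elif ch == '"':
            in_string = not in_string
        elif not in_string:
            if ch == "{":
                depth += 1
            elif ch == "}":
                depth -= 1
                if depth == 0:
                    return text[start:i + 1]
    return None  # unbalanced input
```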
-
Stefy Lanza (nextime / spora ) authored
- Models may return single quotes instead of double quotes
- Fall back to ast.literal_eval when JSON parsing fails
- Handle both JSON and Python-style literals in streaming responses
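The fallback chain is small enough to show in full; this is a sketch of the pattern, not the codebase's function:

```python
import ast
import json

def parse_tool_payload(text):
    """Parse a tool payload that may be strict JSON or a
    Python-style literal with single quotes. Try JSON first,
    then fall back to ast.literal_eval, which safely evaluates
    Python literals without executing code (sketch)."""
    try:
        return json.loads(text)
    except json.JSONDecodeError:
        return ast.literal_eval(text)
```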
-
Stefy Lanza (nextime / spora ) authored
- Detect tool calls in accumulated streaming text after all chunks received
- Parse nested 'assistant: [...]' format with tool calls inside
- Parse simple 'tool: {...}' format
- Convert detected tool calls to OpenAI-compatible format
- Send tool_calls in first chunk, then final assistant text
- Proper handling of finish_reason in final chunk
-
Stefy Lanza (nextime / spora ) authored
- Detect when entire response is wrapped in 'assistant: [...]'
- Parse nested 'tool: {...}' inside the assistant text
- Extract final assistant text from nested structure
- Handle multi-line JSON content with proper brace counting
- More robust parsing for complex nested formats
-
Stefy Lanza (nextime / spora ) authored
- Detect '"content": "..." } assistant: [...]' pattern
- Extract tool content and convert to write action
- Extract assistant text from JSON array
- Handle multi-line content with newlines
- More robust tool call detection for various text formats
-
Stefy Lanza (nextime / spora ) authored
- Detect 'tool: {...}' pattern in Google model text responses
- Parse and convert to OpenAI-compatible tool_calls format
- Extract assistant text from 'assistant: [...]' format if present
- Handle both 'action' and 'name' fields for tool identification
- Convert arguments to JSON string for OpenAI compatibility

This fixes issues where models return tool calls as text instead of using proper function_call attributes.
-
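A minimal sketch of the detection and conversion, assuming a flat JSON payload after the 'tool:' marker with an 'action' or 'name' field (the regex and function are illustrative; nested payloads would need the brace-counting extractor instead):

```python
import json
import re

# Non-greedy match: only handles flat payloads without nested braces
TOOL_PATTERN = re.compile(r"tool:\s*(\{.*?\})", re.DOTALL)

def text_tool_call_to_openai(text):
    """Detect a 'tool: {...}' pattern in plain model text and convert
    it to an OpenAI-style tool_calls entry, or return None (sketch)."""
    m = TOOL_PATTERN.search(text)
    if not m:
        return None
    payload = json.loads(m.group(1))
    # Models use either 'action' or 'name' to identify the tool
    name = payload.pop("action", None) or payload.pop("name", None)
    return {
        "id": "call_0",
        "type": "function",
        "function": {
            "name": name,
            # Remaining fields become the arguments, as a JSON string
            "arguments": json.dumps(payload),
        },
    }
```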
Stefy Lanza (nextime / spora ) authored
- Google provider now yields raw chunk objects instead of pre-formatted SSE bytes
- handlers.py handles the conversion to OpenAI-compatible format
- This fixes the issue where clients weren't receiving streaming responses

Note: Server must be restarted to pick up this change
-