feat: Add multi-model support for audio transcription and image generation (1cdfe825) · Commits · nexlab / coderai

Commit 1cdfe825 authored Mar 08, 2026 by

Stefy Lanza (nextime / spora )

feat: Add multi-model support for audio transcription and image generation

- Add --audio-model and --image-model CLI arguments
- Add --loadall, --audio-ctx, --audio-offload, --vision-ctx, --vision-offload args
- Implement MultiModelManager class for dynamic model switching
- Add POST /v1/audio/transcriptions endpoint (OpenAI-compatible)
- Add POST /v1/images/generations endpoint (OpenAI-compatible)
- Update endpoints to use multi_model_manager for model selection
- Audio uses faster-whisper for local transcription
- Images use Stable Diffusion via diffusers

parent eb6b8d85

Expand all Hide whitespace changes

Inline Side-by-side

View file @ 1cdfe825

This diff is collapsed.

Please register or to comment