feat: Add multi-model support for audio transcription and image generation
- Add --audio-model and --image-model CLI arguments - Add --loadall, --audio-ctx, --audio-offload, --vision-ctx, --vision-offload args - Implement MultiModelManager class for dynamic model switching - Add POST /v1/audio/transcriptions endpoint (OpenAI-compatible) - Add POST /v1/images/generations endpoint (OpenAI-compatible) - Update endpoints to use multi_model_manager for model selection - Audio uses faster-whisper for local transcription - Images use Stable Diffusion via diffusers
Showing
This diff is collapsed.
Please
register
or
sign in
to comment