    feat: Add multi-model support for audio transcription and image generation · 1cdfe825
    Stefy Lanza (nextime / spora ) authored
    - Add --audio-model and --image-model CLI arguments
    - Add --loadall, --audio-ctx, --audio-offload, --vision-ctx, --vision-offload args
    - Implement MultiModelManager class for dynamic model switching
    - Add POST /v1/audio/transcriptions endpoint (OpenAI-compatible)
    - Add POST /v1/images/generations endpoint (OpenAI-compatible)
    - Update endpoints to use multi_model_manager for model selection
    - Audio uses faster-whisper for local transcription
    - Images use Stable Diffusion via diffusers
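The commit message names the new flags and the MultiModelManager class but does not show their implementation. The following is only a minimal sketch of how the argument parsing and on-demand model switching could fit together. The flag types and defaults, the `build_parser` helper, the `loaders` mapping, and the `get` method are all hypothetical and not taken from the repository; `--audio-ctx`, `--audio-offload`, `--vision-ctx`, and `--vision-offload` are left out because the commit message does not state their semantics.

```python
import argparse
from typing import Any, Callable, Dict


def build_parser() -> argparse.ArgumentParser:
    """Declare three of the flags named in the commit message.

    Assumption: the model flags take a string and --loadall is a boolean switch.
    """
    parser = argparse.ArgumentParser()
    parser.add_argument("--audio-model", default=None)
    parser.add_argument("--image-model", default=None)
    parser.add_argument("--loadall", action="store_true")
    return parser


class MultiModelManager:
    """Hypothetical sketch of dynamic model switching.

    Without load_all, only the most recently requested model stays resident;
    with load_all, every configured model is loaded up front and kept.
    """

    def __init__(self, loaders: Dict[str, Callable[[], Any]], load_all: bool = False):
        self._loaders = loaders
        self._load_all = load_all
        self._loaded: Dict[str, Any] = {}
        if load_all:
            for kind, loader in loaders.items():
                self._loaded[kind] = loader()

    def get(self, kind: str) -> Any:
        """Return the model for `kind`, loading it first if it is not resident."""
        if kind not in self._loaders:
            raise KeyError(f"no model configured for {kind!r}")
        if kind not in self._loaded:
            if not self._load_all:
                # Drop whatever was active so only one model is held at a time.
                self._loaded.clear()
            self._loaded[kind] = self._loaders[kind]()
        return self._loaded[kind]
```

In a real server the loader callables would wrap the faster-whisper and diffusers model constructors, and the two new endpoints would call `get("audio")` or `get("image")` before handling a request. Switching then costs a reload each time the request type changes, which is the trade-off a `--loadall` style switch would avoid at the price of more memory.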
Name
.vscode
.gitignore
LICENSE.md
README.md
build.sh
coder
coderai
requirements-nvidia.txt
requirements-vulkan.txt
requirements.txt
requirements.txt~