Your Name authored
- loadall: pre-load image models into VRAM at startup (with OOM fallback)
- loadswap: pre-load image models into CPU RAM at startup (first model stays in VRAM)
- Audio and TTS models are cached at startup and loaded into memory on first request (they use specialized loading mechanisms via faster-whisper and kokoro)
bc2b1388
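
The two image pre-load modes described in the commit message can be pictured with a minimal sketch. Everything here is hypothetical and not taken from the repository: the `preload` function, the `Placement` record, the `OutOfVRAM` exception, and the loader callbacks are illustrative stand-ins. The sketch also assumes that the "OOM fallback" means placing the model in CPU RAM instead, which the commit message does not state.

```python
from dataclasses import dataclass, field
from typing import Callable, List


class OutOfVRAM(Exception):
    """Hypothetical stand-in for a GPU out-of-memory error."""


@dataclass
class Placement:
    """Records where each model ended up after startup."""
    vram: List[str] = field(default_factory=list)
    cpu: List[str] = field(default_factory=list)


def preload(
    models: List[str],
    mode: str,
    load_to_vram: Callable[[str], None],
    load_to_cpu: Callable[[str], None],
) -> Placement:
    """Pre-load image models according to `mode` ("loadall" or "loadswap")."""
    if mode not in ("loadall", "loadswap"):
        raise ValueError(f"unknown mode: {mode!r}")

    placement = Placement()
    for index, name in enumerate(models):
        # loadall: every model targets VRAM.
        # loadswap: only the first model targets VRAM, the rest go to CPU RAM.
        wants_vram = mode == "loadall" or index == 0
        if wants_vram:
            try:
                load_to_vram(name)
                placement.vram.append(name)
                continue
            except OutOfVRAM:
                # Assumed fallback: keep the model in CPU RAM instead.
                pass
        load_to_cpu(name)
        placement.cpu.append(name)
    return placement


def make_vram_loader(capacity: int) -> Callable[[str], None]:
    """Simulated VRAM loader that fails once `capacity` models are resident."""
    resident: List[str] = []

    def load(name: str) -> None:
        if len(resident) >= capacity:
            raise OutOfVRAM(name)
        resident.append(name)

    return load
```

With a simulated VRAM budget of two models, `preload(["a", "b", "c"], "loadall", make_vram_loader(2), lambda name: None)` places `a` and `b` in VRAM and falls back to CPU RAM for `c`, while `"loadswap"` keeps only `a` in VRAM.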
| Name | Last commit | Last update |
|---|---|---|
| .vscode | ||
| codai | ||
| .gitignore | ||
| LICENSE.md | ||
| README.md | ||
| build.sh | ||
| coder | ||
| coderai | ||
| requirements-nvidia.txt | ||
| requirements-vulkan.txt | ||
| requirements.txt | ||