This Python application transcribes audio files with speaker diarization and timestamps using Whisper and Resemblyzer models.
This Python application transcribes audio files with speaker diarization and timestamps using Qwen2.5-Omni-7B and Resemblyzer models by default. Use --whisper to use Whisper instead.
## Features
- Automatic speech recognition with Qwen-Omni-7B (4-bit quantized)