Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Contribute to GitLab
Sign in
Toggle navigation
A
airtanscropt
Project
Project
Details
Activity
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
SexHackMe
airtanscropt
Commits
45156b52
Commit
45156b52
authored
Dec 11, 2025
by
Stefy Lanza (nextime / spora )
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
Add README with setup and usage instructions
parent
633bba14
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
51 additions
and
0 deletions
+51
-0
README.md
README.md
+51
-0
No files found.
README.md
0 → 100644
View file @
45156b52
# Audio Transcription App
This Python application transcribes audio files with speaker diarization and timestamps using Qwen-Omni-7B model.
## Features
-
Automatic speech recognition with Qwen-Omni-7B (4-bit quantized)
-
Speaker diarization using pyannote.audio
-
Timestamps for each utterance
-
Output in TXT format with same name as input audio
## Requirements
-
Python 3.8+
-
24GB VRAM GPU (for Qwen-Omni-7B quantized)
-
Hugging Face account with access to pyannote models
## Setup
1.
Clone or download the repository
2.
Create virtual environment:
```
bash
python3
-m
venv venv
```
3. Activate venv:
```
bash
source
venv/bin/activate
```
4. Install dependencies:
```
bash
pip
install
-r
requirements.txt
```
5. Set Hugging Face token:
```
bash
export
HF_TOKEN
=
your_huggingface_token
```
## Usage
```
bash
python transcript.py path/to/audio.wav
```
Output: `path/to/audio.txt`
## Output Format
```
[
00:00:00.00 - 00:00:05.00
]
SPEAKER_00: Transcribed text here.
[
00:00:05.00 - 00:00:10.00
]
SPEAKER_01: More transcribed text.
```
## Notes
-
Supports common audio formats (wav, mp3, etc.)
-
Requires internet for model downloads on first run
-
Processing time depends on audio length and hardware
\ No newline at end of file
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment