- 05 Oct, 2025 (13 commits)
-
-
Stefy Lanza (nextime / spora ) authored
- Move db_type handling to environment variables to avoid circular dependency
- get_db_config() now uses environment variables for database type
- get_config() handles db_type specially to prevent recursion
- Fixes ImportError when running vidai.py
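A minimal sketch of the pattern this commit describes, assuming a VIDAI_DB_TYPE environment variable and helper names chosen for illustration (they are not necessarily the project's real identifiers):

```python
import os

def get_db_config() -> dict:
    """Read database settings from environment variables only, so this
    function never imports the database layer (avoids the circular import)."""
    return {
        "db_type": os.environ.get("VIDAI_DB_TYPE", "sqlite"),  # assumed variable name
        "db_path": os.environ.get("VIDAI_DB_PATH", "vidai.db"),
    }

def get_config(key: str, default=None):
    """General config lookup. 'db_type' is answered from the environment
    directly, so resolving it never recurses into the database-backed store."""
    if key == "db_type":
        return get_db_config()["db_type"]
    # ... other keys would fall through to the database-backed config store ...
    return default
```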
-
Stefy Lanza (nextime / spora ) authored
- Create executable launcher script for non-PyInstaller builds
- Automatically detects and uses appropriate virtual environments
- Launches backend, web interface, and workers with correct CUDA/ROCm environments
- Reads configuration to determine backend selection
- Provides process management and cleanup on exit
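A rough sketch of such a launcher, assuming venv-cuda/venv-rocm directories and module entry points named here purely for illustration:

```python
import atexit
import os
import signal
import subprocess
import sys

# The backend choice would come from the saved configuration; an env var is a stand-in here.
BACKEND = os.environ.get("VIDAI_BACKEND", "cuda")
PYTHON = os.path.join(os.getcwd(), f"venv-{BACKEND}", "bin", "python")

procs = []

def cleanup():
    """Terminate every child process when the launcher exits."""
    for p in procs:
        if p.poll() is None:
            p.terminate()

atexit.register(cleanup)
signal.signal(signal.SIGTERM, lambda *_: sys.exit(0))

# Entry-point module names are placeholders for the real backend/web/worker modules.
for module in ("vidai.backend", "vidai.web", "vidai.worker"):
    procs.append(subprocess.Popen([PYTHON, "-m", module]))

for p in procs:
    p.wait()
```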
-
Stefy Lanza (nextime / spora ) authored
- Setup script now creates both venv-cuda and venv-rocm by default
- Build script builds both CUDA and ROCm worker versions
- Start script reads configuration to launch appropriate backend workers
- Added pip3 detection and --break-system-packages support
- Windows batch files updated with pip3 detection
- Runtime can switch between CUDA/ROCm implementations based on config
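A small sketch of the runtime selection and pip3 detection described above; the function names and the "auto" fallback behaviour are assumptions for illustration:

```python
import os
import shutil

def pick_backend(configured: str = "auto") -> str:
    """Choose between the CUDA and ROCm environments. 'configured' would come
    from the saved configuration; 'auto' falls back to whichever venv exists."""
    if configured in ("cuda", "rocm"):
        return configured
    for candidate in ("cuda", "rocm"):
        if os.path.isdir(f"venv-{candidate}"):
            return candidate
    return "cpu"

def pip_command() -> list[str]:
    """Prefer pip3 when available, mirroring the scripts' pip3 detection."""
    pip = shutil.which("pip3") or shutil.which("pip")
    if pip is None:
        raise RuntimeError("pip not found")
    return [pip, "install", "--break-system-packages"]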
-
Stefy Lanza (nextime / spora ) authored
- Remove invalid local version labels from torch requirements
- Use proper pip syntax: torch>=2.0.0 with --index-url for CUDA/ROCm
- Fix pip installation errors for CUDA and ROCm builds
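For illustration, the corrected style looks roughly like this; the exact index URLs depend on the CUDA/ROCm release being targeted, so treat these as examples rather than the project's actual pins:

```
# requirements-cuda.txt
torch>=2.0.0
#   pip3 install -r requirements-cuda.txt --index-url https://download.pytorch.org/whl/cu121

# requirements-rocm.txt
torch>=2.0.0
#   pip3 install -r requirements-rocm.txt --index-url https://download.pytorch.org/whl/rocm6.0
```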
-
Stefy Lanza (nextime / spora ) authored
- Add network configuration settings: web_host, web_port, backend_host, backend_web_port, backend_worker_port
- Update web.py to use configurable host and port for Flask app
- Update backend.py to use configurable ports for socket servers
- Update worker processes to use configurable backend worker port
- Add network configuration section to admin panel
- Update command line arguments to accept network settings
- Maintain backward compatibility with default values
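A minimal sketch of how these settings could be exposed on the command line; the option names mirror the list above, but the concrete defaults are assumptions, not the project's real values:

```python
import argparse

# Defaults keep backward compatibility when no flags are passed (values are illustrative).
DEFAULTS = {
    "web_host": "0.0.0.0",
    "web_port": 5000,
    "backend_host": "127.0.0.1",
    "backend_web_port": 5001,
    "backend_worker_port": 5002,
}

parser = argparse.ArgumentParser()
for key, value in DEFAULTS.items():
    parser.add_argument(f"--{key.replace('_', '-')}", default=value, type=type(value))
args = parser.parse_args()

# The Flask app would then be started with the configurable values, e.g.:
# app.run(host=args.web_host, port=args.web_port)
```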
-
Stefy Lanza (nextime / spora ) authored
- Add MySQL support as alternative to SQLite with full configuration - Implement database abstraction layer supporting both SQLite and MySQL - Add database configuration section to admin panel - Update requirements files with PyMySQL dependency - Complete multi-process architecture with web, backend, and worker processes - Add user registration system with email confirmation - Implement token-based usage system with payment processing - Add comprehensive documentation and licensing - Create professional SaaS-style web interface - Implement REST API for programmatic access - Add email notification system and payment processors
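A minimal sketch of a SQLite/MySQL abstraction of the kind described; the config keys are assumptions, and the real layer in the project may look quite different:

```python
import sqlite3

def get_connection(cfg: dict):
    """Return a DB-API connection for either backend.
    Keys in cfg are illustrative placeholders."""
    if cfg.get("db_type") == "mysql":
        import pymysql  # from the PyMySQL dependency added in this commit
        return pymysql.connect(
            host=cfg["db_host"], user=cfg["db_user"],
            password=cfg["db_password"], database=cfg["db_name"],
        )
    return sqlite3.connect(cfg.get("db_path", "vidai.db"))

# Both drivers follow PEP 249, so callers can share the same cursor-based code:
# conn = get_connection(cfg); cur = conn.cursor(); cur.execute("SELECT 1")
```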
-
Stefy Lanza (nextime / spora ) authored
🎯 RunPod.io Cloud GPU Integration
- Dynamic pod creation and lifecycle management
- On-demand GPU scaling without local hardware costs
- Seamless integration with existing multi-process architecture

🏗️ Core Components Added:
- Dockerfile.runpod - Optimized GPU pod image for RunPod
- create_pod.sh - Automated build and deployment script
- vidai/runpod.py - Complete RunPod API integration module
- Enhanced backend with pod spawning capabilities
- Web interface RunPod configuration section

🔧 Key Features:
- Automatic pod creation for analysis jobs
- Cost optimization with idle pod cleanup (30min timeout)
- Multiple GPU type support (RTX 3090, A4000, A5000, 4090)
- Secure API key management and pod isolation
- Fallback to local processing when pods unavailable

📊 Architecture Enhancements:
- Pod lifecycle: Create → Start → Run → Process → Terminate
- Intelligent routing between local workers and cloud pods
- Real-time pod health monitoring and status tracking
- Persistent pod state management with cache files

🛡️ Production Features:
- Comprehensive error handling and recovery
- Detailed logging and monitoring capabilities
- Security-hardened pod environments
- Resource limits and cost controls

📚 Documentation:
- docs/runpod-integration.md - Complete integration guide
- Updated README.md with RunPod setup instructions
- test_runpod.py - Integration testing and validation
- Inline code documentation and examples

🚀 Benefits:
- Zero idle GPU costs - pay only for actual processing
- Access to latest GPU hardware without maintenance
- Unlimited scaling potential for high-throughput workloads
- Global pod distribution for low-latency processing

This implementation provides a production-ready cloud GPU scaling solution that maintains the system's self-contained architecture while adding powerful on-demand processing capabilities.
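A highly simplified sketch of the pod lifecycle and idle cleanup described above; the class, callables, and timeout handling are placeholders and do not represent the actual vidai/runpod.py API:

```python
import time

IDLE_TIMEOUT = 30 * 60  # 30-minute idle cleanup, as described in the commit

class PodManager:
    """Tracks pods and tears them down after the idle timeout."""

    def __init__(self, create_pod, terminate_pod):
        # create_pod/terminate_pod are injected callables wrapping the real
        # RunPod API; their implementation is out of scope for this sketch.
        self._create = create_pod
        self._terminate = terminate_pod
        self._pods = {}  # pod_id -> last-used timestamp

    def acquire(self, gpu_type: str) -> str:
        """Create a pod for a job and start tracking it."""
        pod_id = self._create(gpu_type)
        self._pods[pod_id] = time.time()
        return pod_id

    def touch(self, pod_id: str) -> None:
        """Mark a pod as recently used so it survives the next cleanup pass."""
        self._pods[pod_id] = time.time()

    def cleanup_idle(self) -> None:
        """Terminate pods that have been idle longer than the timeout."""
        now = time.time()
        for pod_id, last_used in list(self._pods.items()):
            if now - last_used > IDLE_TIMEOUT:
                self._terminate(pod_id)
                del self._pods[pod_id]
```
-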
Stefy Lanza (nextime / spora ) authored
- Add detailed system architecture diagrams to README
- Create visual data flow and usage workflow diagrams
- Enhance docs/architecture.md with process and communication diagrams
- Add cluster architecture visualization with load balancing flows
- Include cross-platform deployment diagrams
- Provide visual representation of inter-process communication
- Document cluster communication protocols with ASCII schemas
-
Stefy Lanza (nextime / spora ) authored
- Create build.bat, start.bat, setup.bat, and clean.bat equivalents
- Update compatibility module with Windows-specific functions
- Add platform detection and script execution utilities
- Update README with Windows-specific instructions
- Ensure full Windows compatibility without requiring bash
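A small sketch of the kind of platform detection and script dispatch a compatibility module like this might contain; the helper names are illustrative:

```python
import platform
import subprocess

def is_windows() -> bool:
    return platform.system() == "Windows"

def run_script(name: str) -> int:
    """Run the platform-appropriate helper script, e.g. 'build' -> build.bat
    on Windows and build.sh elsewhere (script names from this commit)."""
    if is_windows():
        return subprocess.call(["cmd", "/c", f"{name}.bat"])
    return subprocess.call(["bash", f"{name}.sh"])
```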
-
Stefy Lanza (nextime / spora ) authored
- Implement cluster master/client architecture with token-based authentication
- Add cluster configuration options (host, port, token, client mode)
- Create cluster communication protocol for distributed workload management
- Implement load balancing with configurable process weights
- Add cluster management web interface for monitoring and control
- Support mixed local/remote worker deployment
- Enable/disable processes across cluster nodes
- Update queue manager for distributed job execution
- Add cluster documentation and configuration examples
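A minimal sketch of token-based authentication on the master side, assuming a line-oriented handshake; the protocol details are an illustration, not the project's actual wire format:

```python
import hmac
import socketserver

CLUSTER_TOKEN = "change-me"  # shared secret from the cluster configuration

class ClusterHandler(socketserver.StreamRequestHandler):
    """Master-side handler: reject client nodes that do not present the token."""

    def handle(self):
        presented = self.rfile.readline().strip().decode()
        if not hmac.compare_digest(presented, CLUSTER_TOKEN):
            self.wfile.write(b"DENIED\n")
            return
        self.wfile.write(b"OK\n")
        # ... exchange enabled-process lists and weights with the client node ...

# A client node would connect, send the token on one line, then wait for "OK"
# before registering its workers for distributed job execution.
```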
-
Stefy Lanza (nextime / spora ) authored
- Add estimated_tokens and used_tokens fields to processing_queue table
- Implement token estimation based on request type and content
- Track actual token usage during job processing
- Display estimated and used tokens in web interface queue views
- Update dashboard, queue list, and job details to show token information
- Simulate realistic token usage in queue processing
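A toy sketch of estimation and usage tracking against the processing_queue table; the per-type base values and the content heuristic are made up for illustration:

```python
# Rough per-request-type estimates; not the values used by the real queue.
BASE_TOKENS = {"analysis": 500, "training": 2000}

def estimate_tokens(request_type: str, content: str) -> int:
    """Estimate cost from the request type plus the size of the content."""
    return BASE_TOKENS.get(request_type, 500) + len(content) // 4

def record_usage(cursor, job_id: int, used: int) -> None:
    """Persist actual usage next to the estimate (SQLite-style placeholders)."""
    cursor.execute(
        "UPDATE processing_queue SET used_tokens = ? WHERE id = ?",
        (used, job_id),
    )
```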
-
Stefy Lanza (nextime / spora ) authored
- Add comm_type configuration option (unix/tcp, default unix)
- Command line argument --comm-type for runtime selection
- Web configuration page includes communication type selection
- SocketCommunicator and SocketServer support both Unix and TCP
- Updated all processes to use configured communication type
- Documentation updated to reflect both socket types
- Unix sockets provide better performance for local communication
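A minimal sketch of selecting between the two socket types, assuming a path and address chosen purely for illustration:

```python
import socket

def make_server_socket(comm_type: str = "unix",
                       unix_path: str = "/tmp/vidai.sock",
                       tcp_addr: tuple = ("127.0.0.1", 5001)) -> socket.socket:
    """Create a listening socket of the configured type. AF_UNIX is only
    available on POSIX systems, which is why TCP remains an option."""
    if comm_type == "unix":
        sock = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
        sock.bind(unix_path)
    else:  # "tcp"
        sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
        sock.bind(tcp_addr)
    sock.listen()
    return sock
```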
-
Stefy Lanza (nextime / spora ) authored
- Multi-process architecture: web, backend, analysis/training workers
- SQLite database for persistent configuration and system prompts
- Configurable CUDA/ROCm backends with command line override
- TCP socket-based inter-process communication
- Web interface with comprehensive configuration management
- GPLv3 licensing with copyright notices on all files
- Complete documentation: README, architecture docs, changelog
- Build and deployment scripts for different GPU backends
- Git repository setup with .gitignore for build artifacts
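A guess at the shape of the SQLite-backed persistent configuration store mentioned above; the table name and schema are assumptions, not the project's actual schema:

```python
import sqlite3

def init_config_store(path: str = "vidai.db") -> sqlite3.Connection:
    """Create a simple key/value table for configuration and system prompts."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS config (key TEXT PRIMARY KEY, value TEXT)"
    )
    return conn

def set_config(conn: sqlite3.Connection, key: str, value: str) -> None:
    """Upsert a configuration value so it survives process restarts."""
    conn.execute(
        "INSERT OR REPLACE INTO config (key, value) VALUES (?, ?)",
        (key, value),
    )
    conn.commit()
```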
-