Commits · 7d474b9f3f38269381d06058358e2d83d3558f34 · SexHackMe / vidai

05 Oct, 2025 24 commits

Remove obsolete webui.py file · 7d474b9f

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Deleted webui.py as it was replaced by vidai/web.py in the multi-process architecture
- Functionality moved to separate web interface process
- Cleaned up legacy monolithic code that is no longer used

7d474b9f

Implement dynamic model management system · 467292ed

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Add vidai/models.py with abstract base classes for different model types
- Support multiple HuggingFace transformers models (vision-language, text-only)
- Auto-detect model types from model names/paths
- Add model type configuration options (--model-type)
- Update worker_analysis.py to use dynamic model loading instead of hardcoded Qwen
- Maintain backward compatibility with existing model configurations

467292ed

Replace deprecated pynvml with nvidia-ml-py · 9f967d46

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Updated all requirements files to use nvidia-ml-py instead of deprecated pynvml
- Added fallback import in webui.py to support both packages
- Maintains backward compatibility while using the actively maintained package
- Updated installation instructions in error messages

9f967d46

Remove unused start.sh script · 52412101

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Removed start.sh since process startup is handled by vidai.py
- vidai.py now includes proper backend readiness checking
- Simplified deployment by using single Python launcher

52412101

Fix startup sequence to ensure backend readiness · 03f65ee6

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Update start.sh to wait for Unix socket creation before starting workers
- Remove retry logic from workers since startup script ensures backend is ready
- Workers now connect immediately knowing backend socket exists
- Prevents race conditions between backend startup and worker connections

03f65ee6

Implement bidirectional socket-based interprocess communication · 0aed6a48

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Separate communication protocols: TCP for web-backend, Unix sockets for backend-workers
- Remove file-based result passing, implement in-memory result storage in backend
- Web interface polls backend via socket for results instead of reading files
- Workers send results back through Unix socket connection to backend
- Maintain persistent connections for efficient bidirectional communication

0aed6a48

Fix worker communication initialization · 99d6e5b9

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Update workers to use proper socket path from config instead of hardcoded paths
- Ensure workers check communication type and use appropriate connection method
- Fix SocketCommunicator to properly handle Unix socket paths
- Revert database default back to 'unix' for proper Unix socket support

99d6e5b9

Fix inter-process communication issues · 9e0ff1bd

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Fix SocketCommunicator initialization to properly handle TCP vs Unix sockets
- Update backend to use configurable communication type from config
- Change default communication type from 'unix' to 'tcp' for better reliability
- Ensure consistent socket handling across all processes

9e0ff1bd

Implement multi-process architecture with CUDA/ROCm backend selection · 2604b812

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Refactor monolithic Flask app into separate processes:
  * Web interface process (vidai/web.py)
  * Backend routing process (vidai/backend.py)
  * Analysis worker processes (vidai/worker_analysis.py)
  * Training worker processes (vidai/worker_training.py)

- Add self-contained inter-process communication using TCP sockets
- Implement configuration system for backend selection (CUDA/ROCm)
- Add GPLv3 licensing and copyright notices
- Create comprehensive documentation and build scripts
- Update main launcher to manage all processes

This architecture provides better scalability, allows independent GPU backend selection, and maintains clean separation of concerns.

2604b812

Update files · 56e233bb
Stefy Lanza (nextime / spora ) authored Oct 05, 2025

56e233bb

Implement multi-process Video AI architecture · bc26540c

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Add socket-based inter-process communication
- Implement backend process for request routing
- Create separate web interface process
- Add CUDA/ROCm worker processes for analysis and training
- Add configuration system for backend selection
- Update build scripts for multi-component executables
- Add startup scripts for process orchestration
- Include GPLv3 license and copyright notices
- Add comprehensive documentation and README
- Create CHANGELOG for version tracking

bc26540c

Fix circular dependency in database configuration · 552f2d10

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Move db_type handling to environment variables to avoid circular dependency
- get_db_config() now uses environment variables for database type
- get_config() handles db_type specially to prevent recursion
- Fixes ImportError when running vidai.py

552f2d10

Add vidai.sh launcher script for Linux development · 3dfa46c4

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Create executable launcher script for non-PyInstaller builds
- Automatically detects and uses appropriate virtual environments
- Launches backend, web interface, and workers with correct CUDA/ROCm environments
- Reads configuration to determine backend selection
- Provides process management and cleanup on exit

3dfa46c4

Update build and setup scripts to support both CUDA and ROCm backends · 92ab2baa

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Setup script now creates both venv-cuda and venv-rocm by default
- Build script builds both CUDA and ROCm worker versions
- Start script reads configuration to launch appropriate backend workers
- Added pip3 detection and --break-system-packages support
- Windows batch files updated with pip3 detection
- Runtime can switch between CUDA/ROCm implementations based on config

92ab2baa

Fix PyTorch version specification in requirements files · a9ec4a6f

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Remove invalid local version labels from torch requirements
- Use proper pip syntax: torch>=2.0.0 with --index-url for CUDA/ROCm
- Fix pip installation errors for CUDA and ROCm builds

a9ec4a6f

Add configurable TCP ports and hosts for web interface and inter-process communication · 97b7c6fd

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Add network configuration settings: web_host, web_port, backend_host, backend_web_port, backend_worker_port
- Update web.py to use configurable host and port for Flask app
- Update backend.py to use configurable ports for socket servers
- Update worker processes to use configurable backend worker port
- Add network configuration section to admin panel
- Update command line arguments to accept network settings
- Maintain backward compatibility with default values

97b7c6fd

Add MySQL database support and complete multi-process Video AI implementation · 53c252b3

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Add MySQL support as alternative to SQLite with full configuration
- Implement database abstraction layer supporting both SQLite and MySQL
- Add database configuration section to admin panel
- Update requirements files with PyMySQL dependency
- Complete multi-process architecture with web, backend, and worker processes
- Add user registration system with email confirmation
- Implement token-based usage system with payment processing
- Add comprehensive documentation and licensing
- Create professional SaaS-style web interface
- Implement REST API for programmatic access
- Add email notification system and payment processors

53c252b3

Add complete RunPod.io integration for dynamic GPU pod management · 2c485eee

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

🎯 RunPod.io Cloud GPU Integration
• Dynamic pod creation and lifecycle management
• On-demand GPU scaling without local hardware costs
• Seamless integration with existing multi-process architecture

🏗️ Core Components Added:
• Dockerfile.runpod - Optimized GPU pod image for RunPod
• create_pod.sh - Automated build and deployment script
• vidai/runpod.py - Complete RunPod API integration module
• Enhanced backend with pod spawning capabilities
• Web interface RunPod configuration section

🔧 Key Features:
• Automatic pod creation for analysis jobs
• Cost optimization with idle pod cleanup (30min timeout)
• Multiple GPU type support (RTX 3090, A4000, A5000, 4090)
• Secure API key management and pod isolation
• Fallback to local processing when pods unavailable

📊 Architecture Enhancements:
• Pod lifecycle: Create → Start → Run → Process → Terminate
• Intelligent routing between local workers and cloud pods
• Real-time pod health monitoring and status tracking
• Persistent pod state management with cache files

🛡️ Production Features:
• Comprehensive error handling and recovery
• Detailed logging and monitoring capabilities
• Security-hardened pod environments
• Resource limits and cost controls

📚 Documentation:
• docs/runpod-integration.md - Complete integration guide
• Updated README.md with RunPod setup instructions
• test_runpod.py - Integration testing and validation
• Inline code documentation and examples

🚀 Benefits:
• Zero idle GPU costs - pay only for actual processing
• Access to latest GPU hardware without maintenance
• Unlimited scaling potential for high-throughput workloads
• Global pod distribution for low-latency processing

This implementation provides a production-ready cloud GPU scaling solution that maintains the system's self-contained architecture while adding powerful on-demand processing capabilities.

2c485eee

Add comprehensive ASCII art documentation with visual diagrams · cec0be4d

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Add detailed system architecture diagrams to README
- Create visual data flow and usage workflow diagrams
- Enhance docs/architecture.md with process and communication diagrams
- Add cluster architecture visualization with load balancing flows
- Include cross-platform deployment diagrams
- Provide visual representation of inter-process communication
- Document cluster communication protocols with ASCII schemas

cec0be4d

Add Windows batch file support for cross-platform compatibility · 9b4a4f65

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Create build.bat, start.bat, setup.bat, and clean.bat equivalents
- Update compatibility module with Windows-specific functions
- Add platform detection and script execution utilities
- Update README with Windows-specific instructions
- Ensure full Windows compatibility without requiring bash

9b4a4f65

Add comprehensive clustering support for distributed processing · 4b590a0b

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Implement cluster master/client architecture with token-based authentication
- Add cluster configuration options (host, port, token, client mode)
- Create cluster communication protocol for distributed workload management
- Implement load balancing with configurable process weights
- Add cluster management web interface for monitoring and control
- Support mixed local/remote worker deployment
- Enable/disable processes across cluster nodes
- Update queue manager for distributed job execution
- Add cluster documentation and configuration examples

4b590a0b

Add token usage tracking to queue system · 4a7a9d07

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Add estimated_tokens and used_tokens fields to processing_queue table
- Implement token estimation based on request type and content
- Track actual token usage during job processing
- Display estimated and used tokens in web interface queue views
- Update dashboard, queue list, and job details to show token information
- Simulate realistic token usage in queue processing

4a7a9d07

Add configurable communication type: Unix vs TCP sockets · d0222b6c

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Add comm_type configuration option (unix/tcp, default unix)
- Command line argument --comm-type for runtime selection
- Web configuration page includes communication type selection
- SocketCommunicator and SocketServer support both Unix and TCP
- Updated all processes to use configured communication type
- Documentation updated to reflect both socket types
- Unix sockets provide better performance for local communication

d0222b6c

Complete implementation of multi-process Video AI Analysis Tool · 2a1990a5

Stefy Lanza (nextime / spora ) authored Oct 05, 2025

- Multi-process architecture: web, backend, analysis/training workers
- SQLite database for persistent configuration and system prompts
- Configurable CUDA/ROCm backends with command line override
- TCP socket-based inter-process communication
- Web interface with comprehensive configuration management
- GPLv3 licensing with copyright notices on all files
- Complete documentation: README, architecture docs, changelog
- Build and deployment scripts for different GPU backends
- Git repository setup with .gitignore for build artifacts

2a1990a5