• Stefy Lanza (nextime / spora )'s avatar
    Implement GPU prioritization and weight-based job distribution · 4f6f914d
    Stefy Lanza (nextime / spora ) authored
    - Added --weight parameter to client connections (default: 100)
    - Modified cluster master to prioritize GPU-enabled clients for job distribution
    - GPU clients always get precedence over CPU-only clients
    - When no GPU workers have required model, GPU clients still preferred for model distribution
    - Client weights are combined with process weights for load balancing
    - Higher weight = more jobs assigned to that client
    
    Job distribution priority:
    1. GPU clients with required model already loaded
    2. CPU clients with required model already loaded
    3. GPU clients (model will be sent)
    4. CPU clients (model will be sent)
    
    Within each category, clients are selected based on combined weight.
    4f6f914d
vidai.py 13.5 KB