• Stefy Lanza (nextime / spora )'s avatar
    Improve GPU memory detection fallback chain · c2855737
    Stefy Lanza (nextime / spora ) authored
    - Add nvidia-smi as intermediate fallback before PyTorch in GPU stats collection
    - Fallback order: pynvml -> nvidia-smi -> PyTorch
    - Applied to api.py, backend.py, and cluster_client.py GPU stats functions
    - nvidia-smi provides accurate memory usage and utilization data
    - Fix SocketCommunicator.receive_message() timeout parameter error
    - Added optional timeout parameter to receive_message method
    - Fixes 'unexpected keyword argument timeout' error in api_stats and backend functions
    c2855737
Name
Last commit
Last update
docs Loading commit data...
templates Loading commit data...
vidai Loading commit data...
.gitignore Loading commit data...
AI.PROMPT Loading commit data...
CHANGELOG.md Loading commit data...
Dockerfile.runpod Loading commit data...
LICENSE Loading commit data...
README.md Loading commit data...
TODO.md Loading commit data...
build.bat Loading commit data...
build.sh Loading commit data...
clean.bat Loading commit data...
clean.sh Loading commit data...
create_pod.sh Loading commit data...
image.jpg Loading commit data...
requirements-cuda.txt Loading commit data...
requirements-rocm.txt Loading commit data...
requirements.txt Loading commit data...
setup.bat Loading commit data...
setup.sh Loading commit data...
start.bat Loading commit data...
test_comm.py Loading commit data...
test_runpod.py Loading commit data...
vidai.conf.sample Loading commit data...
vidai.py Loading commit data...
vidai.sh Loading commit data...