1. 28 Feb, 2026 4 commits
    • Make procname optional - commented out in requirements · 3b451669
      Stefy Lanza (nextime / spora ) authored
      - Comment out procname in requirements-nvidia.txt
      - Comment out procname in requirements-vulkan.txt
      - Add note about requiring libproc2-dev for procname
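Commenting procname out of the requirements files only works if the server tolerates its absence. A minimal sketch of such a guarded import follows; the helper name `set_process_name` is hypothetical and not taken from the repository.

```python
# Guarded import: procname is treated as an optional dependency.
try:
    import procname  # needs libproc2-dev to build, per the commit note
except ImportError:
    procname = None


def set_process_name(name: str) -> bool:
    """Rename the process if procname is installed; report whether it happened.

    Hypothetical helper for illustration only.
    """
    if procname is None:
        return False
    procname.setprocname(name)
    return True
```

Callers can ignore the return value; it is there only so a startup log line can say whether the rename took effect.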
    • Update Vulkan dependencies: add glslc package · 389851fe
      Stefy Lanza (nextime / spora ) authored
      - build.sh: Update package list to include glslc, glslang-tools, glslang-dev
      - README.md: Update installation instructions with correct package names
      - Add better guidance for finding glslc in non-standard locations
    • Fix Vulkan build: add glslc/shader compiler check · bd5b87b5
      Stefy Lanza (nextime / spora ) authored
      - Update build.sh to check for glslc before attempting build
      - Update README with correct package names (glslang-tools/glslang)
      - Add troubleshooting for missing glslc error
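The check itself lives in build.sh, which is not shown here. As an illustration of the same pre-flight idea in Python, the sketch below looks for `glslc` on `PATH` and then in caller-supplied directories. The function name `find_tool` and the `extra_dirs` parameter are assumptions made for this example.

```python
import shutil
from typing import Iterable, Optional


def find_tool(name: str = "glslc", extra_dirs: Iterable[str] = ()) -> Optional[str]:
    """Return the full path to `name`, or None if it cannot be found.

    PATH is searched first, then each directory in `extra_dirs` in order.
    """
    found = shutil.which(name)
    if found:
        return found
    for directory in extra_dirs:
        # shutil.which accepts an explicit search path instead of PATH
        found = shutil.which(name, path=directory)
        if found:
            return found
    return None
```

A build wrapper could call `find_tool("glslc", extra_dirs=[...])` with the directories of a non-standard install and stop with an install hint when the result is `None`, rather than failing halfway through the Vulkan build.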
    • Add Vulkan support for AMD GPUs alongside NVIDIA/CUDA · 02fb99fa
      Stefy Lanza (nextime / spora ) authored
      - Add build.sh script with nvidia/vulkan arguments (default: nvidia)
      - Create backend abstraction: ModelBackend base class
      - Implement NvidiaBackend using HuggingFace Transformers
      - Implement VulkanBackend using llama-cpp-python with GGUF models
      - Add separate requirements files for nvidia and vulkan backends
      - Add --backend argument with auto/nvidia/vulkan options
      - Add Vulkan-specific options: --n-gpu-layers, --n-ctx
      - Make procname import optional
      - Update README with comprehensive Vulkan usage instructions
      - Add Vulkan troubleshooting section
      - Add GGUF model recommendations
      
      The application now supports:
      - NVIDIA GPUs via PyTorch/Transformers (HuggingFace models)
      - AMD GPUs via llama-cpp-python/Vulkan (GGUF models)
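The backend abstraction and the new command-line options described in this commit could be organised roughly as below. This is a sketch under stated assumptions, not the repository's code: the class names and flag names come from the commit message, while the method names (`load`, `generate`), the default values, the `--model` flag, and the rule used to resolve `auto` are hypothetical. The backends are stubs and load no real model.

```python
import argparse
from abc import ABC, abstractmethod


class ModelBackend(ABC):
    """Interface the server calls, whichever GPU stack is in use."""

    @abstractmethod
    def load(self, model: str) -> None: ...

    @abstractmethod
    def generate(self, prompt: str, max_tokens: int = 128) -> str: ...


class NvidiaBackend(ModelBackend):
    def load(self, model: str) -> None:
        self.model = model  # stub: real code would load a HuggingFace model

    def generate(self, prompt: str, max_tokens: int = 128) -> str:
        return f"[nvidia:{self.model}] {prompt}"


class VulkanBackend(ModelBackend):
    def __init__(self, n_gpu_layers: int = -1, n_ctx: int = 2048):
        self.n_gpu_layers = n_gpu_layers  # assumed default: offload all layers
        self.n_ctx = n_ctx                # assumed default context size

    def load(self, model: str) -> None:
        self.model = model  # stub: real code would open a GGUF file

    def generate(self, prompt: str, max_tokens: int = 128) -> str:
        return f"[vulkan:{self.model}] {prompt}"


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser()
    parser.add_argument("--model", required=True)  # hypothetical flag
    parser.add_argument("--backend", choices=["auto", "nvidia", "vulkan"],
                        default="auto")
    parser.add_argument("--n-gpu-layers", type=int, default=-1)
    parser.add_argument("--n-ctx", type=int, default=2048)
    return parser


def create_backend(args: argparse.Namespace) -> ModelBackend:
    name = args.backend
    if name == "auto":
        # Assumed heuristic: GGUF files go to Vulkan, everything else to NVIDIA.
        name = "vulkan" if args.model.lower().endswith(".gguf") else "nvidia"
    if name == "vulkan":
        backend: ModelBackend = VulkanBackend(args.n_gpu_layers, args.n_ctx)
    else:
        backend = NvidiaBackend()
    backend.load(args.model)
    return backend
```

Keeping the Vulkan-only options on the `VulkanBackend` constructor means the request-handling code only ever sees `ModelBackend`, so it needs no per-backend branches.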
2. 27 Feb, 2026 2 commits
    • Fix NaN/inf probability tensor error during generation · ae1d0e38
      Stefy Lanza (nextime / spora ) authored
      - Add InvalidLogitsProcessor to replace NaN and Inf values with finite numbers
      - Add _validate_generation_params() to clamp temperature and top_p to valid ranges
      - Add try-except blocks with fallback to greedy decoding on numerical errors
      - Add error handling in streaming responses to prevent crashes
      - Fix temperature=0 handling to use greedy decoding instead of sampling
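The two safeguards named in this commit, replacing non-finite logits and validating sampling parameters, can be illustrated in dependency-free Python. The sketch below works on plain lists of floats rather than tensors. The replacement values and clamping bounds (`MAX_LOGIT`, a temperature ceiling of 2.0, a `top_p` floor of 1e-3) are assumptions chosen for the example, not values taken from the repository.

```python
import math

MAX_LOGIT = 1e4  # assumed finite stand-in for +/-inf


def sanitize_logits(logits):
    """Return a copy of `logits` with NaN -> 0.0 and +/-inf -> +/-MAX_LOGIT."""
    clean = []
    for x in logits:
        if math.isnan(x):
            clean.append(0.0)
        elif math.isinf(x):
            clean.append(MAX_LOGIT if x > 0 else -MAX_LOGIT)
        else:
            clean.append(x)
    return clean


def validate_generation_params(temperature, top_p):
    """Clamp sampling parameters; fall back to greedy decoding when
    temperature is missing, NaN, or not positive."""
    if temperature is None or math.isnan(temperature) or temperature <= 0:
        return {"do_sample": False}  # temperature=0 means greedy decoding
    temperature = min(temperature, 2.0)
    if top_p is None or math.isnan(top_p):
        top_p = 1.0
    top_p = min(max(top_p, 1e-3), 1.0)
    return {"do_sample": True, "temperature": temperature, "top_p": top_p}
```

Treating `temperature=0` as greedy decoding avoids dividing logits by zero, which is one common source of the NaN/inf probabilities this commit addresses.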
    • Initial commit: Add CoderAI OpenAI-compatible API server · 087ba9e1
      Stefy Lanza (nextime / spora ) authored
      - Add main server script with FastAPI and memory-aware model loading
      - Add requirements.txt with dependencies and platform-specific PyTorch options
      - Add comprehensive README.md with installation, usage, and troubleshooting
      - Add LICENSE.md with GPLv3 license