1. 20 Mar, 2024 1 commit
  2. 18 Mar, 2024 3 commits
      set default to cpu · 132d741c
      Jannik Streidl authored
      cuda support · 5abe0089
      Jannik Streidl authored
      Updated Dockerfile for CUDA backend · c5948d3e
      Joseph Young authored
      Enabled the NVIDIA CUDA backend build stage in the Dockerfile for GPU-accelerated performance. Moved the environment variable that defines the device type for the embedding and TTS models so that it is shared between the CPU and GPU configurations. The CPU build now explicitly defaults the device type to "cpu", while the CUDA build retains "cuda", so each build targets the right hardware.
  3. 17 Mar, 2024 3 commits
      Refactor Dockerfile for CPU and CUDA builds · c004ecdc
      Joseph Young authored
      Switched to Chainguard images as the base for both CPU and CUDA backend builds for improved security and compatibility. Replaced the Ubuntu base with Chainguard's Python image for CPU builds and its PyTorch CUDA image for GPU acceleration, resolving Python requirement conflicts. Updated package installation commands to align with the new Red Hat-compatible base images. The Dockerfile now installs only the necessary dependencies, as Python is provided by the base image.
      
      These changes will facilitate a more secure and streamlined build process with better dependency management across different platforms.
      Parametrize CUDA_VERSION in Dockerfile · e3b1cbbb
      Joseph Young authored
      Standardized CUDA_VERSION as a global ARG to ensure consistency and facilitate version updates across the Dockerfile. This change allows the CUDA version to be defined once at the beginning and reused, reducing the chance of mismatched versions and easing maintenance when changing CUDA versions. It further streamlines the build process for potential multi-stage builds with varying CUDA dependencies.
      
      Refs #nvidia-update
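
      The global-ARG pattern described in this commit message can be sketched roughly as follows. This is a minimal illustration, not the project's actual Dockerfile: the base image tag, the default version value, and the stage name `cuda` are assumptions chosen for the example.

      ```dockerfile
      # Global ARG: declared before the first FROM, so it can be used in FROM lines.
      # The default value below is an illustrative assumption.
      ARG CUDA_VERSION=12.1.0

      # Assumed base image; the repository's real base image may differ.
      FROM nvidia/cuda:${CUDA_VERSION}-runtime-ubuntu22.04 AS cuda

      # A global ARG is not visible inside a build stage until it is
      # re-declared (without a value) within that stage.
      ARG CUDA_VERSION
      RUN echo "Building against CUDA ${CUDA_VERSION}"
      ```

      With this layout the version can be overridden at build time in one place, for example `docker build --build-arg CUDA_VERSION=12.2.0 .`, provided a matching base image tag exists.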
      Optimize Dockerfile for CUDA support · f6cef312
      Joseph Young authored
      Refactored the Dockerfile to better organize and streamline environment variable settings, emphasizing support for a CUDA-based WebUI backend while retaining the ability to build a CPU-only image. Consolidated ENV commands to reduce layers, improving build efficiency, and set a default PORT environment to enhance container usability. Enabled exposure of the backend service on port 8080 and leveraged combined RUN directives to minimize the image footprint. These changes facilitate a more robust deployment process, catering to both CPU and CUDA environments.
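
      As a rough sketch of the consolidation this commit message describes, one ENV instruction can set several variables and one RUN instruction can chain related commands. Everything below other than `PORT` and port 8080 is assumed for illustration: the base image, the `APP_ENV` variable, the installed package, and the choice of 8080 as the `PORT` default are not taken from the repository.

      ```dockerfile
      # Assumed base image, for illustration only.
      FROM python:3.11-slim

      # One ENV instruction instead of several: fewer layers and a single
      # place to read the defaults. APP_ENV is a hypothetical variable.
      ENV APP_ENV=prod \
          PORT=8080

      # Document the port the backend service listens on.
      EXPOSE 8080

      # Combined RUN: install and clean up in the same layer so the package
      # index does not persist in the final image. curl is a placeholder.
      RUN apt-get update \
          && apt-get install -y --no-install-recommends curl \
          && rm -rf /var/lib/apt/lists/*
      ```

      Cleaning up in the same RUN instruction matters because files deleted in a later layer still occupy space in the earlier one.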
  4. 16 Mar, 2024 3 commits
  5. 14 Mar, 2024 2 commits
  6. 11 Mar, 2024 5 commits
  7. 10 Mar, 2024 17 commits
  8. 09 Mar, 2024 6 commits