Commit ef2b0e67 authored by Kris Hung, committed by GitHub

fix: Revert the commit for DeepGEMM to fix vLLM WideEP (#2302)

parent f962c7d6
@@ -17,7 +17,9 @@ ARG TORCH_BACKEND="cu128"
 # Match 0.10.0 vLLM release
 # https://github.com/vllm-project/vllm/releases/tag/v0.10.0
-ARG DEEPGEMM_REF="1876566"
+# Pinned to commit before https://github.com/deepseek-ai/DeepGEMM/pull/112 for DeepGEMM which seems to break on H100:
+# "RuntimeError: Failed: CUDA runtime error csrc/jit/kernel_runtime.hpp:108 '98'"
+ARG DEEPGEMM_REF="03d0be3"
 ARG FLASHINF_REF="v0.2.8rc1"
 # Define general architecture ARGs for supporting both x86 and aarch64 builds.
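
For readers unfamiliar with how a pinned ref such as `DEEPGEMM_REF` usually takes effect, the following is a minimal sketch of one common pattern: a build argument consumed by a `git checkout`. It is not taken from this repository's Dockerfile. The base image, the clone path `/opt/DeepGEMM`, and the clone-then-checkout step are all assumptions made for illustration; the actual build may hand the ref to an install script instead.

```dockerfile
# Hypothetical sketch, not the repository's actual Dockerfile.
# Base image chosen only to keep the example self-contained (assumption).
FROM ubuntu:24.04

# Default matches the ref pinned in this commit; it can be overridden
# at build time with --build-arg.
ARG DEEPGEMM_REF="03d0be3"

# git is needed to fetch the source; ca-certificates for HTTPS clones.
RUN apt-get update && \
    apt-get install -y --no-install-recommends git ca-certificates && \
    rm -rf /var/lib/apt/lists/*

# Clone DeepGEMM and check out the pinned commit.
# The destination path /opt/DeepGEMM is an assumption.
RUN git clone https://github.com/deepseek-ai/DeepGEMM.git /opt/DeepGEMM && \
    git -C /opt/DeepGEMM checkout "${DEEPGEMM_REF}"
```

With a layout like this, a different ref could be tried without editing the file, for example `docker build --build-arg DEEPGEMM_REF=1876566 .`, which may be handy when checking whether the H100 failure quoted in the comment still reproduces.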