[Bugfix][ROCm]Fix Qwen3-Next-80B-A3B-Thinking inference and optimize non-standard block size (544) support under rocm_atten (#31380)
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>