[Quantization][ROCm] Fix MoE weight loading to be robust (Qwen3_MoE/Qwen3_next...
[Quantization][ROCm] Fix MoE weight loading to be robust (Qwen3_MoE/Qwen3_next as example models) (#33173)
Signed-off-by:
xuebwang-amd <xuebwang@amd.com>
Showing
vllm/model_executor/layers/fused_moe/layer.py
100644 → 100755
Please register or sign in to comment