[ROCm] Enable VLLM triton FP8 moe for gfx1201, tuned for Qwen3-30B-A3B-FP8...
[ROCm] Enable VLLM triton FP8 moe for gfx1201, tuned for Qwen3-30B-A3B-FP8 tp=2 and Qwen/Qwen3.5-35B-A3B-FP8 tp=2 (#38086) Signed-off-by:big-yellow-duck <jeffaw99@hotmail.com> Co-authored-by:
vllmellm <vllm.ellm@embeddedllm.com>
Showing
Please register or sign in to comment