OpenDAS / vllm_cscc
Commit 2039c630
Authored Apr 02, 2025 by Thien Tran
Committed by GitHub, Apr 02, 2025

[Bugfix] Fix imports for MoE on CPU (#15841)

Signed-off-by: Thien Tran <gau.nernst@yahoo.com.sg>

Parent: 6efb195a

Changes: 1 changed file with 3 additions and 2 deletions

--- a/vllm/model_executor/layers/fused_moe/rocm_aiter_fused_moe.py
+++ b/vllm/model_executor/layers/fused_moe/rocm_aiter_fused_moe.py
@@ -4,8 +4,6 @@ from typing import List, Optional
 import torch
 
 import vllm.envs as envs
-from vllm.model_executor.layers.quantization.utils.fp8_utils import (
-    per_token_group_quant_fp8)
 from vllm.platforms import current_platform
 
 
@@ -38,6 +36,9 @@ def rocm_aiter_fused_experts(
     import aiter as rocm_aiter
     import aiter.fused_moe_bf16_asm as rocm_aiter_asm_fmoe
 
+    from vllm.model_executor.layers.quantization.utils.fp8_utils import (
+        per_token_group_quant_fp8)
+
     if envs.VLLM_ROCM_USE_AITER_FP8_BLOCK_SCALED_MOE and use_fp8_w8a8:
         assert w1_scale is not None
         assert w2_scale is not None
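
The diff moves the fp8_utils import from module scope into the body of rocm_aiter_fused_experts, next to the existing function-local aiter imports. The commit title suggests the module-level import broke importing this file on CPU. A minimal sketch of that deferred-import pattern follows. It is not vLLM code: fused_experts_sketch and hypothetical_accel_backend are made-up names, and the missing module stands in for a dependency that is only importable on some platforms.

```python
def fused_experts_sketch(values, use_accelerated=False):
    """Hypothetical function with an optional, platform-specific fast path."""
    if use_accelerated:
        # Deferred import: resolved only when this branch runs, so importing
        # the enclosing module never touches the optional dependency.
        # `hypothetical_accel_backend` is an assumed name and does not exist.
        import hypothetical_accel_backend as accel
        return accel.run(values)
    # Portable fallback that needs no platform-specific package.
    return [v * 2 for v in values]


# The fallback path works even though the optional backend is absent.
print(fused_experts_sketch([1, 2, 3]))

# The import error surfaces only when the accelerated path is requested.
try:
    fused_experts_sketch([1, 2, 3], use_accelerated=True)
except ImportError as exc:
    print(f"accelerated path unavailable: {exc}")
```

The trade-off is that a broken or missing dependency is reported at call time rather than at import time, which is acceptable when the code path is already gated on a platform or configuration check.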