Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
c8492797
Commit
c8492797
authored
Apr 02, 2025
by
zhuwenwen
Browse files
add moe_wna16_gemm
parent
fe6d3b05
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
2 additions
and
2 deletions
+2
-2
csrc/moe/moe_ops.h
csrc/moe/moe_ops.h
+1
-1
csrc/moe/torch_bindings.cpp
csrc/moe/torch_bindings.cpp
+1
-1
No files found.
csrc/moe/moe_ops.h
View file @
c8492797
...
@@ -18,7 +18,7 @@ void sgl_moe_align_block_size(torch::Tensor topk_ids, int64_t num_experts,
...
@@ -18,7 +18,7 @@ void sgl_moe_align_block_size(torch::Tensor topk_ids, int64_t num_experts,
torch
::
Tensor
sorted_token_ids
,
torch
::
Tensor
sorted_token_ids
,
torch
::
Tensor
experts_ids
,
torch
::
Tensor
experts_ids
,
torch
::
Tensor
num_tokens_post_pad
);
torch
::
Tensor
num_tokens_post_pad
);
#if
n
def USE_ROCM
#ifdef USE_ROCM
torch
::
Tensor
moe_wna16_gemm
(
torch
::
Tensor
input
,
torch
::
Tensor
output
,
torch
::
Tensor
moe_wna16_gemm
(
torch
::
Tensor
input
,
torch
::
Tensor
output
,
torch
::
Tensor
b_qweight
,
torch
::
Tensor
b_scales
,
torch
::
Tensor
b_qweight
,
torch
::
Tensor
b_scales
,
std
::
optional
<
torch
::
Tensor
>
b_qzeros
,
std
::
optional
<
torch
::
Tensor
>
b_qzeros
,
...
...
csrc/moe/torch_bindings.cpp
View file @
c8492797
...
@@ -31,7 +31,7 @@ TORCH_LIBRARY_EXPAND(TORCH_EXTENSION_NAME, m) {
...
@@ -31,7 +31,7 @@ TORCH_LIBRARY_EXPAND(TORCH_EXTENSION_NAME, m) {
" Tensor! num_tokens_post_pad) -> ()"
);
" Tensor! num_tokens_post_pad) -> ()"
);
m
.
impl
(
"sgl_moe_align_block_size"
,
torch
::
kCUDA
,
&
sgl_moe_align_block_size
);
m
.
impl
(
"sgl_moe_align_block_size"
,
torch
::
kCUDA
,
&
sgl_moe_align_block_size
);
#if
n
def USE_ROCM
#ifdef USE_ROCM
m
.
def
(
m
.
def
(
"moe_wna16_gemm(Tensor input, Tensor! output, Tensor b_qweight, "
"moe_wna16_gemm(Tensor input, Tensor! output, Tensor b_qweight, "
"Tensor b_scales, Tensor? b_qzeros, "
"Tensor b_scales, Tensor? b_qzeros, "
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment