OpenDAS / vllm_cscc
Commit a75021db
Authored Feb 13, 2025 by zhuwenwen

Merge branch 'v0.7.2-dev' of http://10.6.10.68/dcutoolkit/deeplearing/vllm into v0.7.2-dev

Parents: a8b2d878, a9c16183

Showing 1 changed file with 1 addition and 1 deletion

csrc/moe/moe_align_sum_kernels.cu (+1, -1)

@@ -307,7 +307,7 @@ __global__ void ep_moe_align_block_size_kernel(scalar_t* __restrict__ topk_ids,
    * assigned to expert expert_index.
    */
   for (int i = start_idx; i < numel && i < start_idx + tokens_per_thread; ++i) {
-    expert_id = topk_ids[i];
+    int32_t expert_id = topk_ids[i];
     if (expert_id >= start_expert && expert_id < end_expert) {
       ++tokens_cnts[index(num_experts, threadIdx.x + 1, expert_id - start_expert)];
     }
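
For readers unfamiliar with this counting step, here is a minimal CPU sketch in C++ of what the loop in the hunk appears to do: each thread walks its shard of `topk_ids` and counts only the tokens routed to experts in `[start_expert, end_expert)`. The function name `count_tokens_per_expert`, the `num_threads` parameter, the ceil-divided shard size, and the row width (the number of local experts) are assumptions made for illustration. They are not taken from the commit, which shows only the loop body.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// CPU reference for the counting loop shown in the hunk (illustrative only).
// Returns a row-major table with (num_threads + 1) rows and one column per
// local expert. Row t + 1 holds the counts for the shard of "thread" t;
// row 0 stays zero, mirroring the threadIdx.x + 1 offset in the kernel.
std::vector<int32_t> count_tokens_per_expert(
    const std::vector<int32_t>& topk_ids, int num_threads,
    int32_t start_expert, int32_t end_expert) {
  assert(num_threads > 0 && end_expert > start_expert);
  const int num_local_experts = end_expert - start_expert;  // assumed row width
  const int numel = static_cast<int>(topk_ids.size());
  // Assumption: shards are sized by ceil-division so every token is visited once.
  const int tokens_per_thread = (numel + num_threads - 1) / num_threads;

  std::vector<int32_t> tokens_cnts(
      static_cast<std::size_t>(num_threads + 1) * num_local_experts, 0);

  for (int t = 0; t < num_threads; ++t) {
    const int start_idx = t * tokens_per_thread;
    for (int i = start_idx; i < numel && i < start_idx + tokens_per_thread; ++i) {
      // Declared at the point of use, as in the added line of the diff.
      const int32_t expert_id = topk_ids[i];
      // Tokens routed to experts outside the local range are skipped.
      if (expert_id >= start_expert && expert_id < end_expert) {
        ++tokens_cnts[(t + 1) * num_local_experts + (expert_id - start_expert)];
      }
    }
  }
  return tokens_cnts;
}
```

With `topk_ids = {2, 2, 3, 0, 5, 3}`, two threads, and the local expert range `[2, 4)`, the first shard contributes two tokens to expert 2 and one to expert 3, and the second shard contributes one token to expert 3. The tokens for experts 0 and 5 are ignored.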