Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
5b19b930
Unverified
Commit
5b19b930
authored
Feb 05, 2025
by
Gregory Shtrasberg
Committed by
GitHub
Feb 05, 2025
Browse files
[ROCm][Kernel] Using the correct warp_size value
parent
75404d04
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
2 deletions
+2
-2
csrc/moe/moe_align_sum_kernels.cu
csrc/moe/moe_align_sum_kernels.cu
+2
-2
No files found.
csrc/moe/moe_align_sum_kernels.cu
View file @
5b19b930
...
@@ -207,8 +207,8 @@ __global__ void sgl_moe_align_block_size_kernel(
...
@@ -207,8 +207,8 @@ __global__ void sgl_moe_align_block_size_kernel(
__shared__
int32_t
shared_counts
[
32
][
8
];
__shared__
int32_t
shared_counts
[
32
][
8
];
__shared__
int32_t
local_offsets
[
256
];
__shared__
int32_t
local_offsets
[
256
];
const
int
warp_id
=
threadIdx
.
x
/
WARP_SIZE
;
const
int
warp_id
=
threadIdx
.
x
/
32
;
const
int
lane_id
=
threadIdx
.
x
%
WARP_SIZE
;
const
int
lane_id
=
threadIdx
.
x
%
32
;
const
int
experts_per_warp
=
8
;
const
int
experts_per_warp
=
8
;
const
int
my_expert_start
=
warp_id
*
experts_per_warp
;
const
int
my_expert_start
=
warp_id
*
experts_per_warp
;
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment