Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
DeepEP
Commits
2d655524
Commit
2d655524
authored
Feb 04, 2026
by
lishen
Browse files
fix: fp8 groupsize=0 bug
parent
043aa8f7
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
2 deletions
+2
-2
csrc/kernels/internode_ll.cu
csrc/kernels/internode_ll.cu
+2
-2
No files found.
csrc/kernels/internode_ll.cu
View file @
2d655524
...
...
@@ -462,7 +462,7 @@ LOW_LATENCY_DISPATCH_RECV:
const
auto
num_aligned_scales
=
ALIGN
<
int
>
(
kNumScales
,
sizeof
(
float
)
/
sizeof
(
scale_t
));
const
auto
recv_x_scales
=
static_cast
<
scale_t
*>
(
packed_recv_x_scales
)
+
local_expert_idx
*
num_ranks
*
num_max_dispatch_tokens_per_rank
*
(
kQuant
Typ
e
==
1
?
1
:
num_aligned_scales
);
(
kQuant
GroupSiz
e
==
0
?
1
:
num_aligned_scales
);
// Shared between sub-warps in warp groups
__shared__
int
shared_num_recv_tokens
[
kNumMaxWarpGroups
],
shared_recv_token_begin_idx
[
kNumMaxWarpGroups
];
...
...
@@ -514,7 +514,7 @@ LOW_LATENCY_DISPATCH_RECV:
const
auto
token_stride
=
num_elems_per_pack
;
const
auto
pack_stride
=
num_ranks
*
num_max_dispatch_tokens_per_rank
*
num_elems_per_pack
;
if
constexpr
(
kQuant
Typ
e
==
1
)
{
if
constexpr
(
kQuant
GroupSiz
e
==
0
)
{
if
(
lane_id
==
0
)
{
recv_x_scales
[
token_idx
]
=
ld_nc_global
(
src_scales
);
}
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment