sglang
Commit 98522149
Authored Jan 25, 2025 by yizhang2077; committed by GitHub on Jan 25, 2025
mirror fix for custom allreduce (#3124)
parent 5d9d15e7
Showing 1 changed file with 1 addition and 1 deletion
sgl-kernel/src/sgl-kernel/csrc/trt_reduce_internal.cu
@@ -160,7 +160,7 @@ __inline__ __device__ void block_barrier(uint32_t** signals, uint32_t const flag
 }

 template <typename T, int RANKS_PER_NODE, bool COPY_INPUT = true>
-static __global__ void oneShotAllReduceKernel(AllReduceParams params) {
+static __global__ void __launch_bounds__(512, 1) oneShotAllReduceKernel(AllReduceParams params) {
   // Suppose that two GPUs participate in the AR exchange, and we start four blocks.
   // The message is partitioned into chunks as detailed below:
   // message
...
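The only change in this hunk is the `__launch_bounds__(512, 1)` qualifier on the kernel. As a rough illustration of what that qualifier does, here is a minimal standalone sketch. It does not reproduce the sglang kernel: `scaleKernel`, `kMaxThreads`, and `blocksFor` are hypothetical names introduced only for this example. The first argument caps the threads per block the kernel may be launched with, and the second is the minimum number of resident blocks per multiprocessor the compiler should plan for.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical constant mirroring the first __launch_bounds__ argument.
constexpr int kMaxThreads = 512;

// Hypothetical kernel (not from sglang). The qualifier sits between the
// return type and the kernel name, as in the diff above.
template <typename T>
static __global__ void __launch_bounds__(kMaxThreads, 1) scaleKernel(T* data, T factor, int n) {
  int i = blockIdx.x * blockDim.x + threadIdx.x;
  if (i < n) data[i] *= factor;
}

// Host helper: number of blocks needed to cover n elements (ceiling division).
int blocksFor(int n, int threadsPerBlock) {
  return (n + threadsPerBlock - 1) / threadsPerBlock;
}

int main() {
  // Ask the runtime what limits it recorded for this kernel instantiation.
  cudaFuncAttributes attr;
  cudaError_t err = cudaFuncGetAttributes(&attr, scaleKernel<float>);
  if (err != cudaSuccess) {
    std::printf("cudaFuncGetAttributes failed: %s\n", cudaGetErrorString(err));
    return 1;
  }
  std::printf("maxThreadsPerBlock = %d, registers per thread = %d\n",
              attr.maxThreadsPerBlock, attr.numRegs);

  const int n = 1000;
  float* d = nullptr;
  if (cudaMalloc(&d, n * sizeof(float)) != cudaSuccess) return 1;
  cudaMemset(d, 0, n * sizeof(float));

  // Launch within the declared bound: kMaxThreads threads per block.
  scaleKernel<float><<<blocksFor(n, kMaxThreads), kMaxThreads>>>(d, 2.0f, n);
  err = cudaDeviceSynchronize();
  std::printf("launch with %d threads per block: %s\n", kMaxThreads, cudaGetErrorString(err));

  cudaFree(d);
  return err == cudaSuccess ? 0 : 1;
}
```

A launch configuration with more than 512 threads per block would violate the declared bound and fail to launch, so any host code that launches such a kernel needs to respect the same limit.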