Commit e729231c authored by lijian's avatar lijian
Browse files

fix hang when num-experts>256.


Signed-off-by: default avatarlijian <34831075+lijian0711@users.noreply.github.com>
parent a4b49551
...@@ -855,7 +855,7 @@ combine(void* combined_x, ...@@ -855,7 +855,7 @@ combine(void* combined_x,
syncwarp(); syncwarp();
while (sync_large_warp_counters[warp_group_id] < num_warps_per_group); while (sync_large_warp_counters[warp_group_id] < num_warps_per_group);
if (sub_warp_id == 1 and lane_id == 0) { if (sub_warp_id == 0 and lane_id == 0) {
while (ld_acquire_global(atomic_clean_flag) == 0); while (ld_acquire_global(atomic_clean_flag) == 0);
auto dst_ptr = rdma_recv_flag + global_expert_idx; auto dst_ptr = rdma_recv_flag + global_expert_idx;
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment