Use one qp per sm for internode normal kernels (#181)
let the sender SM use the channel_id, and the receiver SM use channel_id + num_channels
Showing
Please register or sign in to comment
let the sender SM use the channel_id, and the receiver SM use channel_id + num_channels