[Bugfix][Mooncake] Fix thread-local CUDA context for NVLink transfers in _send_blocks (#39548)
Signed-off-by:Zhewen Li <zhewenli@inferact.ai> Co-authored-by:
Zhewen Li <zhewenli@inferact.ai>
Showing
Please register or sign in to comment
Signed-off-by:Zhewen Li <zhewenli@inferact.ai> Co-authored-by:
Zhewen Li <zhewenli@inferact.ai>