Commit 456e8c10 authored by guanyu1, committed by zhangzbb

[BUGFIX] With the 2D mrope position representation, kernels cannot be launched ahead of time on DCU

parent b233584a
......
@@ -816,8 +816,12 @@ class GPUModelRunner(
         non_blocking=True,
     )
     return
+# self.mrope_positions.gpu[:, :num_tokens].copy_(
+#     self.mrope_positions.cpu[:, :num_tokens],
+#     non_blocking=True,
+# )
 self.mrope_positions.gpu[:, :num_tokens].copy_(
-    self.mrope_positions.cpu[:, :num_tokens],
+    self.mrope_positions.cpu[:, :num_tokens].contiguous().pin_memory(),
     non_blocking=True,
 )
......
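
The commit does not say why the extra `.contiguous().pin_memory()` helps. One plausible reading, offered here as an assumption, is that `self.mrope_positions.cpu[:, :num_tokens]` is a column slice of a larger 2D buffer and therefore not a single dense block of memory, which may keep the host-to-device copy from being truly asynchronous on DCU. The torch-free sketch below only illustrates the stride arithmetic behind that reading; the helper names and the 3 x 16 buffer shape are hypothetical and not taken from the repository.

```python
# Torch-free sketch: why a column slice of a 2D row-major buffer is not dense.
# All helper names and the buffer shape are hypothetical, for illustration only.


def row_major_strides(shape):
    """Element strides of a densely packed row-major array with this shape."""
    strides = []
    step = 1
    for dim in reversed(shape):
        strides.append(step)
        step *= dim
    return tuple(reversed(strides))


def is_contiguous(shape, strides):
    """True if (shape, strides) describes one dense row-major block."""
    expected = 1
    for dim, stride in zip(reversed(shape), reversed(strides)):
        # Size-1 dimensions never affect the memory layout, so skip them.
        if dim != 1 and stride != expected:
            return False
        expected *= dim
    return True


def slice_columns(shape, strides, num_tokens):
    """Model buf[:, :num_tokens]: the shape shrinks, the strides are unchanged."""
    rows, cols = shape
    return (rows, min(num_tokens, cols)), strides


# Assumed layout: 3 position rows with a capacity of 16 tokens per row.
full_shape = (3, 16)
full_strides = row_major_strides(full_shape)

# The sliced view keeps the row stride of the full buffer, so rows have gaps.
view_shape, view_strides = slice_columns(full_shape, full_strides, 8)
print(view_shape, view_strides, is_contiguous(view_shape, view_strides))
# prints: (3, 8) (16, 1) False

# A packed copy of the view (the effect of .contiguous()) is dense again.
packed_strides = row_major_strides(view_shape)
print(view_shape, packed_strides, is_contiguous(view_shape, packed_strides))
# prints: (3, 8) (8, 1) True
```

Under this reading, the slice is dense only when `num_tokens` equals the buffer capacity, which is why the patched line first packs the slice and then pins the packed copy. The cost is an extra host-side copy and pinned allocation on every call, so a preallocated pinned staging buffer could be a cheaper alternative if this path is hot.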