Commit a02a5490 authored by silencealiang

Reduce CPU-side overhead

parent 1310cbf8
Pipeline #2463 passed with stage
@@ -312,7 +312,7 @@ def device_limited_topk(
     return probs, top_indices
 
 
 @torch.compile(options={"triton.cudagraphs": True, "triton.cudagraph_trees": False})
 def topk_softmax_with_capacity(
     logits: torch.Tensor,
     topk: int,
......
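
For context, the decorator shown in the hunk passes Inductor options to `torch.compile`; PyTorch documents `triton.cudagraphs` as a way to reduce Python overhead by using CUDA graphs, which fits the commit message. The sketch below shows the same options applied to a small routing function. It is only an illustration under stated assumptions: `maybe_compile` and `topk_softmax` are hypothetical names, the function body is a simplified stand-in rather than the body of `topk_softmax_with_capacity`, and the CPU fallback is added here so the sketch also runs on machines without a GPU.

```python
import torch


def maybe_compile(fn):
    """Hypothetical helper: compile with CUDA graphs only when a GPU is present."""
    if torch.cuda.is_available():
        # Same options as the decorator in the hunk above.
        return torch.compile(
            fn,
            options={"triton.cudagraphs": True, "triton.cudagraph_trees": False},
        )
    # Assumption for this sketch: without CUDA, keep the function in eager mode.
    return fn


@maybe_compile
def topk_softmax(logits: torch.Tensor, topk: int):
    """Simplified stand-in: softmax over experts, then keep the top-k entries."""
    probs = torch.softmax(logits, dim=-1)
    top_probs, top_indices = torch.topk(probs, k=topk, dim=-1)
    return top_probs, top_indices


device = "cuda" if torch.cuda.is_available() else "cpu"
logits = torch.tensor([[1.0, 3.0, 2.0], [0.5, 0.1, 0.4]], device=device)
top_probs, top_indices = topk_softmax(logits, 2)
```

When CUDA graphs are active, the returned tensors may live in buffers that a later call reuses, so callers that need to keep a result across calls would typically clone it first.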