Commit 2544deb6 authored by zhuwenwen's avatar zhuwenwen
Browse files

Merge branch 'v0.15.1-dev-fth' into 'v0.15.1-dev'

修复block-wise 用triton报错的bug

See merge request dcutoolkit/deeplearing/vllm!444
parents e675f1f4 359c16db
......@@ -526,6 +526,7 @@ class W8A8BlockFp8LinearOp:
def _run_triton(
self,
out_features: int,
input_2d: torch.Tensor,
weight: torch.Tensor,
weight_scale: torch.Tensor,
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment