"vscode:/vscode.git/clone" did not exist on "a1e9e36faa18d35d4afe8bc6e91311f9d6b8c3c0"
Commit 66017f31 authored by zhangqha's avatar zhangqha
Browse files

Merge branch 'v0.15.1-dev_dsa_mqa' into 'v0.15.1-dev'

feat:修复dsa的mqa接口兼容glm5

See merge request dcutoolkit/deeplearing/vllm!478
parents f7461a96 66979358
...@@ -121,9 +121,10 @@ def sparse_attn_indexer( ...@@ -121,9 +121,10 @@ def sparse_attn_indexer(
chunk.cu_seqlen_ke, chunk.cu_seqlen_ke,
q_fp8[chunk.token_start:chunk.token_end].shape[0], q_fp8[chunk.token_start:chunk.token_end].shape[0],
k.shape[0], k.shape[0],
64, q_fp8.shape[1],
128, q_fp8.shape[2],
True, None,
True
) )
num_rows = logits.shape[0] num_rows = logits.shape[0]
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment