Unverified Commit ee484b3f authored by Danielle Robinson's avatar Danielle Robinson Committed by GitHub
Browse files

Set splitk=1 for fused-moe-lora expand kernel (#32882)


Signed-off-by: default avatarDanielle Robinson <dmmaddix@amazon.com>
Co-authored-by: default avatarDanielle Robinson <dmmaddix@amazon.com>
Co-authored-by: default avatarJee Jee Li <pandaleefree@gmail.com>
parent a9b53dd4
......@@ -351,7 +351,7 @@ def _fused_moe_lora_expand(
"GROUP_SIZE_M": group_size_m,
"num_warps": num_warps,
"num_stages": num_stages,
"SPLIT_K": split_k, # Set split_k = 1 for expand calls
"SPLIT_K": 1, # Set split_k = 1 for expand calls
"USE_GDC": use_gdc,
"launch_pdl": use_gdc, # triton kernel metadata
}
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment