"docs/vscode:/vscode.git/clone" did not exist on "12aed7e453aea713495bd7cced6f9e2bb78aaa79"
Unverified Commit c870eb9e authored by Xin Yang's avatar Xin Yang Committed by GitHub
Browse files

[LoRA] Update LoRA expand kernel block_n calculation (#32621)


Signed-off-by: default avatarXin Yang <xyangx@amazon.com>
parent 6af03f23
......@@ -251,7 +251,7 @@ def get_lora_op_configs(
else:
default = {
"block_m": 64,
"block_n": max(64, next_power_of_2(128 // num_slices)),
"block_n": 64 if num_slices > 1 else 128,
"block_k": 16,
"num_warps": 4,
"num_ctas": 1,
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment