[lora/moe] Avoid extra intermediate buffer & Python slicing in expand phase...
[lora/moe] Avoid extra intermediate buffer & Python slicing in expand phase when split_k == 1 (#32774)
Signed-off-by:
陈建华 <1647430658@qq.com>
Showing
Please register or sign in to comment