"vllm/v1/attention/ops/flashmla.py" did not exist on "a3416fe1dc6f5580aea842be2a49b783e80d2fb4"
[lora/moe] Avoid extra intermediate buffer & Python slicing in expand phase...
[lora/moe] Avoid extra intermediate buffer & Python slicing in expand phase when split_k == 1 (#32774)
Signed-off-by:
陈建华 <1647430658@qq.com>
Showing
Please register or sign in to comment