add VLLM_USE_LIGHTOP_MOE_SUM_MUL_ADD
support prefix cache on kme fix the error in test_moe caused by moe align not supporting 511 and 211 multi-modal switching to torch implementation on z100l&k100
Showing
Please register or sign in to comment