support prefix cache on kme
fix the error in test_moe caused by moe align not supporting 511; switch multi-modal to the torch implementation on z100l & k100