    add VLLM_USE_LIGHTOP_MOE_SUM_MUL_ADD · c2e6f453
    zhuwenwen authored
    support prefix cache on kme
    fix the error in test_moe caused by moe align not supporting 511 and 211
    switch multi-modal to the torch implementation on z100l & k100
__init__.py 107 KB