"vllm/vscode:/vscode.git/clone" did not exist on "350c94deb30747f84536ee34d91c6fca564667ce"
  • zhuwenwen's avatar
    support prefix cache on kme · 6f1db287
    zhuwenwen authored
    fix the error in test_moe caused by moe align not supporting 511
    multi-modal switching to torch implementation on z100l&k100
    6f1db287
test_moe.py 27.2 KB