Flashinfer_CUTLASS_MOE fuses quantization for TP (#27223)
Signed-off-by: Shu Wang. <shuw@nvidia.com>