perf(fused-moe): 接入 W16A16 Marlin MoE 并缓存 pack 权重 See merge request dcutoolkit/deeplearing/vllm!347
Attach a file by drag & drop or click to upload