Merge branch 'v0.9.2-dev-tc_opt' into 'v0.9.2-dev'
fix: fix crash caused by expanded sampling metadata being incompatible with numpy/array-like inputs

perf(fused-moe): pre-pack Marlin W16A16 MoE weights to reduce peak GPU memory during warmup

See merge request dcutoolkit/deeplearing/vllm!357