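- Add an environment-variable switch that disables the Marlin W16A16 MoE path.
- Raise an explicit error when Triton is forced but the weights are already Marlin-packed.
- Make Marlin support detection best-effort (it no longer depends on VLLM_USE_LIGHTOP).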
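
The selection logic described above might look roughly like the sketch below. It is illustrative only and is not the code from this change: the variable name `EXAMPLE_DISABLE_MARLIN_W16A16_MOE`, the module name `example_marlin_kernels`, and all three function names are hypothetical, since the description does not name the real ones.

```python
import importlib
import os

# Hypothetical variable name; the PR text does not state the real one.
DISABLE_ENV = "EXAMPLE_DISABLE_MARLIN_W16A16_MOE"


def marlin_disabled(env=None):
    """Return True when the (hypothetical) opt-out switch is set."""
    env = os.environ if env is None else env
    value = env.get(DISABLE_ENV, "0").strip().lower()
    return value in {"1", "true", "yes", "on"}


def probe_marlin_support(module_name="example_marlin_kernels"):
    """Best-effort probe: any failure means 'unsupported', never a crash.

    The module name is a placeholder for whatever provides the kernels.
    """
    try:
        importlib.import_module(module_name)
    except Exception:
        return False
    return True


def select_moe_backend(weights_marlin_packed, env=None, marlin_supported=None):
    """Pick 'marlin' or 'triton' for the W16A16 MoE path."""
    if marlin_supported is None:
        marlin_supported = probe_marlin_support()

    if marlin_disabled(env):
        # Triton is forced; it cannot consume weights that were already
        # repacked into the Marlin layout, so fail loudly instead of
        # silently producing wrong results.
        if weights_marlin_packed:
            raise RuntimeError(
                f"{DISABLE_ENV} forces the Triton MoE path, but the weights "
                "are already Marlin-packed; unset the variable or reload "
                "unpacked weights."
            )
        return "triton"

    return "marlin" if marlin_supported else "triton"
```

The probe result and the environment are passed in as parameters so the decision can be exercised without touching the process environment or requiring the kernels to be installed.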