Merge branch 'v0.11.0-dev_marlin_opt' into 'v0.11.0-dev'
feat(moe/marlin): 移除 VLLM_USE_MARLIN_W16A16_MOE,改为基于 lightop 探测自动启用并一次性缓存决策 See merge request dcutoolkit/deeplearing/vllm!376
Showing
Please register or sign in to comment
feat(moe/marlin): 移除 VLLM_USE_MARLIN_W16A16_MOE,改为基于 lightop 探测自动启用并一次性缓存决策 See merge request dcutoolkit/deeplearing/vllm!376