"benchmarks/vscode:/vscode.git/clone" did not exist on "3f3b6b21500bce2061cae33706bd47c8b6663771"
  1. 21 Jan, 2026 2 commits
    • laibao's avatar
      feat(moe/marlin): Marlin W16A16 MoE 自动探测并预打包(去掉手动开关) · de588fab
      laibao authored
        - 移除 VLLM_USE_MARLIN_W16A16_MOE 环境变量
        - 初始化阶段基于 lightop 探测并缓存 _marlin_w16a16_moe_enabled,满足条件强制 use_nn_moe=False
        - 权重加载后按缓存结果一次性 Marlin pack;运行时按 packed 标记走 Marlin fast path
      de588fab
    • laibao's avatar
      perf(qwen3): 融合 q/k RMSNorm + RoPE · 7cd7bf8a
      laibao authored
      新增 VLLM_USE_FUSED_RMS_ROPE 分支,走 fused 路径
      注册 torch.ops.vllm.rms_rotary_embedding_fuse(direct_register_custom_op)
      cos_sin_cache 自动转 device/dtype 并缓存,避免每次重复拷贝
      7cd7bf8a
  2. 20 Jan, 2026 2 commits
  3. 19 Jan, 2026 2 commits
  4. 17 Jan, 2026 4 commits
  5. 16 Jan, 2026 7 commits
  6. 15 Jan, 2026 3 commits
  7. 14 Jan, 2026 2 commits
  8. 13 Jan, 2026 3 commits
  9. 12 Jan, 2026 5 commits
  10. 09 Jan, 2026 3 commits
  11. 08 Jan, 2026 2 commits
  12. 07 Jan, 2026 3 commits
  13. 06 Jan, 2026 2 commits