  1. 07 Apr, 2026 1 commit
  2. 02 Apr, 2026 1 commit
  3. 26 Mar, 2026 1 commit
    • feat(v1 attention): add unified KV layout support to ROCm FlashAttention and pass through mm_prefix, qq_bias, and use_alibi_sqrt · ea9b8584
      laibao authored
      - Add unified KV layout selection logic to the ROCm FlashAttention backend
      - Wire in the unified varlen kernel call path
      - Pass mm_prefix_range and qq_bias through the FlashAttention metadata
  4. 24 Mar, 2026 1 commit
  5. 23 Mar, 2026 1 commit
  6. 16 Mar, 2026 2 commits
  7. 12 Mar, 2026 3 commits
  8. 04 Mar, 2026 2 commits
  9. 03 Mar, 2026 1 commit
  10. 02 Mar, 2026 1 commit
  11. 26 Feb, 2026 1 commit
  12. 24 Feb, 2026 1 commit
    • perf(v1): add an optional fast token-id copy path · d3a95d54
      laibao authored
        - Add the environment variable `VLLM_V1_FAST_TOKEN_ID_COPY` (disabled by default)
        - Cache the prompt token ids as an int32 numpy array in `CachedRequestState`
        - When enabled, copy prompt/output token ids in `InputBatch` with `np.copyto`
  13. 08 Feb, 2026 1 commit
  14. 06 Feb, 2026 1 commit
  15. 05 Feb, 2026 1 commit
  16. 02 Feb, 2026 1 commit
  17. 27 Jan, 2026 1 commit
  18. 26 Jan, 2026 3 commits
  19. 25 Jan, 2026 1 commit
  20. 24 Jan, 2026 4 commits
  21. 23 Jan, 2026 4 commits
  22. 22 Jan, 2026 3 commits
  23. 21 Jan, 2026 3 commits
  24. 20 Jan, 2026 1 commit
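
Note on d3a95d54 (fast token-id copy path): the idea described in that commit message can be sketched roughly as below. This is an illustration only, not the code from the commit. The helper names `fast_copy_enabled`, `cache_prompt_ids`, and `fill_row` are hypothetical, and treating the value `"1"` as "enabled" is an assumption about how the flag is parsed. Only the environment variable name, the int32 numpy cache, and the use of `np.copyto` come from the commit message.

```python
import os

import numpy as np


def fast_copy_enabled() -> bool:
    # Assumption: "1" turns the flag on; the commit only says it is off by default.
    return os.environ.get("VLLM_V1_FAST_TOKEN_ID_COPY", "0") == "1"


def cache_prompt_ids(prompt_token_ids: list) -> np.ndarray:
    # Convert once per request so later copies avoid list -> array conversion.
    return np.asarray(prompt_token_ids, dtype=np.int32)


def fill_row(row: np.ndarray, prompt_ids: np.ndarray,
             output_token_ids: list, fast: bool) -> int:
    """Write prompt ids followed by output ids into one row of a batch buffer.

    Returns the number of token ids written.
    """
    n_prompt = len(prompt_ids)
    n_total = n_prompt + len(output_token_ids)
    if fast:
        # Fast path: array-to-array copies into views of the destination row.
        np.copyto(row[:n_prompt], prompt_ids)
        if output_token_ids:
            np.copyto(row[n_prompt:n_total],
                      np.asarray(output_token_ids, dtype=np.int32))
    else:
        # Baseline path: plain slice assignment from Python sequences.
        row[:n_prompt] = prompt_ids.tolist()
        row[n_prompt:n_total] = output_token_ids
    return n_total


# Usage: one row of a hypothetical (num_requests, max_len) int32 buffer.
row = np.zeros(8, dtype=np.int32)
written = fill_row(row, cache_prompt_ids([5, 6, 7]), [8, 9], fast=True)
```

Both branches leave the same contents in the row. The fast path only changes how the data gets there, which fits the commit keeping it behind an opt-in flag.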