- 22 Jan, 2026 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 21 Jan, 2026 10 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
elvischenv authored
Signed-off-by:elvischenv <219235043+elvischenv@users.noreply.github.com>
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
Robert Shaw authored
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
shanjiaz authored
Signed-off-by:
shanjiaz <zsjwpianpian@gmail.com> Signed-off-by:
shanjiaz <43143795+shanjiaz@users.noreply.github.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
- 20 Jan, 2026 10 commits
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com>
-
Shinichi Hemmi authored
Signed-off-by:
Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com> Co-authored-by:
Kenichi Maehashi <maehashi@preferred.jp>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
杨朱 · Kiki authored
This PR completes the removal of the deprecated vllm:time_per_output_token_seconds metric that was deprecated in v0.11, hidden in v0.12, scheduled for removal in v0.13, but delayed until v0.15. Signed-off-by:
carlory <baofa.fan@daocloud.io> Co-authored-by:
Claude Haiku 4.5 <noreply@anthropic.com>
-
Walter Beller-Morales authored
Signed-off-by:walterbm <walter.beller.morales@gmail.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 19 Jan, 2026 6 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Tomas Ruiz authored
Signed-off-by:Tomas Ruiz <tomas.ruiz.te@gmail.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Vadim Gimpelson authored
[BUGFIX] Fix degenerate strides in TRTLLM query tensors for FlashInfer backend. Fixes issue #32353 (#32417) Signed-off-by:Vadim Gimpelson <vadim.gimpelson@gmail.com>
-
- 18 Jan, 2026 5 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Li Xie authored
Signed-off-by:xieli <xieli@stepfun.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 17 Jan, 2026 1 commit
-
-
Guofang.Tang authored
Signed-off-by:
Guofang Tang <tinggofun@gmail.com> Co-authored-by:
Guofang Tang <tinggofun@gmail.com>
-
- 16 Jan, 2026 2 commits
-
-
Chenyaaang authored
Signed-off-by:Chenyaaang <chenyangli@google.com>
-
Hongxin Xu authored
Signed-off-by:
xhx1022 <1737006628@qq.com> Co-authored-by:
arlenxu <arlenxu@tencent.com>
-
- 15 Jan, 2026 5 commits
-
-
Matthias Gehre authored
Signed-off-by:Matthias Gehre <matthias.gehre@amd.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
Pleaplusone authored
[ROCm][Perf] Enable shuffle kv cache layout and assembly paged attention kernel for `AiterFlashAttentionBackend` (#29887) Signed-off-by:ganyi <ygan@amd.com>
-