- 20 Jan, 2026 5 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 19 Jan, 2026 6 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Tomas Ruiz authored
Signed-off-by:Tomas Ruiz <tomas.ruiz.te@gmail.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Vadim Gimpelson authored
[BUGFIX] Fix degenerate strides in TRTLLM query tensors for FlashInfer backend. Fixes issue #32353 (#32417) Signed-off-by:Vadim Gimpelson <vadim.gimpelson@gmail.com>
-
- 18 Jan, 2026 5 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Li Xie authored
Signed-off-by:xieli <xieli@stepfun.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 17 Jan, 2026 1 commit
-
-
Guofang.Tang authored
Signed-off-by:
Guofang Tang <tinggofun@gmail.com> Co-authored-by:
Guofang Tang <tinggofun@gmail.com>
-
- 16 Jan, 2026 2 commits
-
-
Chenyaaang authored
Signed-off-by:Chenyaaang <chenyangli@google.com>
-
Hongxin Xu authored
Signed-off-by:
xhx1022 <1737006628@qq.com> Co-authored-by:
arlenxu <arlenxu@tencent.com>
-
- 15 Jan, 2026 9 commits
-
-
Matthias Gehre authored
Signed-off-by:Matthias Gehre <matthias.gehre@amd.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
Pleaplusone authored
[ROCm][Perf] Enable shuffle kv cache layout and assembly paged attention kernel for `AiterFlashAttentionBackend` (#29887) Signed-off-by:ganyi <ygan@amd.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
dtc authored
Signed-off-by:Tianchen Ding <dtcccc@linux.alibaba.com>
-
Ofir Zafrir authored
Signed-off-by:Ofir Zafrir <ofir.zafrir@intel.com>
-
Ning Xie authored
Signed-off-by:Andy Xie <andy.xning@gmail.com>
-
- 14 Jan, 2026 7 commits
-
-
Lumosis authored
Signed-off-by:Lihao Ran <imlihao.ran@gmail.com>
-
vllmellm authored
[Bugfix][ROCm][performance] Resolve the performance regression issue of the Qwen3-Next-80B-A3B-Thinking under rocm_atten (#32336) Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Shanshan Shen authored
Signed-off-by:shen-shanshan <467638484@qq.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Angela Yi authored
Signed-off-by:angelayi <yiangela7@gmail.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 13 Jan, 2026 5 commits
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Mickaël Seznec authored
Signed-off-by:
Mickael Seznec <mickael@mistral.ai> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-