- 21 Nov, 2025 9 commits
-
-
Russell Bryant authored
Signed-off-by: Russell Bryant <rbryant@redhat.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Huamin Li authored
Signed-off-by: Huamin Li <3ericli@gmail.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Hongxia Yang authored
Signed-off-by: Hongxia Yang <hongxia.yang@amd.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Jee Jee Li authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
-
Wentao Ye authored
[Feature] Shared Experts Overlap with FI deepgemm swap kernel, 2.2% throughput improvement and 3.6% TTFT improvement (#28879)
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
- 20 Nov, 2025 12 commits
-
-
Fanli Lin authored
Signed-off-by: Fanli Lin <fanli.lin@intel.com>
-
Zhewen Li authored
Signed-off-by: zhewenli <zhewenli@meta.com>
-
Shinichi Hemmi authored
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
-
Pleaplusone authored
Signed-off-by: ganyi <ygan@amd.com>
-
Anna Shors authored
Signed-off-by: ashors1 <ashors@nvidia.com>
Co-authored-by: Chen Zhang <zhangch99@outlook.com>
-
Dezhan authored
Co-authored-by: Dezhan Tu <dztu@meta.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Pleaplusone authored
[ROCm][BugFix] Fix shared expert loading error when disable `VLLM_ROCM_USE_AITER_FUSION_SHARED_EXPERTS` (#28633)
Signed-off-by: ganyi <ygan@amd.com>
-
Lukas Geiger authored
Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
-
Isotr0py authored
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
-
Shengliang Xu authored
Signed-off-by: Shengliang Xu <shengliangx@nvidia.com>
-
liangel-02 authored
Signed-off-by: Angel Li <liangel@meta.com>
-
- 19 Nov, 2025 19 commits
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
JartX authored
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Max Hu authored
Signed-off-by: Max Hu <hyoung2991@gmail.com>
-
Yongye Zhu authored
Signed-off-by: Yongye Zhu <zyy1102000@gmail.com>
-
Shu Wang authored
Signed-off-by: Shu Wang. <shuw@nvidia.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Qiu authored
Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com>
Signed-off-by: FENP <yuanyongjie.yyj@antgroup.com>
Signed-off-by: LookAround <lixushi@huawei.com>
Signed-off-by: Jingchun Gao <gaojingchun1@huawei.com>
Signed-off-by: zhenwenqi2024 <zhenwenqi_2022@qq.com>
Co-authored-by: FENP <yuanyongjie.yyj@antgroup.com>
Co-authored-by: LookAround <lixushi@huawei.com>
Co-authored-by: Jingchun Gao <gaojingchun1@huawei.com>
Co-authored-by: zhenwenqi2024 <zhenwenqi_2022@qq.com>
Co-authored-by: Jingchun Gao <63247409+gjc0824@users.noreply.github.com>
-
Izzy Putterman authored
Signed-off-by: Izzy Putterman <iputterman@nvidia.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
-
杰兮 authored
Signed-off-by: zhyajie <yajizhan@amd.com>
Co-authored-by: zhyajie <yajizhan@amd.com>
-
Robert Shaw authored
Signed-off-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Yuxuan Zhang authored
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Shanshan Shen authored
[Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487)
Signed-off-by: shen-shanshan <467638484@qq.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Chen Bruce authored
Signed-off-by: bruceszchen <bruceszchen@tencent.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Lukas Geiger authored
Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
-
Gleb Kurchanov authored
Signed-off-by: Gleb Kurchanov <nepherpitou@gmail.com>
-
Xin Yang authored
Signed-off-by: Xin Yang <xyangx@amazon.com>
-