- 21 Nov, 2025 4 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Xiao Li authored
[AITER] [ROCm] Fix crash when loading llama4 model with old aiter version installed, fallback to forward_native implementation (#29124) Signed-off-by:Xiao Li <ilx@meta.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Wentao Ye authored
[Feature] Shared Experts Overlap with FI deepgemm swap kernel, 2.2% throughput improvement and 3.6% TTFT improvement (#28879) Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 20 Nov, 2025 27 commits
-
-
Driss Guessous authored
Signed-off-by:drisspg <drisspguessous@gmail.com>
-
Software Developer authored
Signed-off-by:dsuhinin <suhinin.dmitriy@gmail.com>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
rookie authored
Signed-off-by:
zhangguozhu <zhangguozhu@360.cn> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
zhangguozhu <zhangguozhu@360.cn> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
Fanli Lin authored
Signed-off-by:Fanli Lin <fanli.lin@intel.com>
-
Zhewen Li authored
Signed-off-by:zhewenli <zhewenli@meta.com>
-
Samit authored
Signed-off-by:
SamitHuang <285365963@qq.com> Signed-off-by:
Samit <285365963@qq.com> Signed-off-by:
samithuang <285365963@qq.com> Co-authored-by:
22quinn <33176974+22quinn@users.noreply.github.com>
-
Shinichi Hemmi authored
Signed-off-by:Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
-
Or Ozeri authored
Signed-off-by:
Or Ozeri <oro@il.ibm.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
Anna Shors authored
Signed-off-by:
ashors1 <ashors@nvidia.com> Co-authored-by:
Chen Zhang <zhangch99@outlook.com>
-
Dezhan authored
Co-authored-by:Dezhan Tu <dztu@meta.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Pleaplusone authored
[ROCm][BugFix] Fix shared expert loading error when disable `VLLM_ROCM_USE_AITER_FUSION_SHARED_EXPERTS` (#28633) Signed-off-by:ganyi <ygan@amd.com>
-
Cyrus Leung authored
Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
Quentin Gallouédec authored
Signed-off-by:
Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Lukas Geiger authored
Signed-off-by:Lukas Geiger <lukas.geiger94@gmail.com>
-
Isotr0py authored
Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Canlin Guo authored
Signed-off-by:
gcanlin <canlinguosdu@gmail.com> Signed-off-by:
凌葭 <lvjiang.lj@alibaba-inc.com> Co-authored-by:
凌葭 <lvjiang.lj@alibaba-inc.com>
-
prashanth058 authored
Signed-off-by:prashanth058 <prashanth.dannamaneni@uipath.com>
-
Shengliang Xu authored
Signed-off-by:Shengliang Xu <shengliangx@nvidia.com>
-
Benjamin Chislett authored
Signed-off-by:
Benjamin Chislett <bchislett@nvidia.com> Signed-off-by:
Benjamin Chislett <chislett.ben@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Qiang Zhang authored
Signed-off-by:chiangzhang <chiangzhang@tencent.com>
-
Kuntai Du authored
[DeepSeek + LMCache Multiprocess] handle MLA for deepseek model + LMCache Multiprocess connector (#29039) Signed-off-by:KuntaiDu <kuntai@uchicago.edu>
-
liangel-02 authored
Signed-off-by:Angel Li <liangel@meta.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Jialin Ouyang authored
Signed-off-by:Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
- 19 Nov, 2025 9 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Alexander Matveev authored
Signed-off-by:
Alexander Matveev <amatveev@redhat.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
JartX authored
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Max Hu authored
Signed-off-by:Max Hu <hyoung2991@gmail.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Yongye Zhu authored
Signed-off-by:Yongye Zhu <zyy1102000@gmail.com>
-
Shu Wang authored
Signed-off-by:
Shu Wang. <shuw@nvidia.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
Julien Denize authored
Signed-off-by:Julien Denize <julien.denize@mistral.ai>
-