- 11 Nov, 2025 19 commits
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Fanli Lin authored
Signed-off-by:Lin, Fanli <fanli.lin@intel.com>
-
bnellnm authored
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Chaojun Zhang authored
Signed-off-by:chaojun-zhang <chaojun.zhang@intel.com>
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Signed-off-by:
Matthew Bonanni <mbonanni001@gmail.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Lukas Geiger authored
Signed-off-by:Lukas Geiger <lukas.geiger94@gmail.com>
-
Fanli Lin authored
Signed-off-by:Lin, Fanli <fanli.lin@intel.com>
-
Jiangyun Zhu authored
Signed-off-by:zjy0516 <riverclouds.zhu@qq.com>
-
Sage Moore authored
Signed-off-by:Sage Moore <sage@neuralmagic.com>
-
Roger Wang authored
Signed-off-by:Roger Wang <hey@rogerw.io>
-
David Ben-David authored
Signed-off-by:
David Ben-David <davidb@pliops.com> Co-authored-by:
David Ben-David <davidb@pliops.com> Co-authored-by:
Mark McLoughlin <markmc@redhat.com>
-
Robert Shaw authored
Signed-off-by:
Vadim Gimpelson <vadim.gimpelson@gmail.com> Signed-off-by:
Robert Shaw <robertgshaw2@gmail.com> Co-authored-by:
Vadim Gimpelson <vadim.gimpelson@gmail.com>
-
Zuyi Zhao authored
[Frontend] Add sagemaker_standards dynamic lora adapter and stateful session management decorators to vLLM OpenAI API server (#27892) Signed-off-by:
Zuyi Zhao <zhaozuy@amazon.com> Signed-off-by:
Shen Teng <sheteng@amazon.com> Co-authored-by:
Shen Teng <sheteng@amazon.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Zhuohan Li authored
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 10 Nov, 2025 21 commits
-
-
Adrian Abeyta authored
Signed-off-by:adabeyta <aabeyta@redhat.com>
-
Jialin Ouyang authored
Signed-off-by:Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
Ilya Markov authored
Signed-off-by:
Luka Govedič <lgovedic@redhat.com> Signed-off-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Signed-off-by:
ilmarkov <markovilya197@gmail.com> Co-authored-by:
Luka Govedič <lgovedic@redhat.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
Robert Shaw authored
Signed-off-by:Robert Shaw <robertgshaw2@gmail.com>
-
Andrew Xia authored
Signed-off-by:
Andrew Xia <axia@fb.com> Co-authored-by:
Andrew Xia <axia@fb.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Wei Wei authored
Signed-off-by:Wei Wei <wwei6@meta.com>
-
Sage Moore authored
Signed-off-by:
Sage Moore <sage@neuralmagic.com> Signed-off-by:
Sage Moore <sagemoore@utexas.edu> Signed-off-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
Rémi Delacourt authored
Signed-off-by:
Rémi Delacourt <remi@mistral.ai> Signed-off-by:
remi <remi@mistral.ai> Co-authored-by:
Russell Bryant <rbryant@redhat.com>
-
jiahanc authored
Signed-off-by:jiahanc <173873397+jiahanc@users.noreply.github.com>
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <vsundarr@redhat.com> Co-authored-by:
Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
vllmellm authored
[RFC][ROCm][AITER] Keep all AITER kernels in `_aiter_ops` class like `_custom_ops` and `_ipex_ops` (#24490) Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
caozuoba authored
Co-authored-by:Jee Jee Li <pandaleefree@gmail.com>
-
zejunchen-zejun authored
[Rocm][fused_moe][fp4] view weight to torch.float4_e2m1fn_x2 when running aiter fused moe for fp4 model (#27474) Signed-off-by:zejunchen-zejun <zejun.chen@amd.com>
-
Ferrebo authored
Signed-off-by:
Ferrebo <itachi971009@gmail.com> Signed-off-by:
kebo01 <kebo01@baidu.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Yu Jiaqi authored
Signed-off-by:piood <2477084691@qq.com>
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Xiake Sun authored
[Hardware][AMD][Model] Add Triton MoE tuning support and optimized configs for Qwen3 omni for MI308X (#28373) Signed-off-by:
Xiake Sun <xiake.sun@amd.com> Signed-off-by:
Xiake Sun <xisun@amd.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-