- 22 Dec, 2025 3 commits
-
-
Kevin McKay authored
Signed-off-by: c0de128 <kevin.mckay@outlook.com>
-
CedricHuang authored
[Feature]: Support NVIDIA ModelOpt HF FP8 variants FP8_PER_CHANNEL_PER_TOKEN and FP8_PB_WO in vLLM (#30957)
-
Jee Jee Li authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
-
- 21 Dec, 2025 2 commits
-
-
Robert Shaw authored
-
汪志鹏 authored
Signed-off-by: princepride <wangzhipeng628@gmail.com>
Signed-off-by: 汪志鹏 <wangzhipeng628@gmail.com>
Co-authored-by: Chauncey <chaunceyjiang@gmail.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
Co-authored-by: bbrowning <bbrownin@redhat.com>
-
- 20 Dec, 2025 3 commits
-
-
baonudesifeizhai authored
Signed-off-by: baonudesifeizhai <85092850+baonudesifeizhai@users.noreply.github.com>
Signed-off-by: Dongjie Zou <85092850+baonudesifeizhai@users.noreply.github.com>
Signed-off-by: baonudesifeizhai <baonudesifeizhai@gmail.com>
Signed-off-by: Robert Shaw <robertgshaw2@gmail.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <robertgshaw2@gmail.com>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
zejunchen-zejun authored
[Bugfix] fix the alias bug of AttentionBackendEnum when register CUSTOM attention backend to vllm (#30869)
Signed-off-by: zejunchen-zejun <zejun.chen@amd.com>
-
- 19 Dec, 2025 10 commits
-
-
Robert Shaw authored
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
-
Seiji Eicher authored
Signed-off-by: Seiji Eicher <seiji@anyscale.com>
-
Zhonghua Deng authored
Signed-off-by: Abatom <abzhonghua@gmail.com>
Signed-off-by: Jumiar <liuanqim10@126.com>
Signed-off-by: Zyann7 <zyann7@outlook.com>
Co-authored-by: Jumiar <liuanqim10@126.com>
Co-authored-by: Zyann7 <zyann7@outlook.com>
Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>
-
Marko Rosenmueller authored
Signed-off-by: Marko Rosenmueller <5467316+dr75@users.noreply.github.com>
Co-authored-by: Chauncey <chaunceyjiang@gmail.com>
-
lif authored
Signed-off-by: majiayu000 <1835304752@qq.com>
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
-
Wenqi Glantz authored
Signed-off-by: Wenqi Glantz <wglantz@nvidia.com>
-
Nick Hill authored
Signed-off-by: Nick Hill <nhill@redhat.com>
-
Andreas Karatzas authored
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
-
PlatinumGod authored
Signed-off-by: yujiepu <pyjapple@gmail.com>
Signed-off-by: PlatinumGod <pyjapple@gmail.com>
Co-authored-by: Chauncey <chaunceyjiang@gmail.com>
-
Nick Hill authored
Signed-off-by: Nick Hill <nhill@redhat.com>
-
- 18 Dec, 2025 12 commits
-
-
Elizabeth Thomas authored
Signed-off-by: Elizabeth Thomas <email2eliza@gmail.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Isotr0py authored
[MM Encoder]: Migrate legacy ViT `MultiHeadAttention` to new `MMEncoderAttention` interface (#30684)
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
inkcherry authored
Signed-off-by: inkcherry <mingzhi.liu@amd.com>
-
sarathc-cerebras authored
Signed-off-by: sarathc-cerebras <sarath.chandran@cerebras.net>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Yifan Qiao authored
Signed-off-by: Yifan Qiao <yifanqiao@berkeley.edu>
-
Li, Jiang authored
Signed-off-by: jiang1.li <jiang1.li@intel.com>
-
Zhengxu Chen authored
Signed-off-by: zhxchen17 <zhxchen17@fb.com>
-
Nicolò Lucchesi authored
Signed-off-by: NickLucche <nlucches@redhat.com>
-
Micah Williamson authored
Signed-off-by: Micah Williamson <micah.williamson@amd.com>
-
SungMinCho authored
Signed-off-by: SungMinCho <tjdals4565@gmail.com>
Signed-off-by: Mark McLoughlin <markmc@redhat.com>
Co-authored-by: Mark McLoughlin <markmc@redhat.com>
-
Isotr0py authored
-
- 17 Dec, 2025 9 commits
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
Nicolò Lucchesi authored
Signed-off-by: NickLucche <nlucches@redhat.com>
-
Jialin Ouyang authored
Signed-off-by: Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
Chauncey authored
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Ye (Charlotte) Qi authored
Signed-off-by: Ye (Charlotte) Qi <yeq@meta.com>
-
Xinyu Chen authored
Signed-off-by: Xinyu Chen <xinyu1.chen@intel.com>
-
Robin authored
[Bugfix][Frontend] Prevent IndexError in MiniMax M2 tool parser during streaming extraction (#30555)
Signed-off-by: WangErXiao <863579016@qq.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 16 Dec, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-