- 19 Sep, 2025 37 commits
-
-
nvjullin authored
Signed-off-by: Julien Lin <jullin@nvidia.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
-
Varun Sundar Rabindranath authored
Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
-
qizixi authored
Signed-off-by: zixi-qi <qizixi@meta.com>
-
Or Ozeri authored
Signed-off-by: Or Ozeri <oro@il.ibm.com>
-
Lucia Fang authored
Signed-off-by: Lu Fang <fanglu@fb.com>
-
samzong authored
[Docs] add __init__.py to vllm/model_executor/layers/quantization/compressed_tensors/transform (#24974)
Signed-off-by: samzong <samzong.lu@gmail.com>
-
Jialin Ouyang authored
Signed-off-by: Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
Chauncey authored
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
LJH-LBJ authored
Signed-off-by: Junhong <liujunhong11@huawei.com>
Co-authored-by: Junhong <liujunhong11@huawei.com>
Co-authored-by: Roger Wang <hey@rogerw.io>
-
Jee Jee Li authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Or Ozeri authored
Signed-off-by: Or Ozeri <oro@il.ibm.com>
-
samzong authored
Signed-off-by: samzong <samzong.lu@gmail.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Icey authored
Signed-off-by: Icey <1790571317@qq.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Nicolò Lucchesi authored
Signed-off-by: NickLucche <nlucches@redhat.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Roger Wang authored
Signed-off-by: Roger Wang <hey@rogerw.io>
-
Isotr0py authored
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Yan Ma authored
Signed-off-by: Yan Ma <yan.ma@intel.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Isotr0py authored
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Li, Jiang authored
[Bugfix][CPU] Add placeholder to avoid import errors when using fused_moe ops on platforms without triton (#25137)
Signed-off-by: jiang1.li <jiang1.li@intel.com>
-
Li, Jiang authored
Signed-off-by: jiang1.li <jiang1.li@intel.com>
-
Russell Bryant authored
Signed-off-by: Russell Bryant <rbryant@redhat.com>
-
Chendi.Xue authored
Signed-off-by: Chendi Xue <Chendi.Xue@intel.com>
-
Michael Yao authored
Signed-off-by: windsonsea <haifeng.yao@daocloud.io>
-
Roger Wang authored
Signed-off-by: Roger Wang <hey@rogerw.io>
-
Chen Ding authored
Signed-off-by: Chen Ding <candy.dc@alibaba-inc.com>
-
Andrew Xia authored
[gpt-oss] Add ResponseReasoningPartAddedEvent, ResponseReasoningPartDoneEvent for streaming (#24938)
Signed-off-by: Andrew Xia <axia@meta.com>
-
Or Ozeri authored
Signed-off-by: Or Ozeri <oro@il.ibm.com>
-
Andrew Sansom authored
Signed-off-by: Andrew Sansom <andrew@protopia.ai>
Signed-off-by: Andrew Sansom <qthequartermasterman@gmail.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 18 Sep, 2025 3 commits
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
Or Ozeri authored
Signed-off-by: Or Ozeri <oro@il.ibm.com>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-