- 19 Sep, 2025 29 commits
-
-
samzong authored
[Docs] add __init__.py to vllm/model_executor/layers/quantization/compressed_tensors/transform (#24974) Signed-off-by:samzong <samzong.lu@gmail.com>
-
Jialin Ouyang authored
Signed-off-by:Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
LJH-LBJ authored
Signed-off-by:
Junhong <liujunhong11@huawei.com> Co-authored-by:
Junhong <liujunhong11@huawei.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
samzong authored
Signed-off-by:samzong <samzong.lu@gmail.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Icey authored
Signed-off-by:Icey <1790571317@qq.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Roger Wang authored
Signed-off-by:Roger Wang <hey@rogerw.io>
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Yan Ma authored
Signed-off-by:
Yan Ma <yan.ma@intel.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Li, Jiang authored
[Bugfix][CPU] Add placeholder to avoid import errors when using fused_moe ops on platforms without triton (#25137) Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
Chendi.Xue authored
Signed-off-by:Chendi Xue <Chendi.Xue@intel.com>
-
Michael Yao authored
Signed-off-by:windsonsea <haifeng.yao@daocloud.io>
-
Roger Wang authored
Signed-off-by:Roger Wang <hey@rogerw.io>
-
Chen Ding authored
Signed-off-by:Chen Ding <candy.dc@alibaba-inc.com>
-
Andrew Xia authored
[gpt-oss] Add ResponseReasoningPartAddedEvent, ResponseReasoningPartDoneEvent for streaming (#24938) Signed-off-by:Andrew Xia <axia@meta.com>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
Andrew Sansom authored
Signed-off-by:
Andrew Sansom <andrew@protopia.ai> Signed-off-by:
Andrew Sansom <qthequartermasterman@gmail.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 18 Sep, 2025 11 commits
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Wentao Ye authored
Signed-off-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
Aziz authored
Signed-off-by:
AzizCode92 <azizbenothman76@gmail.com> Signed-off-by:
Aziz <azizbenothman76@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Nikhil Gupta authored
Signed-off-by:Nikhil Gupta <nikhil.gupta2@arm.com>
-
Woosuk Kwon authored
Signed-off-by:
Woosuk Kwon <woosuk@thinkingmachines.ai> Signed-off-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
Rohan Potdar authored
Signed-off-by:Rohan138 <rohanpotdar138@gmail.com>
-
Gregory Shtrasberg authored
Signed-off-by:Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-