- 21 Sep, 2025 2 commits
-
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 20 Sep, 2025 7 commits
-
-
Isotr0py authored
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Manoel Marques authored
Signed-off-by: Manoel Marques <manoel.marques@ibm.com>
Signed-off-by: Manoel Marques <manoelmrqs@gmail.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Roger Wang authored
Signed-off-by: Roger Wang <hey@rogerw.io>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
JartX authored
[BUGFIX] GPTQ quantization compatibility for Qwen3 Next MOE models (AutoGPTQ and AutoRound-GPTQ) (#25268)
Signed-off-by: JartX <sagformas@epdcenter.es>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 19 Sep, 2025 17 commits
-
-
Zhiyu authored
Signed-off-by: Zhiyu Cheng <zhiyuc@nvidia.com>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-
David-Wen authored
Signed-off-by: David-Wen <18927700430@163.com>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
-
Varun Sundar Rabindranath authored
Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
-
samzong authored
[Docs] add __init__.py to vllm/model_executor/layers/quantization/compressed_tensors/transform (#24974)
Signed-off-by: samzong <samzong.lu@gmail.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
LJH-LBJ authored
Signed-off-by: Junhong <liujunhong11@huawei.com>
Co-authored-by: Junhong <liujunhong11@huawei.com>
Co-authored-by: Roger Wang <hey@rogerw.io>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Icey authored
Signed-off-by: Icey <1790571317@qq.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Roger Wang authored
Signed-off-by: Roger Wang <hey@rogerw.io>
-
Isotr0py authored
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Li, Jiang authored
[Bugfix][CPU] Add placeholder to avoid import errors when using fused_moe ops on platforms without triton (#25137)
Signed-off-by: jiang1.li <jiang1.li@intel.com>
-
Li, Jiang authored
Signed-off-by: jiang1.li <jiang1.li@intel.com>
-
Chendi.Xue authored
Signed-off-by: Chendi Xue <Chendi.Xue@intel.com>
-
Chen Ding authored
Signed-off-by: Chen Ding <candy.dc@alibaba-inc.com>
-
- 18 Sep, 2025 14 commits
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
Aziz authored
Signed-off-by: AzizCode92 <azizbenothman76@gmail.com>
Signed-off-by: Aziz <azizbenothman76@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
-
Nikhil Gupta authored
Signed-off-by: Nikhil Gupta <nikhil.gupta2@arm.com>
-
qizixi authored
Signed-off-by: zixi-qi <qizixi@meta.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
-
wang.yuqi authored
Signed-off-by: wang.yuqi <noooop@126.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Vadim Gimpelson authored
Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>
-
Michael Goin authored
-
Asaf Joseph Gardin authored
Signed-off-by: asafg <39553475+Josephasafg@users.noreply.github.com>
-
Jee Jee Li authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
-
Aaron Pham authored
Signed-off-by: Aaron Pham <contact@aarnphm.xyz>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Punitvara authored
Signed-off-by: Punit Vara <punitvara@gmail.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Tao He authored
Signed-off-by: Tao He <linzhu.ht@alibaba-inc.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
YiwenC authored
-