- 19 Sep, 2025 12 commits
-
-
samzong authored
[Docs] add __init__.py to vllm/model_executor/layers/quantization/compressed_tensors/transform (#24974) Signed-off-by:samzong <samzong.lu@gmail.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
LJH-LBJ authored
Signed-off-by:
Junhong <liujunhong11@huawei.com> Co-authored-by:
Junhong <liujunhong11@huawei.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Icey authored
Signed-off-by:Icey <1790571317@qq.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Roger Wang authored
Signed-off-by:Roger Wang <hey@rogerw.io>
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Li, Jiang authored
[Bugfix][CPU] Add placeholder to avoid import errors when using fused_moe ops on platforms without triton (#25137) Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
Chendi.Xue authored
Signed-off-by:Chendi Xue <Chendi.Xue@intel.com>
-
Chen Ding authored
Signed-off-by:Chen Ding <candy.dc@alibaba-inc.com>
-
- 18 Sep, 2025 18 commits
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Aziz authored
Signed-off-by:
AzizCode92 <azizbenothman76@gmail.com> Signed-off-by:
Aziz <azizbenothman76@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Nikhil Gupta authored
Signed-off-by:Nikhil Gupta <nikhil.gupta2@arm.com>
-
qizixi authored
Signed-off-by:
zixi-qi <qizixi@meta.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
wang.yuqi authored
Signed-off-by:
wang.yuqi <noooop@126.com> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
Vadim Gimpelson authored
Signed-off-by:Vadim Gimpelson <vadim.gimpelson@gmail.com>
-
Michael Goin authored
-
Asaf Joseph Gardin authored
Signed-off-by:asafg <39553475+Josephasafg@users.noreply.github.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Aaron Pham authored
Signed-off-by:
Aaron Pham <contact@aarnphm.xyz> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Punitvara authored
Signed-off-by:
Punit Vara <punitvara@gmail.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Tao He authored
Signed-off-by:Tao He <linzhu.ht@alibaba-inc.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
YiwenC authored
-
YiwenC authored
Signed-off-by:
Yiwen Chen <yiwen66@berkeley.edu> Signed-off-by:
YiwenC <54658925+666even666@users.noreply.github.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-
toncao authored
[Bugfix][Qwen3-Next] add prefixes to shared_expert in qwen3-next and mlp in qwen2moe to successfully load ignored params in quantized models (#24960) Signed-off-by:
toncao <cpatonn@gmail.com> Co-authored-by:
toncao <cpatonn@gmail.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com>
-
Roger Wang authored
Signed-off-by:
Roger Wang <hey@rogerw.io> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Huang Jie <92386084+JJJYmmm@users.noreply.github.com> Co-authored-by:
松灵 <26085463+wulipc@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 17 Sep, 2025 9 commits
-
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-
Tao He authored
Signed-off-by:Tao He <linzhu.ht@alibaba-inc.com>
-
danielafrimi authored
Signed-off-by:
Daniel Afrimi <danielafrimi8@gmail.com> Co-authored-by:
root <root@cw-dfw-h100-001-305-026.cm.cluster>
-
whx authored
Signed-off-by:whx-sjtu <2952154980@qq.com>
-
whx authored
Signed-off-by:whx-sjtu <2952154980@qq.com>
-
rouchenzi authored
Signed-off-by:
rouchenzi <ruochenwen@gmail.com> Signed-off-by:
rouchenzi <40842833+rouchenzi@users.noreply.github.com> Co-authored-by:
Bowen Wang <abmfy@icloud.com>
-
haoyangli-amd authored
Signed-off-by:
Haoyang Li <lihaoyang0109@gmail.com> Co-authored-by:
Haoyang Li <haoyang.li@amd.com>
-
Roger Wang authored
Signed-off-by:
Roger Wang <hey@rogerw.io> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Huang Jie <92386084+JJJYmmm@users.noreply.github.com> Co-authored-by:
松灵 <26085463+wulipc@users.noreply.github.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
Tahsin Tunan authored
Signed-off-by:
Tahsin Tunan <tahsintunan@gmail.com> Signed-off-by:
Luka Govedič <lgovedic@redhat.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Luka Govedič <lgovedic@redhat.com>
-
- 16 Sep, 2025 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni001@gmail.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-