- 11 Feb, 2026 1 commit
-
-
bnellnm authored
[MoE Refactor] Introduce MoERunner abstraction and move execution logic from FusedMoE to DefaultMoERunner (#32344) Signed-off-by:Bill Nell <bnell@redhat.com>
-
- 10 Feb, 2026 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 09 Feb, 2026 4 commits
-
-
Michael Goin authored
-
JJJYmmm authored
Signed-off-by:
JJJYmmm <1650675829@qq.com> Signed-off-by:
JJJYmmm <92386084+JJJYmmm@users.noreply.github.com> Signed-off-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
wulipc <wulipc@users.noreply.github.com> Co-authored-by:
ywang96 <ywang96@users.noreply.github.com> Co-authored-by:
Isotr0py <Isotr0py@users.noreply.github.com> Co-authored-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
Ekagra Ranjan authored
Signed-off-by:
Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com> Co-authored-by:
Nicolò Lucchesi <nicolo.lucchesi@gmail.com>
-
wang.yuqi authored
Signed-off-by:wang.yuqi <yuqi.wang@daocloud.io>
-
- 08 Feb, 2026 1 commit
-
-
danisereb authored
Signed-off-by:Daniel Serebrenik <daserebrenik@nvidia.com>
-
- 07 Feb, 2026 2 commits
-
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
果冻虾仁 authored
-
- 06 Feb, 2026 6 commits
-
-
vllmellm authored
Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
TJian <tunjian.tan@embeddedllm.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Michael Goin authored
It seems users can be confused about vLLM's performance when running with very small amounts of CPU cores available. We are missing a clear overview of what vLLM's process architecture is, so I added this along with some diagrams in arch_overview.md, and included a section on CPU resource recommendations in optimization.md Signed-off-by:mgoin <mgoin64@gmail.com>
-
Raushan Turganbay authored
Signed-off-by:
raushan <raushan@huggingface.co> Signed-off-by:
Raushan Turganbay <raushan.turganbay@alumni.nu.edu.kz> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
SorenDreano authored
Co-authored-by:
Soren Dreano <soren@numind.ai> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
chengchengpei authored
Signed-off-by:
Chengcheng Pei <chengchengpei@outlook.com> Signed-off-by:
chengchengpei <5881383+chengchengpei@users.noreply.github.com> Co-authored-by:
chengchengpei <5881383+chengchengpei@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Simon Mo authored
Signed-off-by:simon-mo <simon.mo@hey.com>
-
- 05 Feb, 2026 4 commits
-
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
liranschour authored
Signed-off-by:
Liran Schour <lirans@il.ibm.com> Signed-off-by:
liranschour <liranschour@users.noreply.github.com> Co-authored-by:
Or Ozeri <or@ozery.com> Co-authored-by:
Nicolò Lucchesi <nicolo.lucchesi@gmail.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com>
-
rinbaro authored
Signed-off-by:rinbaro <ilgomishra@gmail.com>
-
Ilya Boytsov authored
Signed-off-by:
Ilya Boytsov <ilyaboytsov1805@gmail.com> Signed-off-by:
Ilya Boytsov <boytsovpanamera@mail.ru> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
wang.yuqi <yuqi.wang@daocloud.io>
-
- 04 Feb, 2026 3 commits
-
-
Muhammad Hashmi authored
Signed-off-by:
Muhammad Hashmi <mhashmi@berkeley.edu> Signed-off-by:
NickLucche <nlucches@redhat.com> Co-authored-by:
NickLucche <nlucches@redhat.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Frank Wang authored
Signed-off-by:frankwang28 <frank.wbb@hotmail.com>
-
- 03 Feb, 2026 4 commits
-
-
dtc authored
Signed-off-by:
Tianchen Ding <dtcccc@linux.alibaba.com> Co-authored-by:
Nicolò Lucchesi <nicolo.lucchesi@gmail.com>
-
Krish Gupta authored
Signed-off-by:KrxGu <krishom70@gmail.com>
-
zxy authored
Signed-off-by:
zxy <zhou0493@e.ntu.edu.sg> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
Richard Zou authored
Signed-off-by:Richard Zou <zou3519@gmail.com>
-
- 02 Feb, 2026 5 commits
-
-
Komal Kumar Teru authored
Signed-off-by:
kkt-cohere <komal@cohere.com> Signed-off-by:
Komal Kumar Teru <162363718+kkt-cohere@users.noreply.github.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
RED authored
Signed-off-by:
liuli <ll407707@alibaba-inc.com> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
liuli <ll407707@alibaba-inc.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
Sawyer Bowerman authored
Signed-off-by:
Sawyer Bowerman <sbowerma@redhat.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Paco Xu authored
Signed-off-by:Paco Xu <paco.xu@daocloud.io>
-
csy0225 authored
Signed-off-by:
Jee Jee Li <pandaleefree@gmail.com> Co-authored-by:
i-zhangmingming <i-zhangmingming@stepfun.com> Co-authored-by:
xiewuxun <xiewuxun@stepfun.com> Co-authored-by:
zetaohong <i-hongzetao@stepfun.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com>
-
- 01 Feb, 2026 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 31 Jan, 2026 6 commits
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
jma99_2333 authored
Signed-off-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
jennyyyyzhen authored
Signed-off-by:jennyyyyzhen <yzhen@hmc.edu>
-
Patrick von Platen authored
Signed-off-by:Patrick von Platen <patrick.v.platen@gmail.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 30 Jan, 2026 2 commits
-
-
Nathan Weinberg authored
Signed-off-by:Nathan Weinberg <nweinber@redhat.com>
-
vllmellm authored
Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-