- 09 Jun, 2025 1 commit
-
-
Yinghai Lu authored
Signed-off-by:Yinghai Lu <yinghai@thinkingmachines.ai>
-
- 08 Jun, 2025 2 commits
-
-
jennyyyyzhen authored
Signed-off-by:
yZhen <yZhen@fb.com> Co-authored-by:
yZhen <yZhen@fb.com>
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
- 07 Jun, 2025 2 commits
-
-
Luka Govedič authored
Signed-off-by:luka <luka@neuralmagic.com>
-
Driss Guessous authored
Signed-off-by:drisspg <drisspguessous@gmail.com>
-
- 06 Jun, 2025 12 commits
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Nicolò Lucchesi authored
Signed-off-by:
nicklucche <nlucches@redhat.com> Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
Adolfo Victoria authored
Co-authored-by:Adolfo Victoria <adovi@meta.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Yu Guo authored
-
jmswen authored
Signed-off-by:Jon Swenson <jmswen@gmail.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Chengji Yao authored
Signed-off-by:Chengji Yao <chengjiyao@google.com>
-
Jinghui Zhang authored
Co-authored-by:jinghui <jinghui@fb.com>
-
Chengji Yao authored
Signed-off-by:Chengji Yao <chengjiyao@google.com>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Benjamin Chislett authored
Signed-off-by:Benjamin Chislett <benjamin.chislett@centml.ai>
-
- 05 Jun, 2025 3 commits
-
-
vllmellm authored
Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Robert Shaw authored
Signed-off-by:
rshaw@neuralmagic.com <robertgshaw2@gmail.com> Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
- 04 Jun, 2025 10 commits
-
-
Nicolò Lucchesi authored
Signed-off-by:nicklucche <nlucches@redhat.com>
-
CYJiang authored
Signed-off-by:googs1025 <googs1025@gmail.com>
-
jmswen authored
Signed-off-by:Jon Swenson <jmswen@gmail.com>
-
Seiji Eicher authored
Signed-off-by:Seiji Eicher <seiji@anyscale.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Kaixi Hou authored
-
Lukas Geiger authored
Signed-off-by:Lukas Geiger <lukas.geiger94@gmail.com>
-
Li, Jiang authored
-
Yan Ru Pei authored
-
Chen Zhang authored
[Bugfix] Max concurrency estimation and check_enough_kv_cache_memory for models with sliding window layers (#19029) Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
- 03 Jun, 2025 8 commits
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
Simon Mo authored
Signed-off-by:simon-mo <simon.mo@hey.com>
-
Raushan Turganbay authored
Signed-off-by:raushan <raushan@huggingface.co>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Rui Qiao authored
-
Siyuan Liu authored
Signed-off-by:
Siyuan Liu <lsiyuan@google.com> Co-authored-by:
Hossein Sarshar <hossein.sarshar@gmail.com> Co-authored-by:
Chengji Yao <chengjiyao@google.com>
-
- 02 Jun, 2025 1 commit
-
-
22quinn authored
Signed-off-by:22quinn <33176974+22quinn@users.noreply.github.com>
-
- 01 Jun, 2025 1 commit
-
-
zhrrr authored
[Misc] reuse num_tokens_across_dp of get_dp_padding to avoid unnecessary dp all reduce in set_forward_context (#18935) Signed-off-by:
Tyler Michael Smith <tysmith@redhat.com> Co-authored-by:
zhuhaoran <zhuhaoran.zhr@alibaba-inc.com> Co-authored-by:
Tyler Michael Smith <tysmith@redhat.com>
-