- 04 Jun, 2025 3 commits
-
-
Li, Jiang authored
-
Yan Ru Pei authored
-
Chen Zhang authored
[Bugfix] Max concurrency estimation and check_enough_kv_cache_memory for models with sliding window layers (#19029)
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
- 03 Jun, 2025 22 commits
-
-
Nicolò Lucchesi authored
Signed-off-by: nicklucche <nlucches@redhat.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Chauncey authored
[Bugfix]: Fix the incompatibility issue with tool_choice 'required' when Thinking is enabled (#19075)
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
Ekagra Ranjan authored
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Yikun Jiang <yikun@apache.org>
-
Yong Hoon Shin authored
Signed-off-by: Yong Hoon Shin <yhshin@meta.com>
-
Varun Sundar Rabindranath authored
Signed-off-by: Varun <vsundarr@redhat.com>
Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Simon Mo authored
Signed-off-by: simon-mo <simon.mo@hey.com>
-
CYJiang authored
Signed-off-by: googs1025 <googs1025@gmail.com>
-
Nicolò Lucchesi authored
Signed-off-by: nicklucche <nlucches@redhat.com>
-
Raushan Turganbay authored
Signed-off-by: raushan <raushan@huggingface.co>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Isotr0py authored
Signed-off-by: Isotr0py <2037008807@qq.com>
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Reid authored
Signed-off-by: reidliu41 <reid201711@gmail.com>
Co-authored-by: reidliu41 <reid201711@gmail.com>
-
汪志鹏 authored
Signed-off-by: 汪志鹏 <wangzhipeng628@gmail.com>
-
Rui Qiao authored
-
Li, Jiang authored
Signed-off-by: jiang.li <jiang1.li@intel.com>
-
Tyler Michael Smith authored
Signed-off-by: Tyler Michael Smith <tysmith@redhat.com>
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
-
Siyuan Liu authored
Signed-off-by: Siyuan Liu <lsiyuan@google.com>
Co-authored-by: Hossein Sarshar <hossein.sarshar@gmail.com>
Co-authored-by: Chengji Yao <chengjiyao@google.com>
-
- 02 Jun, 2025 5 commits
-
-
Gregory Shtrasberg authored
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
Frαnçois authored
Signed-off-by: François Paupier <francois.paupier@gmail.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
jennyyyyzhen authored
Signed-off-by: yzhen <yzhen@devgpu093.cco2.facebook.com>
Co-authored-by: yZhen <yZhen@fb.com>
Co-authored-by: yzhen <yzhen@devgpu093.cco2.facebook.com>
-
22quinn authored
Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
-
Robert Shaw authored
Signed-off-by: rshaw@neuralmagic.com <robertgshaw2@gmail.com>
-
- 01 Jun, 2025 7 commits
-
-
zhrrr authored
[Misc] reuse num_tokens_across_dp of get_dp_padding to avoid unnecessary dp all reduce in set_forward_context (#18935)
Signed-off-by: Tyler Michael Smith <tysmith@redhat.com>
Co-authored-by: zhuhaoran <zhuhaoran.zhr@alibaba-inc.com>
Co-authored-by: Tyler Michael Smith <tysmith@redhat.com>
-
Nick Hill authored
Signed-off-by: Nick Hill <nhill@redhat.com>
-
rongfu.leng authored
Signed-off-by: rongfu.leng <rongfu.leng@daocloud.io>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Isotr0py authored
[LoRA] Support dynamically initialize `packed_modules_mapping` for VLM with arbitrary components (#18987)
Signed-off-by: isotr0py <2037008807@qq.com>
Signed-off-by: Isotr0py <2037008807@qq.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Benjamin Chislett authored
Signed-off-by: Benjamin Chislett <benjamin.chislett@centml.ai>
-
- 31 May, 2025 3 commits
-
-
Ekagra Ranjan authored
-
Reid authored
Signed-off-by: reidliu41 <reid201711@gmail.com>
Co-authored-by: reidliu41 <reid201711@gmail.com>
-
Nick Hill authored
Signed-off-by: Nick Hill <nhill@redhat.com>
Co-authored-by: Yizhou Liu <liu_yizhou@outlook.com>
-