- 04 Jun, 2025 3 commits
-
-
Li, Jiang authored
-
Yan Ru Pei authored
-
Chen Zhang authored
[Bugfix] Max concurrency estimation and check_enough_kv_cache_memory for models with sliding window layers (#19029) Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
- 03 Jun, 2025 13 commits
-
-
Chauncey authored
[Bugfix]: Fix the incompatibility issue with tool_choice 'required' when Thinking is enabled (#19075) Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Harry Mellor authored
Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Yikun Jiang <yikun@apache.org>
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun <vsundarr@redhat.com> Co-authored-by:
Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Simon Mo authored
Signed-off-by:simon-mo <simon.mo@hey.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
汪志鹏 authored
Signed-off-by:汪志鹏 <wangzhipeng628@gmail.com>
-
Rui Qiao authored
-
Siyuan Liu authored
Signed-off-by:
Siyuan Liu <lsiyuan@google.com> Co-authored-by:
Hossein Sarshar <hossein.sarshar@gmail.com> Co-authored-by:
Chengji Yao <chengjiyao@google.com>
-
- 02 Jun, 2025 1 commit
-
-
22quinn authored
Signed-off-by:22quinn <33176974+22quinn@users.noreply.github.com>
-
- 01 Jun, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 31 May, 2025 4 commits
-
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
Charlie Fu authored
Signed-off-by:charlifu <charlifu@amd.com>
-
vllmellm authored
Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
Pooya Davoodi authored
Signed-off-by:Pooya Davoodi <pooya.davoodi@parasail.io>
-
- 30 May, 2025 6 commits
-
-
Will Eaton authored
Signed-off-by:Will Eaton <weaton@redhat.com>
-
Isotr0py authored
Signed-off-by:
isotr0py <2037008807@qq.com> Signed-off-by:
Isotr0py <2037008807@qq.com>
-
Nick Hill authored
-
Shawn Huang authored
Signed-off-by:
huangyuxiang03 <huangyx0321@gmail.com> Co-authored-by:
huangyuxiang03 <huangyx0321@gmail.com>
-
Carol Zheng authored
Signed-off-by:Carol Zheng <cazheng@google.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 29 May, 2025 7 commits
-
-
Chengji Yao authored
Signed-off-by:Chengji Yao <chengjiyao@google.com>
-
Nick Hill authored
Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Will Eaton <weaton@redhat.com>
-
Nicolò Lucchesi authored
Signed-off-by:nicklucche <nlucches@redhat.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Isotr0py authored
Signed-off-by:Isotr0py <2037008807@qq.com>
-
Satyajith Chilappagari authored
Signed-off-by:Satyajith Chilappagari <satchill@amazon.com>
-
Richard Zou authored
Signed-off-by:rzou <zou3519@gmail.com>
-
- 28 May, 2025 5 commits
-
-
Hongxia Yang authored
[Bugfix][ROCm] fix the power of 2 exception from triton_unified_attention.py when running llama4 models and unit test fix (#18100) Signed-off-by:
Hongxia Yang <hongxia.yang@amd.com> Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> Co-authored-by:
tjtanaa <tunjian.tan@embeddedllm.com>
-
Akshat Tripathi authored
Signed-off-by:
Akshat Tripathi <akshat@krai.ai> Signed-off-by:
Chengji Yao <chengjiyao@google.com> Signed-off-by:
xihajun <junfan@krai.ai> Signed-off-by:
Jorge de Freitas <jorge.de-freitas22@imperial.ac.uk> Signed-off-by:
Jorge de Freitas <jorge@krai.ai> Co-authored-by:
Chengji Yao <chengjiyao@google.com> Co-authored-by:
xihajun <junfan@krai.ai> Co-authored-by:
Jorge de Freitas <jorge.de-freitas22@imperial.ac.uk> Co-authored-by:
Jorge de Freitas <jorge@krai.ai>
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
Alex Brooks authored
Signed-off-by:Alex-Brooks <Alex.Brooks@ibm.com>
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-