- 14 Nov, 2025 9 commits
-
-
Srreyansh Sethi authored
Signed-off-by: WorldExplored <srreyansh.sethi@gmail.com>
Signed-off-by: Srreyansh Sethi <107075589+WorldExplored@users.noreply.github.com>
Signed-off-by: vnadathur <glvikramn@gmail.com>
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Co-authored-by: vnadathur <236933696+vnadathur@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: vnadathur <glvikramn@gmail.com>
Co-authored-by: wang.yuqi <noooop@126.com>
Co-authored-by: wang.yuqi <yuqi.wang@daocloud.io>
-
lyn610 authored
Add tracking and periodic logging for the number of preempted requests in the metrics logger. This helps monitor system behavior under load.
Signed-off-by: Yining Liu <610lyn@gmail.com>
-
Nick Hill authored
Signed-off-by: Nick Hill <nhill@redhat.com>
-
Jiangyun Zhu authored
Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>
-
Yan Ma authored
Signed-off-by: Yan Ma <yan.ma@intel.com>
-
rasmith authored
[Bugfix][CI/Test][Spec Decode] Fix illegal memory access in offline_inference/spec_decode.py (Issue 27619) (#28432)
Signed-off-by: Randall Smith <ransmith@amd.com>
Co-authored-by: Randall Smith <ransmith@amd.com>
Co-authored-by: TJian <tunjian.tan@embeddedllm.com>
-
haoyangli-amd authored
Signed-off-by: Haoyang Li <lihaoyang0109@gmail.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Hank_ authored
Signed-off-by: Hank <hcc.mayday@gmail.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-
- 13 Nov, 2025 31 commits
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Yanan Cao authored
Signed-off-by: Yanan Cao <gmagogsfm@gmail.com>
-
Qiu authored
Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
elvischenv authored
Signed-off-by: elvischenv <219235043+elvischenv@users.noreply.github.com>
Co-authored-by: Nick Hill <nhill@redhat.com>
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
Yannick Schnider authored
Signed-off-by: Yannick Schnider <yannick.schnider1@ibm.com>
Signed-off-by: Yannick Schnider <Yannick.Schnider1@ibm.com>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Roger Wang <hey@rogerw.io>
-
Varun Sundar Rabindranath authored
Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Nick Hill authored
Signed-off-by: Nick Hill <nhill@redhat.com>
-
Jane (Yuan) Xu authored
Signed-off-by: Jane Xu <janeyx@meta.com>
-
Roger Wang authored
Signed-off-by: Roger Wang <hey@rogerw.io>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: bruceszchen <bruceszchen@tencent.com>
-
Yuanping Song authored
Signed-off-by: Yuanping Song <yuanping.song@outlook.com>
-
Huamin Li authored
Signed-off-by: Huamin Li <3ericli@gmail.com>
-
Pleaplusone authored
Signed-off-by: ganyi <ygan@amd.com>
-
zofia authored
Signed-off-by: Zhu, Zufang <zufang.zhu@intel.com>
-
baonudesifeizhai authored
Signed-off-by: baonudesifeizhai <baonudesifeizhai@gmail.com>
-
Zijing Liu authored
Signed-off-by: Zijing Liu <liuzijing2014@gmail.com>
-
Yong Hoon Shin authored
Signed-off-by: Yong Hoon Shin <yhshin@meta.com>
-
Chauncey authored
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
-
Jiangyun Zhu authored
Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>
-
tjandy98 authored
Signed-off-by: tjandy98 <3953059+tjandy98@users.noreply.github.com>
-
Pleaplusone authored
Signed-off-by: ganyi <ygan@amd.com>
-
Lucia Fang authored
Support DeepEP for Kimi-k2-thinking through enabling gemm selection for compressed-tensor marlin wna16 (#28574)
Signed-off-by: Lu Fang <fanglu@fb.com>
-
Fanli Lin authored
Signed-off-by: Fanli Lin <fanli.lin@intel.com>
-
Pleaplusone authored
Signed-off-by: ganyi <ygan@amd.com>
-
Andrew Xia authored
[Frontend][responsesAPI][1/n] convert responses API tool input to chat completions tool format (#28231)
Signed-off-by: Andrew Xia <axia@fb.com>
Co-authored-by: Andrew Xia <axia@fb.com>
Co-authored-by: Chauncey <chaunceyjiang@gmail.com>
-
Andrew Xia authored
Signed-off-by: Andrew Xia <axia@fb.com>
Co-authored-by: Andrew Xia <axia@fb.com>
-
Jialin Ouyang authored
Signed-off-by: Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Nick Hill authored
Signed-off-by: Nick Hill <nhill@redhat.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-