- 05 Nov, 2025 22 commits
-
Baizhou Zhang authored
-
Kangyan-Zhou authored
-
bigmoyan authored
Signed-off-by: wangzhengtao <wangzhengtao@msh.team>
-
Yuxuan Zhang authored
-
Yuhong Guo authored
-
yinghui authored
-
zejunchen-zejun authored
Signed-off-by: zejunchen-zejun <zejun.chen@amd.com>
Co-authored-by: HAI <hixiao@gmail.com>
-
yinghui authored
-
Hubert Lu authored
-
Chang Su authored
-
Glen Liu authored
[Feature] add --lora-request-distribution arg to bench_serving.py and support skewed and distinct workloads (#12175)
-
Kangyan-Zhou authored
-
Yingchun Lai authored
-
ai-easy-cpu authored
Co-authored-by: AI-bot-easy <litchys0123@outlook.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Liangsheng Yin authored
-
Kaixi Hou authored
-
sglang-bot authored
-
Nicolas Castet authored
-
Chang Su authored
-
soaringk authored
-
alisonshao authored
-
Baizhou Zhang authored
-
- 04 Nov, 2025 18 commits
-
Chang Su authored
-
Kaixi Hou authored
-
Johnsonms authored
-
Lianmin Zheng authored
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: SangBin Cho <rkooo567@gmail.com>
-
Baizhou Zhang authored
-
Trevor Morris authored
-
Liangsheng Yin authored
-
Liangsheng Yin authored
-
Liangsheng Yin authored
-
Ke Bao authored
-
Yuan Luo authored
Co-authored-by: luoyuan.luo <luoyuan.luo@antgroup.com>
-
Chang Su authored
[router][grpc] Fix model validation, tool call check, streaming logic and misc in responses (#12616)
-
fzyzcjy authored
-
fzyzcjy authored
-
fzyzcjy authored
-
Liangsheng Yin authored
-
Shangming Cai authored
Signed-off-by: Shangming Cai <csmthu@gmail.com>
-
Minglei Zhu authored
-