- 19 Oct, 2025 16 commits
-
-
Liangsheng Yin authored
-
Liangsheng Yin authored
Revert "Fix: Dynamic RoPE Cache Expansion to Prevent Position-ID Out-of-Bounds in EAGLE + Long-Sequence Workloads" (#11827)
-
fzyzcjy authored
-
fzyzcjy authored
Co-authored-by:Yineng Zhang <me@zhyncs.com>
-
fzyzcjy authored
-
fzyzcjy authored
-
fzyzcjy authored
-
narutolhy authored
-
Lianmin Zheng authored
-
YAMY authored
Fix: Dynamic RoPE Cache Expansion to Prevent Position-ID Out-of-Bounds in EAGLE + Long-Sequence Workloads (#10788)
-
Liangsheng Yin authored
-
tazjin authored
-
Marin authored
-
ybyang authored
-
ybyang authored
-
Simo Lin authored
-
- 18 Oct, 2025 24 commits
-
-
kyleliang-nv authored
-
Qiaolin Yu authored
-
b8zhong authored
-
Kindyaa authored
feat(example/fastapi): support --startup-timeout using Qwen3-Next-80B-A3B-Instruct as example (#11710) Co-authored-by:
chenan01 <chenan01@cheche-MacBook-Pro.local> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
b8zhong authored
-
Liangsheng Yin authored
-
Teng Ma authored
-
fzyzcjy authored
-
Lianmin Zheng authored
-
Yuwei An authored
Signed-off-by:Oasis-Git <ayw.sirius19@gmail.com>
-
Minglei Zhu authored
-
Zilin Zhu authored
-
Zilin Zhu authored
-
Qiaolin Yu authored
-
Jimmy authored
Co-authored-by:Shangming Cai <csmthu@gmail.com>
-
fzyzcjy authored
-
Chang Su authored
[router][grpc] Support parallel queue puts in grpc_request_manager and remove mutex for grpc_client (#11798)
-
fzyzcjy authored
-
Cheng Wan authored
-
Minglei Zhu authored
Co-authored-by:Baizhou Zhang <sobereddiezhang@gmail.com>
-
fzyzcjy authored
-
fzyzcjy authored
-
Lianmin Zheng authored
Co-authored-by:
SangBin Cho <rkooo567@gmail.com> Co-authored-by:
Cheng Wan <cwan@x.ai> Co-authored-by:
Cheng Wan <54331508+ch-wan@users.noreply.github.com>
-
Zilin Zhu authored
-