- 23 Apr, 2026 1 commit
-
-
yaoht authored
-
- 21 Apr, 2026 1 commit
-
-
yaoht authored
-
- 16 Mar, 2026 5 commits
-
-
thatPepe authored
【比赛2025秋】T2-1-4 qwen3vl
-
PanZezhong authored
-
hejianlin authored
Co-authored-by:PanZezhong <panzezhong@qiyuanlab.com>
-
thatPepe authored
issue/265 perf(llm): replace O(n²) full-sequence detokenize with incr…
-
MaYuhang authored
-
- 11 Mar, 2026 4 commits
- 10 Mar, 2026 1 commit
-
-
thatPepe authored
issue/244 feat(llm): add prefix cache reuse for static KV cache
-
- 09 Mar, 2026 5 commits
- 06 Mar, 2026 5 commits
-
-
thatPepe authored
issue/233 fix: improve request lifecycle management and timeout handling
-
thatPepe authored
Issue/248 support flash-attention
-
wooway777 authored
-
wooway777 authored
-
PanZezhong authored
-
- 05 Mar, 2026 6 commits
-
-
wooway777 authored
-
PanZezhong authored
-
PanZezhong authored
-
PanZezhong authored
-
PanZezhong authored
-
PanZezhong1725 authored
issue/246 change default kvcache blocksize to 256
-
- 04 Mar, 2026 1 commit
-
-
thatPepe authored
issue/251 - change nt config to op config for kv caching
-
- 03 Mar, 2026 1 commit
-
-
wooway777 authored
-
- 02 Mar, 2026 2 commits
-
-
qinyiqun authored
-
gongchensu authored
issue/241 fix mmlu test, add vllm support
-
- 27 Feb, 2026 1 commit
-
-
PanZezhong authored
-
- 26 Feb, 2026 1 commit
-
-
PanZezhong authored
-
- 25 Feb, 2026 1 commit
-
-
PanZezhong authored
-
- 24 Feb, 2026 2 commits
- 13 Feb, 2026 3 commits