- 03 Oct, 2025 1 commit
-
-
fzyzcjy authored
-
- 04 Jul, 2025 1 commit
-
-
Lianmin Zheng authored
Move mem_fraction_static adjustment for multimodal models to `server_args.py` & Fix session control & Other cleanups (#7748)
-
- 11 May, 2025 1 commit
-
-
applesaucethebun authored
Co-authored-by: Brayden Zhong <b8zhong@uwaterloo.ca>
-
- 25 Mar, 2025 1 commit
-
-
fzyzcjy authored
-
- 03 Mar, 2025 1 commit
-
-
Lianmin Zheng authored
Support penalty in overlap mode; return logprob with chunked prefill; improve benchmark scripts (#3988)
Co-authored-by: SangBin Cho <rkooo567@gmail.com>
Co-authored-by: dhou-xai <dhou@x.ai>
Co-authored-by: Hanming Lu <hanming_lu@berkeley.edu>
-
- 28 Jan, 2025 1 commit
-
-
Byron Hsu authored
Co-authored-by: zhyncs <me@zhyncs.com>
-
- 29 Dec, 2024 1 commit
-
-
Ying Sheng authored
-
- 28 Nov, 2024 2 commits
-
-
Ying Sheng authored
-
Lianmin Zheng authored
-
- 27 Nov, 2024 1 commit
-
-
Ying Sheng authored
-
- 25 Nov, 2024 1 commit
-
-
Ying Sheng authored
-