- 18 Nov, 2024 2 commits
-
-
Ke Bao authored
-
Lianmin Zheng authored
-
- 17 Nov, 2024 1 commit
-
-
Lianmin Zheng authored
Co-authored-by:Haotian Liu <6631389+haotian-liu@users.noreply.github.com>
-
- 16 Nov, 2024 2 commits
- 15 Nov, 2024 1 commit
-
-
Lianmin Zheng authored
[Fix] Adjust default chunked prefill size and cuda graph max bs according to GPU memory capacity (#2044)
-
- 14 Nov, 2024 1 commit
-
-
Patrick Yi authored
-
- 13 Nov, 2024 1 commit
-
-
Lianmin Zheng authored
-
- 08 Nov, 2024 1 commit
-
-
Lianmin Zheng authored
-
- 07 Nov, 2024 1 commit
-
-
Chayenne authored
-
- 06 Nov, 2024 1 commit
-
-
Lzhang-hub authored
Co-authored-by:
Lianmin Zheng <lianminzheng@gmail.com> Co-authored-by:
Byron Hsu <byronhsu1230@gmail.com>
-
- 31 Oct, 2024 1 commit
-
-
Byron Hsu authored
-
- 27 Oct, 2024 1 commit
-
-
Lianmin Zheng authored
-
- 26 Oct, 2024 3 commits
-
-
Lianmin Zheng authored
-
Lianmin Zheng authored
-
Liangsheng Yin authored
-
- 25 Oct, 2024 1 commit
-
-
DarkSharpness authored
-
- 21 Oct, 2024 1 commit
-
-
Lianmin Zheng authored
-
- 19 Oct, 2024 1 commit
-
-
Lianmin Zheng authored
-
- 16 Oct, 2024 2 commits
-
-
havetc authored
-
Lianmin Zheng authored
-
- 14 Oct, 2024 1 commit
-
-
Shuo Yang authored
-
- 13 Oct, 2024 2 commits
-
-
Lianmin Zheng authored
-
Lianmin Zheng authored
-
- 12 Oct, 2024 1 commit
-
-
Zhang, Liangang authored
-
- 11 Oct, 2024 3 commits
-
-
Lianmin Zheng authored
-
glen-amd authored
-
Zhang, Liangang authored
-
- 07 Oct, 2024 2 commits
-
-
Jani Monoses authored
-
Lianmin Zheng authored
Co-authored-by:Zhang Liangang <liangang.zhang@intel.com>
-
- 04 Oct, 2024 2 commits
-
-
Lianmin Zheng authored
-
Lianmin Zheng authored
-
- 30 Sep, 2024 1 commit
-
-
Ying Sheng authored
-
- 29 Sep, 2024 2 commits
-
-
Xinyu Yang authored
-
Lianmin Zheng authored
-
- 18 Sep, 2024 1 commit
-
-
Lianmin Zheng authored
-
- 17 Sep, 2024 4 commits