- 12 Oct, 2025 2 commits
-
-
Mahmoud Ashraf authored
[bugfix]: use correct causality condition for flashattention, flashinfer, and triton backends (#10172)
-
Liangsheng Yin authored
Co-authored-by: Lianmin Zheng <15100009+merrymercy@users.noreply.github.com>
Co-authored-by: Hanming Lu <69857889+hanming-lu@users.noreply.github.com>
-
- 08 Oct, 2025 1 commit
-
-
Netanel Haber authored
Signed-off-by: Netanel Haber <nhaber@nvidia.com>
-
- 01 Oct, 2025 1 commit
-
-
Liangsheng Yin authored
-
- 22 Sep, 2025 1 commit
-
-
Ethan (Yusheng) Su authored
-
- 15 Sep, 2025 1 commit
-
-
Ke Bao authored
-
- 13 Sep, 2025 1 commit
-
-
Yi Zhang authored
-
- 23 Aug, 2025 1 commit
-
-
Lianmin Zheng authored
-
- 20 Aug, 2025 1 commit
-
-
Ke Bao authored
-
- 17 Aug, 2025 1 commit
-
-
Ke Bao authored
-
- 13 Aug, 2025 1 commit
-
-
Lianmin Zheng authored
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 06 Aug, 2025 1 commit
-
-
Ke Bao authored
-
- 05 Aug, 2025 1 commit
-
-
Ying Sheng authored
-
- 17 Jun, 2025 1 commit
-
-
u4lr451 authored
Co-authored-by: austindeng <austindeng@tencent.com>
Co-authored-by: tianqilin.99 <tianqilin.99@bytedance.com>
Co-authored-by: Qiaolin Yu <liin1211@outlook.com>
Co-authored-by: ch-wan <cwan39@gatech.edu>
-
- 16 Jun, 2025 1 commit
-
-
Lianmin Zheng authored
-
- 15 Jun, 2025 3 commits
-
-
Lianmin Zheng authored
-
Lianmin Zheng authored
-
Lianmin Zheng authored
-
- 13 Jun, 2025 1 commit
-
-
Lianmin Zheng authored
-
- 30 May, 2025 1 commit
-
-
Jianan Ji authored
-
- 29 May, 2025 1 commit
-
-
Ke Bao authored
-
- 20 May, 2025 1 commit
-
-
JieXin Liang authored
-
- 12 May, 2025 2 commits
-
-
Lianmin Zheng authored
-
applesaucethebun authored
Co-authored-by: Brayden Zhong <b8zhong@uwaterloo.ca>
-
- 17 Apr, 2025 1 commit
-
-
woodx authored
-
- 20 Mar, 2025 1 commit
-
-
JieXin Liang authored
-
- 19 Mar, 2025 1 commit
-
-
JieXin Liang authored
-
- 17 Mar, 2025 1 commit
-
-
Lianmin Zheng authored
-
- 08 Mar, 2025 1 commit
-
-
Lianmin Zheng authored
-
- 06 Mar, 2025 1 commit
-
-
Lianmin Zheng authored
Co-authored-by: Sehoon Kim <kssteven418@gmail.com>
Co-authored-by: SangBin Cho <rkooo567@gmail.com>
Co-authored-by: Sehoon Kim <sehoon@x.ai>
-
- 05 Mar, 2025 2 commits
-
-
Baizhou Zhang authored
-
Ying Sheng authored
Co-authored-by: Ke Bao <ISPObaoke@163.com>
-
- 03 Mar, 2025 4 commits
-
-
Lianmin Zheng authored
-
Lianmin Zheng authored
Support penalty in overlap mode; return logprob with chunked prefill; improve benchmark scripts (#3988)
Co-authored-by: SangBin Cho <rkooo567@gmail.com>
Co-authored-by: dhou-xai <dhou@x.ai>
Co-authored-by: Hanming Lu <hanming_lu@berkeley.edu>
-
Lianmin Zheng authored
-
Baizhou Zhang authored
-
- 18 Feb, 2025 1 commit
-
-
Ke Bao authored
-
- 11 Feb, 2025 1 commit
-
-
Ke Bao authored
-
- 10 Feb, 2025 1 commit
-
-
Ke Bao authored
-
- 05 Feb, 2025 1 commit
-
-
Ke Bao authored
-