- 23 Aug, 2025 11 commits
-
-
Lianmin Zheng authored
-
hzh0425 authored
-
fzyzcjy authored
-
fzyzcjy authored
-
fzyzcjy authored
-
hlu1 authored
Signed-off-by:Hao Lu <14827759+hlu1@users.noreply.github.com>
-
Chang Su authored
-
Yineng Zhang authored
-
fzyzcjy authored
-
Chanh Nguyen authored
Co-authored-by:
Chanh Nguyen <cnguyen@linkedin.com> Co-authored-by:
Liangsheng Yin <hnyls2002@gmail.com>
-
Moein Khazraee authored
Co-authored-by:Zhiqiang Xie <xiezhq@stanford.edu>
-
- 22 Aug, 2025 17 commits
-
-
sogalin authored
-
Hubert Lu authored
-
datdo-msft authored
-
Wenxuan Tan authored
Co-authored-by:gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
huangtingwei authored
Co-authored-by:Zhiqiang Xie <xiezhq@stanford.edu>
-
pansicheng authored
Co-authored-by:Zhiqiang Xie <xiezhq@stanford.edu>
-
Xuchun Shang authored
Signed-off-by:Xuchun Shang <xuchun.shang@linux.alibaba.com>
-
Qiaolin Yu authored
Co-authored-by:ispobock <ispobaoke@gmail.com>
-
Mick authored
-
Qiaolin Yu authored
Co-authored-by:ispobock <ispobaoke@gmail.com>
-
Elfie Guo authored
-
timmy-feng authored
-
Yongfei Xu authored
Support MHA with chunked prefix cache for flashinfer/flashmla backend, support page size > 1 for MHA chunked prefix (#8616) Co-authored-by:xuyongfei.xyf <xuyongfei.xyf@antgroup.com>
-
Pavani Majety authored
Signed-off-by:Pavani Majety <pmajety@nvidia.com>
-
Xinyuan Tong authored
-
Yineng Zhang authored
-
Yineng Zhang authored
-
- 21 Aug, 2025 12 commits
-
-
Stefan He authored
-
zixuanzhang226 authored
-
Xinyuan Tong authored
Signed-off-by:Xinyuan Tong <xinyuantong.cs@gmail.com>
-
Yineng Zhang authored
-
gongwei-130 authored
-
gongwei-130 authored
-
Hongbo Xu authored
-
hlu1 authored
Signed-off-by:
Hao Lu <14827759+hlu1@users.noreply.github.com> Signed-off-by:
Xinyuan Tong <xinyuantong.cs@gmail.com> Co-authored-by:
Xinyuan Tong <xinyuantong.cs@gmail.com>
-
fzyzcjy authored
-
fzyzcjy authored
-
DiweiSun authored
-
pranavm-nvidia authored
-