- 17 Jun, 2025 2 commits
-
-
Yijie Zhu authored
Co-authored-by: 刁莹煜 <diaoyingyu1@hisilicon.com>
-
kk authored
Co-authored-by: wunhuang <wunhuang@amd.com>
-
- 08 Jun, 2025 2 commits
-
-
Xiaoyu Zhang authored
-
Yineng Zhang authored
-
- 07 Jun, 2025 1 commit
-
-
Xiaoyu Zhang authored
-
- 06 Jun, 2025 2 commits
-
-
Jianan Ji authored
-
HAI authored
Co-authored-by: wunhuang <wunhuang@amd.com>
Co-authored-by: Hubert Lu <Hubert.Lu@amd.com>
-
- 05 Jun, 2025 1 commit
-
-
Pavani Majety authored
[CUTLASS-FP4-MOE] Introduce CutlassMoEParams class for easy initialization of Cutlass Grouped Gems Metadata (#6887)
Signed-off-by: Pavani Majety <pmajety@nvidia.com>
-
- 04 Jun, 2025 1 commit
-
-
Cheng Wan authored
Set `num_fused_shared_experts` as `num_shared_experts` when shared_experts fusion is not disabled (#6736)
-
- 29 May, 2025 1 commit
-
-
ChangyiYang authored
-
- 28 May, 2025 1 commit
-
-
Baizhou Zhang authored
-
- 16 May, 2025 1 commit
-
-
Elfie Guo authored
-
- 11 May, 2025 1 commit
-
-
applesaucethebun authored
Co-authored-by: Brayden Zhong <b8zhong@uwaterloo.ca>
-
- 08 May, 2025 2 commits
-
-
Baizhou Zhang authored
-
JieXin Liang authored
-
- 28 Apr, 2025 1 commit
-
-
HAI authored
-
- 19 Apr, 2025 1 commit
-
-
Xiaoyu Zhang authored
-
- 16 Apr, 2025 1 commit
-
-
Lianmin Zheng authored
-
- 11 Apr, 2025 1 commit
-
-
HAI authored
-
- 07 Apr, 2025 2 commits
-
-
HAI authored
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
-
Chang Su authored
Co-authored-by: Cheng Wan <cwan39@gatech.edu>
Co-authored-by: fzyzcjy <ch271828n@outlook.com>
Co-authored-by: ispobock <ispobaoke@163.com>
-
- 18 Mar, 2025 2 commits
-
-
Yineng Zhang authored
-
Xiaoyu Zhang authored
-
- 17 Mar, 2025 1 commit
-
-
Xiaoyu Zhang authored
-
- 13 Mar, 2025 2 commits
-
-
Lianmin Zheng authored
-
Meng, Hengyu authored
Co-authored-by: Zhang, Liangang <liangang.zhang@intel.com>
-
- 12 Mar, 2025 1 commit
-
-
Yineng Zhang authored
-
- 10 Mar, 2025 1 commit
-
-
Lianmin Zheng authored
-
- 09 Mar, 2025 1 commit
-
-
HandH1998 authored
-
- 06 Mar, 2025 1 commit
-
-
HAI authored
-
- 04 Mar, 2025 1 commit
-
-
HAI authored
-
- 03 Mar, 2025 1 commit
-
-
Lianmin Zheng authored
Support penalty in overlap mode; return logprob with chunked prefill; improve benchmark scripts (#3988)
Co-authored-by: SangBin Cho <rkooo567@gmail.com>
Co-authored-by: dhou-xai <dhou@x.ai>
Co-authored-by: Hanming Lu <hanming_lu@berkeley.edu>
-
- 21 Feb, 2025 1 commit
-
-
HAI authored
-
- 31 Jan, 2025 1 commit
-
-
Ke Bao authored
-
- 27 Jan, 2025 1 commit
-
-
Lianmin Zheng authored
-
- 17 Jan, 2025 1 commit
-
-
Yineng Zhang authored
Co-authored-by: Zhangyi <1109276519@qq.com>
-
- 16 Jan, 2025 1 commit
-
-
Yineng Zhang authored
-
- 13 Jan, 2025 2 commits
-
-
kk authored
Co-authored-by: Lin, Soga <soga.lin@amd.com>
-
kk authored
Co-authored-by: wunhuang <wunhuang@amd.com>
Co-authored-by: Lin, Soga <soga.lin@amd.com>
-
- 08 Jan, 2025 1 commit
-
-
Lianmin Zheng authored
Co-authored-by: SangBin Cho <rkooo567@gmail.com>
-