- 28 Apr, 2025 3 commits
-
-
Yineng Zhang authored
-
Trevor Morris authored
-
Liangsheng Yin authored
-
- 27 Apr, 2025 30 commits
-
-
Lianmin Zheng authored
-
Baizhou Zhang authored
-
Lianmin Zheng authored
-
Liangsheng Yin authored
-
Lianmin Zheng authored
-
Lianmin Zheng authored
-
Lianmin Zheng authored
-
Lianmin Zheng authored
-
Lianmin Zheng authored
Revert "Revert "fix: import vllm_rotary_embedding error when head_size not in 64, 128, 256, 512"" (#5777)
-
Lianmin Zheng authored
-
zhanweidu authored
Signed-off-by: congcongke <zhanweidu@163.com>
-
Kebe authored
Signed-off-by: Kebe <mail@kebe7jun.com>
-
lambert0312 authored
-
Michał Moskal authored
-
aoshen524 authored
-
yan97ao authored
-
JieXin Liang authored
-
Stefan He authored
-
vzed authored
-
Frankey_8080 authored
-
JieXin Liang authored
-
DavidBao authored
-
Ke Bao authored
-
vzed authored
-
saltyfish66 authored
-
Yuhong Guo authored
-
JieXin Liang authored
-
Wenxuan Tan authored
-
Kyungmin Lee authored
-
liwenju0 authored
-
- 26 Apr, 2025 4 commits
-
-
Yi Zhang authored
-
ZXN authored
Co-authored-by: bppps <zouyu.zzx@alibaba-inc.com>
-
Mick authored
Co-authored-by: Xinyuan Tong <justinning0323@outlook.com>
Co-authored-by: XinyuanTong <115166877+JustinTong0323@users.noreply.github.com>
-
Ke Bao authored
-
- 25 Apr, 2025 3 commits
-
-
Xiaoyu Zhang authored
update triton 3.2.0 h200 fused moe triton config and add warning about triton fused_moe_kernel performance degradation due to different Triton versions. (#5740)
-
Lianmin Zheng authored
-
Lianmin Zheng authored
Revert "[Model] Support `ArcticForCausalLM` architecture (Snowflake/snowflake-arctic-instruct)" (#5754)
-