- 18 Jul, 2025 8 commits
-
-
Sai Enduri authored
-
Zhiqiang Xie authored
-
jianan-gu authored
[Quantization][w8a8_int8] Fix weight loading issue for w8a8_int8 path with "ignore" layer list in quantization config (#7820)
-
jianan-gu authored
-
yilian49 authored
-
Minglei Zhu authored
-
Qi Yuhang authored
-
Mick authored
-
- 17 Jul, 2025 12 commits
-
-
Zhao Chen authored
Signed-off-by:Zhao Chen <zhaochen.zju@gmail.com>
-
Asher authored
Signed-off-by:Asher Zhang <asherszhang@tencent.com>
-
Ziqi Fan authored
-
fzyzcjy authored
-
Yuan Luo authored
-
Cheng Wan authored
-
Cheng Wan authored
-
hzh0425 authored
-
Cheng Wan authored
-
Simo Lin authored
-
Yingchun Lai authored
Co-authored-by:Stefan He <hebiaobuaa@gmail.com>
-
Mick authored
-
- 16 Jul, 2025 11 commits
-
-
Peng Zhang authored
-
Xiaoze Fan authored
Signed-off-by:
jason-fxz <jason341132@qq.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Simo Lin authored
-
Peng Zhang authored
-
YanbingJiang authored
-
Mick authored
-
Qiaolin Yu authored
-
Qiaolin Yu authored
-
Xinyuan Tong authored
Signed-off-by:Xinyuan Tong <justinning0323@outlook.com>
-
strgrb authored
Co-authored-by:Zhang Kaihong <zhangkaihong.zkh@alibaba-inc.com>
-
kozo authored
Signed-off-by:
Xinyuan Tong <justinning0323@outlook.com> Co-authored-by:
Xinyuan Tong <justinning0323@outlook.com>
-
- 15 Jul, 2025 9 commits
-
-
Sai Enduri authored
Co-authored-by:Hubert Lu <55214931+hubertlu-tw@users.noreply.github.com>
-
yhang authored
-
Albert authored
Signed-off-by:Tianyu Zhou <albert.zty@antgroup.com>
-
Yineng Zhang authored
-
jiawei authored
-
Xinyuan Tong authored
-
Xinyuan Tong authored
Signed-off-by:Xinyuan Tong <justinning0323@outlook.com>
-
Qi Yuhang authored
[feat]Support fusion kernel for constructing quant input and scale factor for fp8_blockwise_scaled_grouped_mm (#8023)
-
Chang Su authored
-