- 16 Sep, 2025 11 commits
-
-
Jiacheng Huang authored
-
PanZezhong1725 authored
-
zhushuang authored
-
PanZezhong1725 authored
issue/434 hccl support bf16
-
Ceng2333 authored
Signed-off-by: Ceng <441651826@qq.com>
-
Ceng authored
Signed-off-by: Ceng <441651826@qq.com>
-
PanZezhong1725 authored
Issue/428: Merge `rope_v2` into `rope`
-
Ziminli authored
issue/428: update the rope implementation on Ascend, Cambricon, and Kunlun to use the refactored interface and return unimplemented error for NEOX-style algorithm
-
Ziminli authored
-
Ziminli authored
-
PanZezhong1725 authored
* issue/450: change indexToReducedOffset() to indexToOffset in elementwise framework on CPU, NVIDIA, Cambricon, Metax, Moore, and Kunlun
* issue/450: remove indexToReducedOffset() in all platforms
* issue/450: add the testcases that pinpoint the issue in infiniop-test
-
- 15 Sep, 2025 3 commits
- 10 Sep, 2025 1 commit
-
-
PanZezhong1725 authored
-
- 09 Sep, 2025 2 commits
-
-
PanZezhong1725 authored
issue/434 nccl support bf16
-
PanZezhong1725 authored
-
- 04 Sep, 2025 3 commits
-
-
PanZezhong1725 authored
issue/425: implement GEMM with MUBLAS and MUDNN backends in moore gpu
-
zhushuang authored
-
PanZezhong1725 authored
-
- 03 Sep, 2025 15 commits
-
-
zhangyue authored
issue/416: p800 rearrange kernel
-
zhangyue authored
-
zhangyue authored
-
zhangyue authored
issue/342: random_sample operator on Kunlunxin P800
-
xgqdut2016 authored
-
Ziminli authored
-
xgqdut2016 authored
-
xgqdut2016 authored
-
xgqdut2016 authored
-
xgqdut2016 authored
-
xgqdut2016 authored
-
xgqdut2016 authored
-
zhangyue authored
issue/421: adapt to the rmsnorm test case changes; support weights of bf16 and f16 data types
-
zhangyue authored
-
zhangyue authored
issue/418: fix the error raised when hand-written operators on p800 reference pointers on sm
-
- 02 Sep, 2025 2 commits
- 29 Aug, 2025 1 commit
-
-
PanZezhong1725 authored
-
- 28 Aug, 2025 2 commits