- 15 Sep, 2025 1 commit
-
-
Ziminli authored
issue/450: change indexToReducedOffset() to indexToOffset in elementwise framework on CPU, NVIDIA, Cambricon, Metax, Moore, and Kunlun
-
- 10 Sep, 2025 1 commit
-
-
PanZezhong1725 authored
-
- 09 Sep, 2025 2 commits
-
-
PanZezhong1725 authored
issue/434 nccl support bf16
-
PanZezhong1725 authored
-
- 04 Sep, 2025 3 commits
-
-
PanZezhong1725 authored
issue/425: implement GEMM with MUBLAS and MUDNN backends in moore gpu
-
zhushuang authored
-
PanZezhong1725 authored
-
- 03 Sep, 2025 15 commits
-
-
zhangyue authored
issue/416: p800 rearrange kernel
-
zhangyue authored
-
zhangyue authored
-
zhangyue authored
issue/342: 昆仑芯P800上random_sample算子
-
xgqdut2016 authored
-
Ziminli authored
-
xgqdut2016 authored
-
xgqdut2016 authored
-
xgqdut2016 authored
-
xgqdut2016 authored
-
xgqdut2016 authored
-
xgqdut2016 authored
-
zhangyue authored
issue/421: 适配 rmsnorm 测例修改,支持 bf16 和 f16数据类型 weights
-
zhangyue authored
-
zhangyue authored
issue/418: 解决 p800 上手写算子引用 sm 上指针的报错问题
-
- 02 Sep, 2025 2 commits
- 29 Aug, 2025 1 commit
-
-
PanZezhong1725 authored
-
- 28 Aug, 2025 2 commits
- 27 Aug, 2025 7 commits
- 26 Aug, 2025 5 commits
-
-
xgqdut2016 authored
-
zhangyue authored
issue/404: kunlun_common.h 和 kunlun_handle.h 解耦合
-
zhangyue authored
-
zhangyue authored
-
zhangyue authored
-
- 25 Aug, 2025 1 commit
-
-
zhangyue authored
issue/390: kunlun p800 causal softmax
-