- 03 Mar, 2026 2 commits
- 02 Mar, 2026 2 commits
- 28 Feb, 2026 7 commits
-
-
PanZezhong authored
-
PanZezhong authored
-
spike-zhu authored
Issue/1021: issue/1021 - feat: support bf16 in infiniccl with mccl
-
zhushuang authored
-
wooway777 authored
-
xgqdut2016 authored
-
wooway777 authored
-
- 24 Feb, 2026 2 commits
- 13 Feb, 2026 1 commit
-
-
thatPepe authored
Demo-131 Cuda graph with optimized paged attention
-
- 12 Feb, 2026 12 commits
-
-
thatPepe authored
issue/961: fix metax init with preload
-
thatPepe authored
Issue/1008
-
zhangyue authored
-
wooway777 authored
-
zhangyue authored
-
zhangyue authored
-
zhangyue authored
-
zhangyue authored
-
zhangyue authored
-
zhangyue authored
-
thatPepe authored
Issue/972:摩尔平台基于 muDNN 的 w8a8 量化实现,并完善 scaled_mm_int8 python 测试脚本
-
zhushuang authored
-
- 11 Feb, 2026 14 commits
-
-
zhushuang authored
-
zhushuang authored
-
thatPepe authored
Issue/862 - Fix compilation errors (missing headers, cub namespace) t…
-
gongchensu authored
-
thatPepe authored
issue/523 - switched to cambricon mlu 1.22 interface
-
thatPepe authored
issue/837 - support int32 and int64 in cambricon add
-
thatPepe authored
issue/1001 - feat: add paged attention prefill and decode for moore gpu referencing nvidia
-
thatPepe authored
issue/1012 - feat: add paged caching for moore gpu referencing nvidia
-
thatPepe authored
issue/838 - Cambricon Batched RoPE
-
wooway777 authored
-
thatPepe authored
issue/899 - fix: fix causal_softmax and rearrange bug
-
zhushuang authored
-
zhushuang authored
-
zhushuang authored
-