- 01 Dec, 2025 3 commits
-
-
xgqdut2016 authored
-
pengcheng888 authored
issue/680-为mulmat添加alpha参数算子
-
pengcheng888 authored
-
- 26 Nov, 2025 2 commits
-
-
PanZezhong1725 authored
fix: correct macro for mccub/hccub conditional compilation.
-
zhuyue authored
-
- 25 Nov, 2025 2 commits
-
-
PanZezhong1725 authored
issue/666 - Standardized test imports
-
PanZezhong1725 authored
issue/612 - differentiate mempcy sync/async
-
- 24 Nov, 2025 1 commit
-
-
wooway777 authored
-
- 22 Nov, 2025 13 commits
-
-
PanZezhong1725 authored
Issue/658 - Update test tolerances and remove device-specific dtype f…
-
zhuyue authored
-
PanZezhong1725 authored
将 `README.md` 中的 `pip install . -e` 改为 `pip install -e .`
-
PanZezhong1725 authored
Issue/661 - Support run.py --bench
-
zhuyue authored
-
gongchensu authored
Feature/moore adapt
-
zhuyue authored
-
zhuyue authored
-
zhuyue authored
- Implement Moore backend for add, mul, and silu elementwise operations - Filter unsupported dtypes (BF16, F64) for Moore platform in tests
-
PanZezhong1725 authored
issue/656 - updated readme
-
wooway777 authored
-
Your Name authored
-
Your Name authored
-
- 21 Nov, 2025 11 commits
-
-
gongchensu authored
Issue/648 - Remove metax device support for layer_norm, softmax, and …
-
zhuyue authored
-
PanZezhong1725 authored
Issue/654 - Fix CUDA 13.0 compatibility issues
-
zhuyue authored
-
zhuyue authored
-
pengcheng888 authored
issue/652- 修改rope函数的测试脚本
-
pengcheng888 authored
-
gongchensu authored
Fix metax add rms_norm operators.
-
zhuyue authored
-
qinyiqun authored
* ISSUE/628 适配QY C610 GPU,增加编译选项,适配已有算子。添加bge类模型所需的算子,包括gelu,layer_norm,lp_norm(支持l1,l2 norm),relu,softmax,tanh。 --------- Co-authored-by:
xgqdut2016 <kenan_gewei@163.com> Co-authored-by:
xgqdut2016 <140036308+xgqdut2016@users.noreply.github.com>
-
wooway777 authored
-
- 20 Nov, 2025 8 commits
-
-
xgqdut2016 authored
-
pengcheng888 authored
issue/642 - added a breakpoint in run.py debug
-
wooway777 authored
-
crapromer authored
* issue/636 - add support for fp8 with maca sdk * issue/636 - add functional header to support Fn * issue/636 - format code with clang
-
pengcheng888 authored
issue/637 - 为from_list函数添加bf16数据类型的支持
-
pengcheng888 authored
-
PanZezhong1725 authored
issue/608- 修改functional中的rope, 添加nn.RoPE的实现和测试
-
PanZezhong1725 authored
Co-authored-by:pengcheng888 <pengcheng@example.com>
-