1. 22 Oct, 2025 1 commit
    • Issue/497 - Enhanced Test Framework (#520) · f5e6d729
      thatPepe authored
      * issue/497 - add dtype __eq__ and __hash__
      
      * issue/497 - simplified infinicore test functions
      
      * issue/497 - improved test framework
      
      greatly reduced the code required for specific operators;
      added strided tensor support;
      
      * issue/497 - add an add interface to assist testing
      
      * issue/497 - generalized test framework based on add
      
      * issue/497 - support non-contiguous tensors in result comparison
      
      * issue/497 - temporarily fixed strided tensor creation
      
      * issue/497 - rms norm interface
      
      * issue/497 - now requires test function definition
      
      * issue/497 - support mixed dtype
      
      * issue/497 - initial rms norm test
      
      * issue/497 - unified in-place and out-of-place tests
      
      * issue/497 - renamed src/infinicore/op
      
      * issue/497 - reduced comments
      
      * issue/497 - attention
      
      * issue/497 - removed generic parameter mapping
      
      * issue/497 - temporary attention test
      
      * issue/497 - capitalize op name initial
      
      * issue/497 - add a script to run all op tests
      
      * issue/497 - fix comments
      
      * issue/497 - simplified infinicore tensor creation from torch
      
      * issue/497 - support tensor init modes
      
      * issue/497 - support tensor from/to files
      
      * issue/497 - adjust naming
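The dtype `__eq__` and `__hash__` work mentioned in the first bullet can be sketched in plain Python. This is a hypothetical illustration, not the actual infinicore `DType` API: without consistent equality and hashing, two wrapper objects describing the same dtype compare unequal and cannot serve as dictionary keys (e.g. for per-dtype tolerance tables).

```python
# Hypothetical sketch of a dtype wrapper; field names are assumptions.
class DType:
    def __init__(self, name, itemsize):
        self.name = name          # e.g. "float16"
        self.itemsize = itemsize  # size in bytes

    def __eq__(self, other):
        # Two DType instances are equal when they describe the same type.
        return (isinstance(other, DType)
                and self.name == other.name
                and self.itemsize == other.itemsize)

    def __hash__(self):
        # Must agree with __eq__ so DType works as a dict/set key.
        return hash((self.name, self.itemsize))

f16_a = DType("float16", 2)
f16_b = DType("float16", 2)
assert f16_a == f16_b                      # value equality, not identity
tolerances = {f16_a: {"atol": 1e-3, "rtol": 1e-3}}
assert tolerances[f16_b]["atol"] == 1e-3   # lookup via an equal instance
```

Defining `__hash__` alongside `__eq__` is required because Python sets `__hash__` to `None` on any class that overrides `__eq__` without it.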
  2. 21 Oct, 2025 1 commit
  3. 16 Oct, 2025 1 commit
  4. 11 Oct, 2025 2 commits
  5. 29 Sep, 2025 2 commits
  6. 25 Sep, 2025 1 commit
  7. 23 Sep, 2025 1 commit
  8. 18 Sep, 2025 2 commits
  9. 17 Sep, 2025 1 commit
  10. 16 Sep, 2025 2 commits
  11. 15 Sep, 2025 2 commits
  12. 10 Sep, 2025 1 commit
  13. 03 Sep, 2025 3 commits
  14. 02 Sep, 2025 1 commit
  15. 27 Aug, 2025 1 commit
  16. 22 Aug, 2025 1 commit
  17. 20 Aug, 2025 1 commit
  18. 14 Aug, 2025 1 commit
  19. 13 Aug, 2025 3 commits
  20. 12 Aug, 2025 1 commit
  21. 06 Aug, 2025 1 commit
  22. 14 Jul, 2025 1 commit
  23. 11 Jul, 2025 2 commits
  24. 10 Jul, 2025 1 commit
  25. 09 Jul, 2025 2 commits
  26. 08 Jul, 2025 1 commit
  27. 07 Jul, 2025 1 commit
  28. 04 Jul, 2025 1 commit
  29. 01 Jul, 2025 1 commit
    • issue/254: Add BF16 support for operators on CPU and CUDA, with corresponding test code (#255) · f88d4ad8
      蒋帅宏(Shuaihong_Jiang) authored
      
      
      * issue/254: add BF16 support for operators on CPU and CUDA, and add corresponding test code
      
      * issue/254: reformat the modified operators and resubmit
      
      * resolve conflicts with the latest main
      
      * after resolving conflicts, rms_norm no longer passed at its original precision;
      the tolerance was changed from {"atol": 5e-3, "rtol": 5e-3}
      to {"atol": 8e-3, "rtol": 8e-3}
      
      * the rms_norm FP16 test case failed in debug mode (it passes locally but not on GitHub),
      so the tolerance was doubled for testing
      
      * scale the rms_norm test input by 0.5 and restore the original tolerance for the CI test
      
      * issue/254: 1. use the CHECK_DTYPE macro for data type validation;
      2. add a check for device BF16 support in the test utils.py
      
      * issue/254: change the rms_norm FP16 test tolerance from
      torch.float16: {"atol": 1e-3, "rtol": 1e-3}
      to torch.float16: {"atol": 2e-3, "rtol": 2e-3},
      and remove the 0.5 scaling of the input
      
      * issue/254: add a BF16 special case to the debug and debug_all
      methods in utils.py
      
      * change the device-type check method for BF16-capable tests
      
      * change the device check for BF16-capable tests
      
      * issue/254: reduce redundancy in rms_norm.py
      
      * issue/254: add back the missing comment in rms_norm.py
      
      * issue/254: add fp32 tolerance condition in causal_softmax.py
      
      ---------
      Co-authored-by: Zimin Li <coollizimin@gmail.com>
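The tolerance churn described above (FP16 widened from 1e-3 to 2e-3, an 8e-3 pair after conflict resolution, and a BF16 special case) amounts to per-dtype tolerance selection. A minimal sketch, assuming a plain dict keyed by dtype name; the float32 and bfloat16 values here are illustrative assumptions, only the FP16 pair comes from the commit message:

```python
# Per-dtype tolerance table; FP16 values are from the commit message,
# the float32 and bfloat16 entries are assumed for illustration.
TOLERANCES = {
    "float32":  {"atol": 1e-6, "rtol": 1e-6},   # assumed
    "float16":  {"atol": 2e-3, "rtol": 2e-3},   # per the commit message
    "bfloat16": {"atol": 8e-3, "rtol": 8e-3},   # assumed: BF16 has fewer
}                                               # mantissa bits than FP16

def is_close(a, b, dtype):
    # Same closeness criterion torch.allclose uses:
    # |a - b| <= atol + rtol * |b|
    tol = TOLERANCES[dtype]
    return abs(a - b) <= tol["atol"] + tol["rtol"] * abs(b)

# A 0.0015 absolute error passes under FP16 tolerances but not FP32.
assert is_close(1.0, 1.0015, "float16")
assert not is_close(1.0, 1.0015, "float32")
```

Keeping the tolerances in one table rather than scattered per test makes the kind of adjustment these commits describe a one-line change.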