Commits · b49f56456252e896b7cdcf78a2dea5fe463b54ec · jerrrrry / infinicore

04 Jul, 2025 2 commits
- fix format · b49f5645
  penghceng888 authored Jul 04, 2025
  
  b49f5645
- issue/302 - fix compile error of window system · ffecc90c
  pengcheng888 authored Jul 04, 2025
  
  ffecc90c
02 Jul, 2025 5 commits
- Merge pull request #159 from YdrMaster/iluvatar · 8366330c
  PanZezhong1725 authored Jul 02, 2025
```
issue/158/feat: 支持天数
```
  8366330c
- issue/158/fix: 支持天数的特殊 BLOCK SIZE · 2d77211b
  YdrMaster authored Jul 02, 2025
```
Signed-off-by: YdrMaster <ydrml@hotmail.com>
```
  2d77211b
- issue/296 isContigous tolerates length 1 dimension (#297) · f6a645a3
  PanZezhong1725 authored Jul 02, 2025
  
  f6a645a3
- issue/158/fix: 修改天数上的其他编译问题 · 09ed53f7
  YdrMaster authored Jun 06, 2025
```
Signed-off-by: YdrMaster <ydrml@hotmail.com>
```
  09ed53f7
- issue/158/feat: 支持天数编译 · 29089d99
  YdrMaster authored Apr 08, 2025
```
Signed-off-by: YdrMaster <ydrml@hotmail.com>
```
  29089d99
01 Jul, 2025 1 commit

issue/254: 添加算子在CPU和CUDA上对BF16的支持，并增加相应的测试代码 (#255) · f88d4ad8

蒋帅宏（Shuaihong_Jiang） authored Jul 01, 2025



* issue/254: 添加算子在CPU和CUDA上对BF16的支持，并增加相应的测试代码

* issue/254: 将修改后的算子格式化后重新提交

* 修改与最新main的冲突

* 解决冲突后rms_norm原本的精度过不了了，现在由
{"atol": 5e-3, "rtol": 5e-3}更改为
{"atol": 8e-3, "rtol": 8e-3}

* rms_norm在debug模式下FP16的测试用例失败了（本地测试能通过，github上过不了），
所以将容差增大了两倍进行测试

* 将rms_normd的测试输入缩放0.5，将容差改回原始值来进行ci测试

* issue/254: 1.使用CHECK_DTYPE宏来进行数据类型检验
2.在test的utils.py中添加了设备对BF16支持的检验

* issue/254: rms_norm测试fp16容差由
torch.float16: {"atol": 1e-3, "rtol": 1e-3},
改为torch.float16: {"atol": 2e-3, "rtol": 2e-3},
并删除对输入0.5的放缩

* issue/254: 在utils.py中debug方法和debug_all方法中
添加了对BF16的特判

* 修改支持BF16测试的设备类型检查方法

* 修改支持BF16测试的设备检查

* issue/254: reduce redundancy in rms_norm.py

* issue/254: add back the missing comment in rms_norm.py

* issue/254: add fp32 tolerance condition in causal_softmax.py

---------
Co-authored-by: Zimin Li <coollizimin@gmail.com>

f88d4ad8

30 Jun, 2025 3 commits
- Merge pull request #289 from InfiniTensor/issue/288_improve_torch_implementation_compatibility · 105065e2
  PanZezhong1725 authored Jun 30, 2025
```
issue/288: Improve the Compatibility of the Torch Implementations
```
  105065e2
- issue/288: fix spelling error for RearrangeDescriptor in rearrange.py · c132b4cf
  Zimin Li authored Jun 30, 2025
  
  c132b4cf
- issue/288: improve the compatibility of the torch implementations of gemm and random sample · f49235e3
  Zimin Li authored Jun 30, 2025
  
  f49235e3
27 Jun, 2025 7 commits
- Merge pull request #285 from InfiniTensor/issue/137_new · a0abcb2c
  PanZezhong1725 authored Jun 27, 2025
```
issue/137: 添加causal_softmax测例，更新readme（合并）
```
  a0abcb2c
- issue/137: 添加causal_softmax测例，更新readme · be01afcf
  Catheriany authored Jun 27, 2025
  
  be01afcf
- Merge pull request #206 from wooway777/issue/205 · 7eb94082
  PanZezhong1725 authored Jun 27, 2025
```
issue/205 - 添加Sub算子 resolves #205
```
  7eb94082
- issue/205 - 添加Sub算子的gguf测试用例 · 37332d40
  Pepe authored Apr 28, 2025
  
  37332d40
- issue/205 - 添加Sub算子 · 2ccf1d9d
  Pepe authored Apr 27, 2025
```
issue/205 - 添加Sub算子的头文件、CPU实现、cuda实现、及Python测试
```
  2ccf1d9d
- Merge pull request #62 from InfiniTensor/issue/11-randomsample-ascend · 3546e737
  PanZezhong1725 authored Jun 27, 2025
```
issue/11: add random sample ascend
```
  3546e737
- Merge pull request #283 from InfiniTensor/issue/282 · 27b836c9
  PanZezhong1725 authored Jun 27, 2025
```
issue/282: Maca CausalSoftamx精度bug
```
  27b836c9
26 Jun, 2025 2 commits
- issue/282: 添加max_reduction测试 · 31e54f93
  Catheriany authored Jun 26, 2025
  
  31e54f93
- issue/282: 算子贴错导致推理问题修复 · 53468445
  Catheriany authored Jun 26, 2025
  
  53468445
25 Jun, 2025 2 commits
- feat:重构random sample ascend算子 · c1fa267c
  zhangyunze authored May 20, 2025
  
  c1fa267c
- Merge pull request #272 from InfiniTensor/issue/271 · af8bdb43
  PanZezhong1725 authored Jun 25, 2025
```
issue/271: xmake modify in moore gpu
```
  af8bdb43
23 Jun, 2025 1 commit
- Merge pull request #274 from InfiniTensor/issue/273_fix_python_test_debug · 8a22f194
  PanZezhong1725 authored Jun 23, 2025
```
issue/273: Fully Support `equal_nan` Option for `debug()` and `debug_all()`
```
  8a22f194
20 Jun, 2025 1 commit
- issue/273: fully support equal_nan option for debug() and debug_all() · 818db4ae
  Zimin Li authored Jun 20, 2025
  
  818db4ae
19 Jun, 2025 1 commit
- fix: xmake modify in moore gpu · a1fedf0d
  zhushuang authored Jun 19, 2025
  
  a1fedf0d
17 Jun, 2025 5 commits
- Merge pull request #269 from pwhMass/test_rearrange · 7c593b7a
  PanZezhong1725 authored Jun 17, 2025
```
issue/152/feat: 添加 rearrange 算子测例
```
  7c593b7a
- issue/268/feat: 添加 rearrange 算子测例 · 14a278bf
  pwhMass authored Jun 17, 2025
  
  14a278bf
- Merge pull request #153 from YdrMaster/main · 8b3cf2e2
  PanZezhong1725 authored Jun 17, 2025
```
issue/152/feat: 添加 rearrange 算子测例
```
  8b3cf2e2
- issue/152/fix: 改正 rearrange 元信息填充 · 3a0d6510
  YdrMaster authored Jun 16, 2025
```
Signed-off-by: YdrMaster <ydrml@hotmail.com>
```
  3a0d6510
- issue/11: add random sample ascend · b5c6c7b8
  zhangyue authored Feb 18, 2025
  
  b5c6c7b8
13 Jun, 2025 2 commits
- Merge pull request #264 from InfiniTensor/issue/261_optimize_torch_implementation · 384cb5bf
  PanZezhong1725 authored Jun 13, 2025
```
Issue/261: Optimize Torch Implementation of Several Operators
```
  384cb5bf
- issue/261: optimize the torch implementation of add, causal softmax, gemm,... · 505e0d4b
  Zimin Li authored Jun 13, 2025
```
issue/261: optimize the torch implementation of add, causal softmax, gemm, random sample, rearrange, rms norm, rope
```
  505e0d4b
12 Jun, 2025 3 commits
- Merge pull request #258 from InfiniTensor/issue/36 · 2f20af7e
  PanZezhong1725 authored Jun 12, 2025
```
issue/36 - Migrate cuda ramdom sample to metax
```
  2f20af7e
- issue/36 - Migrate cuda ramdom sample to metax, but compile and run too slow · 77070490
  crapromer authored May 25, 2025
  
  77070490
- Merge pull request #257 from InfiniTensor/issue/256 · 8e96d629
  PanZezhong1725 authored Jun 12, 2025
```
issue/256 沐曦通信库
```
  8e96d629
11 Jun, 2025 4 commits
- issue/256 沐曦通信库 · f119c32e
  PanZezhong authored Jun 11, 2025
  
  f119c32e
- Merge pull request #245 from InfiniTensor/issue/238 · 5a4e7a73
  PanZezhong1725 authored Jun 11, 2025
```
issue/238 - Migrate cuda rearrange to metax
```
  5a4e7a73
- Merge pull request #244 from InfiniTensor/issue/39 · 8ccd42bf
  PanZezhong1725 authored Jun 11, 2025
```
issue/39 Migrate cuda causal softmax to metax
```
  8ccd42bf
- Merge pull request #243 from InfiniTensor/issue/37 · 0f132536
  PanZezhong1725 authored Jun 11, 2025
```
issue/37 - Migrate cuda rope to metax
```
  0f132536
10 Jun, 2025 1 commit
- issue/152/feat: 添加 rearrange 算子测例 · 203de1a4
  YdrMaster authored Jun 06, 2025
```
Signed-off-by: YdrMaster <ydrml@hotmail.com>
```
  203de1a4