Commits · efa4f0f469717dcc534f90a8f36b3d5f4a0bcf6e · jerrrrry / infinicore

29 Sep, 2025 5 commits
- issue/453: add AWQ-dequantize in iluvatar gpu · efa4f0f4
  spike-zhu authored Sep 29, 2025
  
  efa4f0f4
- feat: add AWQ dequantize in moore gpu, with test pass (#485) · fbfb0ef6
  PanZezhong1725 authored Sep 29, 2025
  
  fbfb0ef6
- feat: add AWQ dequantize in moore gpu, with test pass · 8896615a
  zhushuang authored Sep 29, 2025
  
  8896615a
- issue/481 fix: support more data type for rmsnorm in moore gpu · 2f3fd75c
  spike-zhu authored Sep 29, 2025
  
  2f3fd75c
- issue/486 Adapt seven operators to Hygon machines. · e698ef6b
  gongchensu authored Sep 29, 2025
```
Co-authored-by: zhuyue <zhuyue@qiyuanlab.com>
```
  e698ef6b
25 Sep, 2025 3 commits
- Issue/472: 接入昆仑芯通信库 (#479) · 3959c943
  zhangyue authored Sep 25, 2025
```
* issue/472: p800 ccl

* issue/472: 删掉无用操作

* issue/472: fix format

* issue/472: memcpy h2h case
```
  3959c943
- Merge pull request #478 from InfiniTensor/issue/477 · 20a2dbd6
  PanZezhong1725 authored Sep 25, 2025
```
issue/477 - Cambricon MLU NeoX
```
  20a2dbd6
- issue/477 - Cambricon MLU NeoX · 6af2e427
  wooway777 authored Sep 25, 2025
```
Added NeoX support to Cambricon RoPE; Added a missing argument in the profiling script;
```
  6af2e427
24 Sep, 2025 1 commit
- Merge pull request #476 from InfiniTensor/issue/474 · 6b903fd9
  PanZezhong1725 authored Sep 24, 2025
```
issue/474: rename Dequantize to DequantizeAWQ in nvidia gpu
```
  6b903fd9
23 Sep, 2025 2 commits
- feat: rename Dequantize to DequantizeAWQ in nvidia gpu · 4217976d
  zhushuang authored Sep 23, 2025
  
  4217976d
- Merge pull request #470 from InfiniTensor/issue/469 · d3d982df
  PanZezhong1725 authored Sep 23, 2025
```
issue/469: disable NVIDIA-dequantize on Iluvatar GPU via ENABLE_NVIDIA_API marco
```
  d3d982df
19 Sep, 2025 1 commit
- fix: disable NVIDIA-dequantize on Iluvatar GPU via ENABLE_NVIDIA_API macro · be117fe4
  zhushuang authored Sep 19, 2025
  
  be117fe4
18 Sep, 2025 7 commits
- issue/458 add AWQ dequantization torch test and improve variable naming readability · 82b2a84c
  spike-zhu authored Sep 18, 2025
  
  82b2a84c
- Issue/459 (#460) · 3a91947e
  thatPepe authored Sep 18, 2025
```
* issue/459 - Support more data type combinations

* issue/459 - added test cases for 9G7B and 9G70B

* issue/459 - modified rms kernel to support larger tensors
```
  3a91947e
- Merge pull request #467 from InfiniTensor/issue/466 · 2a81c8bd
  zhangyue authored Sep 18, 2025
```
issue/466: 昆仑平台rope关于NEOX算法的实现
```
  2a81c8bd
- feat:hccl support bf16 · d0b7bf92
  zhangyunze authored Sep 18, 2025
  
  d0b7bf92
- Merge pull request #462 from InfiniTensor/issue/434-cambricon · ade3b5da
  PanZezhong1725 authored Sep 18, 2025
```
issue/434 - added bf16 support for Cambricon MLU
```
  ade3b5da
- issue/466: success kunlun rope NEOX · c15189bf
  xgqdut2016 authored Sep 18, 2025
  
  c15189bf
- issue/436：修补昆仑芯端到端推理遇到的问题 (#437) · 6680a8c8
  PanZezhong1725 authored Sep 18, 2025
```
* issue/436: support kunlun rope U32

* issue/436: 支持9g7b 4b模型

---------
Co-authored-by: zhangyue <zhangyue@qiyuanlab.com>
```
  6680a8c8
17 Sep, 2025 2 commits
- issue/436: 支持9g7b 4b模型 · 3bdd832e
  zhangyue authored Sep 17, 2025
  
  3bdd832e
- issue/436: support kunlun rope U32 · 6892a7f5
  xgqdut2016 authored Sep 09, 2025
  
  6892a7f5
16 Sep, 2025 12 commits
- issue/434 - added bf16 support for Cambricon MLU · 94280d85
  wooway777 authored Sep 16, 2025
  
  94280d85
- issue/410 Feature: Add infinicore python package · badccb86
  Jiacheng Huang authored Sep 16, 2025
  
  badccb86
- fix: disable topkrouter on Iluvatar GPU via ENABLE_NVIDIA_API macro (#457) · 1f507406
  PanZezhong1725 authored Sep 16, 2025
  
  1f507406
- fix: disable topkrouter on Iluvatar GPU via ENABLE_NVIDIA_API macro · 8c777f97
  zhushuang authored Sep 16, 2025
  
  8c777f97
- Merge pull request #438 from InfiniTensor/issue/434-metax · b9dd0004
  PanZezhong1725 authored Sep 16, 2025
```
issue/434 hccl support bf16
```
  b9dd0004
- fix rope_v2 compiling && update infiniccl_test · 3bb0c930
  Ceng2333 authored Sep 10, 2025
```
Signed-off-by: Ceng <441651826@qq.com>
```
  3bb0c930
- issue/434 hccl support bf16 · b8609df3
  Ceng authored Sep 09, 2025
```
Signed-off-by: Ceng <441651826@qq.com>
```
  b8609df3
- Merge pull request #429 from InfiniTensor/issue/428_merge_rope_and_rope_v2 · f9d16628
  PanZezhong1725 authored Sep 16, 2025
```
Issue/428: Merge `rope_v2` into `rope`
```
  f9d16628
- issue/428: update the rope implementation on Ascend, Cambricon, and Kunlun to... · 9f0ae734
  Ziminli authored Sep 08, 2025
```
issue/428: update the rope implementation on Ascend, Cambricon, and Kunlun to use the refactored interface and return unimplemented error for NEOX-style algorithm
```
  9f0ae734
- issue/428: accommodate the changes to c/gguf tests · f6e8476b
  Ziminli authored Sep 07, 2025
  
  f6e8476b
- issue/428: merge rope_v2 into rope with algorithm selection · 86515765
  Ziminli authored Sep 07, 2025
  
  86515765
- Issue/450 Fix Elementwise Striding Broadcasting Issue (#452) · 15ac0191
  PanZezhong1725 authored Sep 16, 2025
```
* issue/450: change indexToReducedOffset() to indexToOffset in elementwise framework on CPU, NVIDIA, Cambricon, Metax, Moore, and Kunlun

* issue/450: remove indexToReducedOffset() in all platforms

* issue/450: add the testcases that pinpoint the issue in infiniop-test
```
  15ac0191
15 Sep, 2025 3 commits
- issue/450: add the testcases that pinpoint the issue in infiniop-test · 9db54b8f
  Ziminli authored Sep 15, 2025
  
  9db54b8f
- issue/450: remove indexToReducedOffset() in all platforms · 9ef02a16
  Ziminli authored Sep 15, 2025
  
  9ef02a16
- issue/450: change indexToReducedOffset() to indexToOffset in elementwise... · 5e581b8e
  Ziminli authored Sep 15, 2025
```
issue/450: change indexToReducedOffset() to indexToOffset in elementwise framework on CPU, NVIDIA, Cambricon, Metax, Moore, and Kunlun
```
  5e581b8e
10 Sep, 2025 1 commit
- issue/440 feat: add softplus operator · 1635fd92
  PanZezhong1725 authored Sep 10, 2025
  
  1635fd92
09 Sep, 2025 2 commits
- Merge pull request #435 from InfiniTensor/issue/434-nv · 97f9ac7e
  PanZezhong1725 authored Sep 09, 2025
```
issue/434 nccl support bf16
```
  97f9ac7e
- issue/434 nccl support bf16 · 81093e0b
  PanZezhong1725 authored Sep 09, 2025
  
  81093e0b
04 Sep, 2025 1 commit
- Merge pull request #426 from InfiniTensor/issue/425 · e8e25a25
  PanZezhong1725 authored Sep 04, 2025
```
issue/425: implement GEMM with MUBLAS and MUDNN backends in moore gpu
```
  e8e25a25