Commits · 6ec2ea40b98c9b2dba1633cd2dc085222fdc16ec · jerrrrry / infinicore

11 Feb, 2026 16 commits
- Merge pull request #865 from gongchensu/Issue/862 · 6ec2ea40
  thatPepe authored Feb 11, 2026
```
Issue/862 - Fix compilation errors (missing headers, cub namespace) t…
```
  6ec2ea40
- Merge branch 'demo131' into Issue/862 · 8d09630a
  gongchensu authored Feb 11, 2026
  
  8d09630a
- Merge pull request #963 from InfiniTensor/issue/523-020 · 012df56c
  thatPepe authored Feb 11, 2026
```
issue/523 - switched to cambricon mlu 1.22 interface
```
  012df56c
- Merge pull request #879 from InfiniTensor/issue/837 · f1b8ab64
  thatPepe authored Feb 11, 2026
```
issue/837 - support int32 and int64 in cambricon add
```
  f1b8ab64
- Merge pull request #1011 from InfiniTensor/issue/1001 · 84201ad0
  thatPepe authored Feb 11, 2026
```
issue/1001 - feat: add paged attention prefill  and decode for moore gpu referencing nvidia
```
  84201ad0
- Merge pull request #1013 from InfiniTensor/issue/1012 · 718eaf42
  thatPepe authored Feb 11, 2026
```
issue/1012 - feat: add paged caching for moore gpu referencing nvidia
```
  718eaf42
- Merge pull request #839 from InfiniTensor/issue/838 · c112132e
  thatPepe authored Feb 11, 2026
```
issue/838 - Cambricon Batched RoPE
```
  c112132e
- demo131 - remove fp32 from paged tests · d3e27d8c
  wooway777 authored Feb 10, 2026
  
  d3e27d8c
- Merge pull request #1010 from InfiniTensor/issue/899 · 513a8502
  thatPepe authored Feb 11, 2026
```
issue/899 - fix: fix causal_softmax and rearrange bug 
```
  513a8502
- issue/1012 - feat: add paged caching for moore gpu referencing nvidia · 8f710be1
  zhushuang authored Feb 10, 2026
  
  8f710be1
- issue/1001 - feat: add paged attention prefill for moore gpu referencing nvidia · 6074f7b8
  zhushuang authored Feb 10, 2026
  
  6074f7b8
- issue/1001 - feat: add paged attention decode for moore gpu referencing nvidia · 3d3a277f
  zhushuang authored Feb 04, 2026
  
  3d3a277f
- Merge pull request #1009 from InfiniTensor/issue/949 · c312f175
  thatPepe authored Feb 11, 2026
```
issue/949 - feat: add silu_and_mul for moore gpu with test pass
```
  c312f175
- issue/899 - fix: fix causal_softmax and rearrange bug · e4bce369
  zhushuang authored Jan 13, 2026
  
  e4bce369
- issue/949 - feat: add silu_and_mul for moore gpu with test pass · 54635d9f
  zhushuang authored Jan 22, 2026
  
  54635d9f
- Support Quantization (#996) · eb89439d
  qinyiqun authored Feb 11, 2026
```
demo131 - multiple issues regarding quantization, qy, and so forth

* issue/843: success per_channel_quant_int8

* issue/843: success qy quant

* issue/843: modified quant

* Add w8a8int8 performance tests

* add infinicore op linear_w8a8i8

* w8a8 linear module functional nn

* issue/843: QY-GPU Support Int8 scale_mm (#68)

* issue/843: success qy scaled_mm

* issue/843: modified kernel.cuh as per_channel_dequant_int8.cuh

* fix parallel slic in w8

* w8: support multiple batch size

* temp: 修改quantconfig处理

* fix format and delete redundancy code

* fix format

* fix format

* fix format

* Refactor: add new API alongside legacy interfaces with deprecation warnings

* 添加w4 inifnicore相关内容，以及将Quantization config划入InfiniCore

* 量化算子支持图

* solve cub version problem and fix code structure

* fix format

* demo131 - remove commented lines

---------
Co-authored-by: xgqdut2016 <kenan_gewei@163.com>
Co-authored-by: xgqdut2016 <140036308+xgqdut2016@users.noreply.github.com>
Co-authored-by: wooway777 <wooway777@gmail.com>
```
  eb89439d
04 Feb, 2026 4 commits
- Merge pull request #999 from InfiniTensor/issue/988 · abab5652
  thatPepe authored Feb 04, 2026
```
issue/988 - adapt to ali ppu
```
  abab5652
- issue/988 - update readme · e0268b24
  wooway777 authored Feb 04, 2026
  
  e0268b24
- issue/988 - unlock unused operators on ali ppu · 5558e856
  wooway777 authored Feb 04, 2026
  
  5558e856
- issue/988 - adapt to ali ppu · 7e2a4c08
  wooway777 authored Jan 27, 2026
  
  7e2a4c08
29 Jan, 2026 1 commit
- issue/995 fix paged attn on iluvatar · bf0c825d
  zhangyue authored Jan 29, 2026
  
  bf0c825d
27 Jan, 2026 19 commits
- Merge pull request #989 from InfiniTensor/issue/811-fix · 70862bcc
  PanZezhong1725 authored Jan 27, 2026
```
issue/811 use relax graph capture mode
```
  70862bcc
- issue/811 use relax graph capture mode, add compile flag for graph instantiate · 807e5e43
  PanZezhong authored Jan 27, 2026
  
  807e5e43
- demo131 - patch lua flags and includes · 1fa56298
  wooway777 authored Jan 26, 2026
  
  1fa56298
- issue/983 - adapted the optimized paged attention to metax · 7a18d241
  wooway777 authored Jan 26, 2026
  
  7a18d241
- issue/979 - removed commented paged attn codes · 4cd1f688
  wooway777 authored Jan 26, 2026
  
  4cd1f688
- issue/979 optimize paged attention · 1c18c046
  PanZezhong authored Jan 23, 2026
  
  1c18c046
- issue/923 - ninetoothed kv caching for nv, il, mtx · 97eced0e
  wooway777 authored Jan 26, 2026
  
  97eced0e
- issue/931 - ninetoothed swiglu for nv, il, mtx · 5614e1be
  wooway777 authored Jan 26, 2026
  
  5614e1be
- issue/919 - ninetoothed flash attention · 6ac8f906
  wooway777 authored Jan 26, 2026
  
  6ac8f906
- issue/935 - add metax include dir for ninetoothed · 47843aa6
  wooway777 authored Jan 15, 2026
  
  47843aa6
- issue/940 - check build result and implicitly require build.py for build ntops · ca58118f
  wooway777 authored Jan 26, 2026
  
  ca58118f
- issue/925 - Speed up `scripts/build_ntops.py` and... · 32340fc3
  Jiacheng Huang authored Jan 14, 2026
```
issue/925 - Speed up `scripts/build_ntops.py` and `src/infiniop/ninetoothed/build.py` with `concurrent.futures`
```
  32340fc3
- issue/402 - convenient ninetoothed util · 55cd22e3
  Jiacheng Huang authored Aug 25, 2025
```
对 `NineToothedTensor` 进行 C++ 层封装

加入使用数组作为 `shape` 和 `strides` 创建 `ninetoothed::Tensor` 的方式

使用 `ninetoothed::Tensor` 接入九齿的 ReLU 算子

Add an include guard to `ninetoothed/utils.h`
```
  55cd22e3
- issue/985 - adjust cxflags and cxxflags for lua scripts · 7c5aa160
  wooway777 authored Jan 26, 2026
  
  7c5aa160
- issue/810 support more ops as graph op · 81e5fe94
  PanZezhong authored Jan 19, 2026
  
  81e5fe94
- issue/791 - fix add_rmsnorm api on mtx and mth · 0611cb1b
  wooway777 authored Jan 26, 2026
  
  0611cb1b
- issue/632 - adapt to iluvatar core 20 · 4ddc6647
  wooway777 authored Jan 19, 2026
  
  4ddc6647
- issue/884 - add_rms_norm on iluvatar, metax and moore · dfafc21f
  wooway777 authored Jan 07, 2026
  
  dfafc21f
- issue/791 fix add_rmsnorm api and rmsnorm module · 0c204dfd
  PanZezhong authored Jan 23, 2026
  
  0c204dfd