- 11 Feb, 2026 1 commit
-
-
qinyiqun authored
demo131 - multiple issues regarding quantization, qy, and so forth * issue/843: success per_channel_quant_int8 * issue/843: success qy quant * issue/843: modified quant * Add w8a8int8 performance tests * add infinicore op linear_w8a8i8 * w8a8 linear module functional nn * issue/843: QY-GPU Support Int8 scale_mm (#68) * issue/843: success qy scaled_mm * issue/843: modified kernel.cuh as per_channel_dequant_int8.cuh * fix parallel slic in w8 * w8: support multiple batch size * temp: 修改quantconfig处理 * fix format and delete redundancy code * fix format * fix format * fix format * Refactor: add new API alongside legacy interfaces with deprecation warnings * 添加w4 inifnicore相关内容,以及将Quantization config划入InfiniCore * 量化算子支持图 * solve cub version problem and fix code structure * fix format * demo131 - remove commented lines --------- Co-authored-by:
xgqdut2016 <kenan_gewei@163.com> Co-authored-by:
xgqdut2016 <140036308+xgqdut2016@users.noreply.github.com> Co-authored-by:
wooway777 <wooway777@gmail.com>
-
- 04 Feb, 2026 1 commit
-
-
wooway777 authored
-
- 29 Jan, 2026 1 commit
-
-
zhangyue authored
-
- 27 Jan, 2026 5 commits
- 21 Jan, 2026 1 commit
-
-
PanZezhong authored
-
- 12 Jan, 2026 1 commit
-
-
PanZezhong authored
-
- 30 Dec, 2025 1 commit
-
-
PanZezhong authored
-
- 26 Dec, 2025 2 commits
-
-
qinyiqun authored
* can commit * can exec sm_90a * can exec < sm_90 * fix format * fix format * 增加测试,测试对标sglang test * fix format 1 * fix format 2 * add compile option to disable cutlass
-
PanZezhong1725 authored
This reverts commit 25258029.
-
- 25 Dec, 2025 1 commit
-
-
qinyiqun authored
-
- 19 Dec, 2025 1 commit
-
-
qinyiqun authored
-
- 10 Dec, 2025 1 commit
-
-
thatPepe authored
* issue/739 - support batched RoPE on Nvidia and CPU * issue/739 - metax, moore batched rope * issue/739 - adjust metax flags * issue/739 - added a rope module interface to forward inplace in output tensor
-
- 01 Dec, 2025 1 commit
-
-
Ceng authored
* issue/634: InfiniCore 支持InfiniLM Llama模型适配 Signed-off-by:
Ceng23333 <441651826@qq.com> * . Signed-off-by:
Ceng23333 <441651826@qq.com> --------- Signed-off-by:
Ceng23333 <441651826@qq.com>
-
- 21 Nov, 2025 1 commit
-
-
qinyiqun authored
* ISSUE/628 适配QY C610 GPU,增加编译选项,适配已有算子。添加bge类模型所需的算子,包括gelu,layer_norm,lp_norm(支持l1,l2 norm),relu,softmax,tanh。 --------- Co-authored-by:
xgqdut2016 <kenan_gewei@163.com> Co-authored-by:
xgqdut2016 <140036308+xgqdut2016@users.noreply.github.com>
-
- 20 Nov, 2025 1 commit
-
-
crapromer authored
* initial add mc support for meta * add command description for maca compilation * rebase metax maca support to main * issue/445 - clang format code on ubuntu * issue//445 - change config from use_mc to use-mc and format code
-
- 19 Nov, 2025 1 commit
-
-
zhangyue authored
* issue/614: fix device class in python * issue/614: chang list to List in typing * issue/614: undo kunlun.lua
-
- 31 Oct, 2025 1 commit
-
-
Ceng23333 authored
- Implement core modules: Linear, Embedding, RMSNorm - Add PyTorch-like macros for module and parameter definition - INFINICORE_NN_MODULE for single module declaration - INFINICORE_NN_MODULE_VEC for module vectors - INFINICORE_NN_PARAMETER for parameter declaration - Corresponding INIT macros for initialization - Implement hierarchical module system with dynamic path generation - Add state_dict() and load_state_dict() support - Refactor module design: protected registration methods, removed path_ member - Add comprehensive test suite including TinyLlama integration - All parameters are protected with public accessors
-
- 29 Oct, 2025 1 commit
-
-
zhuyue authored
-
- 28 Oct, 2025 1 commit
-
-
zhuyue authored
-
- 16 Oct, 2025 1 commit
-
-
Ceng authored
-
- 29 Sep, 2025 2 commits
-
-
spike-zhu authored
-
gongchensu authored
Co-authored-by:zhuyue <zhuyue@qiyuanlab.com>
-
- 25 Sep, 2025 1 commit
-
-
zhangyue authored
* issue/472: p800 ccl * issue/472: 删掉无用操作 * issue/472: fix format * issue/472: memcpy h2h case
-
- 04 Sep, 2025 1 commit
-
-
zhushuang authored
-
- 27 Aug, 2025 2 commits
- 26 Aug, 2025 1 commit
-
-
zhangyue authored
-
- 18 Aug, 2025 1 commit
-
-
zhushuang authored
-
- 13 Aug, 2025 1 commit
-
-
thatPepe authored
-
- 07 Aug, 2025 1 commit
-
-
zhangyue authored
* 适配p800软件栈 * 支持p800上编译手写算子,重构elementwise 算子组件 * fix format * delete include xtdk_io * 支持 elementwise float16 bfloat16 数据类型 * 修改自定义 size_t 和 ptrdiff_t 中基础数据类型 * p800 拆分 kernel
-
- 06 Aug, 2025 2 commits
- 30 Jul, 2025 1 commit
-
-
Catheriany authored
-
- 18 Jul, 2025 2 commits
-
-
Jiacheng Huang authored
Issue/277: 为 ReLU 的九齿实现接入天数设备
-
pengcheng888 authored
-
- 15 Jul, 2025 1 commit
-
-
Jiacheng Huang authored
-