- 11 Feb, 2026 1 commit
-
-
qinyiqun authored
demo131 - multiple issues regarding quantization, qy, and so forth * issue/843: success per_channel_quant_int8 * issue/843: success qy quant * issue/843: modified quant * Add w8a8int8 performance tests * add infinicore op linear_w8a8i8 * w8a8 linear module functional nn * issue/843: QY-GPU Support Int8 scale_mm (#68) * issue/843: success qy scaled_mm * issue/843: modified kernel.cuh as per_channel_dequant_int8.cuh * fix parallel slic in w8 * w8: support multiple batch size * temp: 修改quantconfig处理 * fix format and delete redundancy code * fix format * fix format * fix format * Refactor: add new API alongside legacy interfaces with deprecation warnings * 添加w4 inifnicore相关内容,以及将Quantization config划入InfiniCore * 量化算子支持图 * solve cub version problem and fix code structure * fix format * demo131 - remove commented lines --------- Co-authored-by:
xgqdut2016 <kenan_gewei@163.com> Co-authored-by:
xgqdut2016 <140036308+xgqdut2016@users.noreply.github.com> Co-authored-by:
wooway777 <wooway777@gmail.com>
-
- 18 Nov, 2025 1 commit
-
-
thatPepe authored
* issue/571 - introduced the DeviceEvent feature --------- Co-authored-by:Jiacheng Huang <huangjiacheng0709@outlook.com>
-
- 31 Oct, 2025 1 commit
-
-
Ceng23333 authored
- Implement core modules: Linear, Embedding, RMSNorm - Add PyTorch-like macros for module and parameter definition - INFINICORE_NN_MODULE for single module declaration - INFINICORE_NN_MODULE_VEC for module vectors - INFINICORE_NN_PARAMETER for parameter declaration - Corresponding INIT macros for initialization - Implement hierarchical module system with dynamic path generation - Add state_dict() and load_state_dict() support - Refactor module design: protected registration methods, removed path_ member - Add comprehensive test suite including TinyLlama integration - All parameters are protected with public accessors
-
- 11 Oct, 2025 1 commit
-
-
PanZezhong1725 authored
Co-authored-by:
Jiacheng Huang <huangjiacheng0709@outlook.com> Co-authored-by:
wooway777 <wooway777@gmail.com>
-
- 16 Sep, 2025 1 commit
-
-
Jiacheng Huang authored
-