- 26 Dec, 2025 2 commits
-
-
PanZezhong1725 authored
issue/125 添加Paged KV Cache接口
-
PanZezhong authored
-
- 24 Dec, 2025 3 commits
-
-
PanZezhong1725 authored
issue/157 - add/adjust cambricon support in scripts
-
PanZezhong1725 authored
issue/125 统一Cache接口
-
wooway777 authored
-
- 23 Dec, 2025 2 commits
-
-
PanZezhong authored
-
Jiacheng Huang authored
-
- 22 Dec, 2025 2 commits
-
-
qinyiqun authored
Issue/147:增加QY机器编译选项
-
xgqdut2016 authored
-
- 19 Dec, 2025 1 commit
-
-
Jiacheng Huang authored
-
- 18 Dec, 2025 3 commits
-
-
PanZezhong1725 authored
-
PanZezhong authored
-
Ceng authored
* issue/122 :更新benchmark脚本和README.md Signed-off-by:
Ceng23333 <441651826@qq.com> * . Signed-off-by:
Ceng23333 <441651826@qq.com> * fix input_ids Signed-off-by:
Ceng23333 <441651826@qq.com> * explicitly split mmul all subject Signed-off-by:
Ceng23333 <441651826@qq.com> --------- Signed-off-by:
Ceng23333 <441651826@qq.com>
-
- 17 Dec, 2025 1 commit
-
-
Jiacheng Huang authored
-
- 11 Dec, 2025 3 commits
-
-
pengcheng888 authored
issue/115 完善bench.py文件
-
pengcheng888 authored
-
thatPepe authored
Issue/121 - cache managements
-
- 10 Dec, 2025 3 commits
-
-
PanZezhong1725 authored
issue/116 - using batched rope
-
pengcheng888 authored
issue/115 - 添加 bench.py
-
pengcheng888 authored
-
- 09 Dec, 2025 5 commits
-
-
wooway777 authored
-
PanZezhong1725 authored
issue/114 - 添加读取.bin文件权重的代码,更新readme
-
pengcheng888 authored
-
PanZezhong1725 authored
issue/114 QKVParallelLinear, GateUpParallelLinear
-
PanZezhong authored
-
- 08 Dec, 2025 3 commits
-
-
PanZezhong1725 authored
issue/111 - 9g7b分布式
-
Your Name authored
-
Ceng authored
-
- 07 Dec, 2025 2 commits
-
-
pengcheng888 authored
issue/102 - 添加逐文件和逐tensor 读取权重文件的函数
-
pengcheng888 authored
-
- 06 Dec, 2025 3 commits
-
-
PanZezhong1725 authored
issue/92 将run改为异步
-
PanZezhong authored
-
PanZezhong1725 authored
-
- 05 Dec, 2025 1 commit
-
-
PanZezhong authored
-
- 04 Dec, 2025 6 commits
-
-
PanZezhong1725 authored
issue/103 新版分布式推理基建
-
pengcheng888 authored
issue/89 在python的llama中使用matmul函数、以及减少Tensor对象创建次数
-
PanZezhong authored
-
pengcheng888 authored
-
PanZezhong1725 authored
issue/97 Attention 和 KVCache 支持 batch 维度
-
PanZezhong1725 authored
issue/99 - relocated pybind contents
-