- 17 Dec, 2025 1 commit
-
-
Jiacheng Huang authored
-
- 11 Dec, 2025 3 commits
-
-
pengcheng888 authored
issue/115 完善bench.py文件
-
pengcheng888 authored
-
thatPepe authored
Issue/121 - cache managements
-
- 10 Dec, 2025 3 commits
-
-
PanZezhong1725 authored
issue/116 - using batched rope
-
pengcheng888 authored
issue/115 - 添加 bench.py
-
pengcheng888 authored
-
- 09 Dec, 2025 5 commits
-
-
wooway777 authored
-
PanZezhong1725 authored
issue/114 - 添加读取.bin文件权重的代码,更新readme
-
pengcheng888 authored
-
PanZezhong1725 authored
issue/114 QKVParallelLinear, GateUpParallelLinear
-
PanZezhong authored
-
- 08 Dec, 2025 3 commits
-
-
PanZezhong1725 authored
issue/111 - 9g7b分布式
-
Your Name authored
-
Ceng authored
-
- 07 Dec, 2025 2 commits
-
-
pengcheng888 authored
issue/102 - 添加逐文件和逐tensor 读取权重文件的函数
-
pengcheng888 authored
-
- 06 Dec, 2025 3 commits
-
-
PanZezhong1725 authored
issue/92 将run改为异步
-
PanZezhong authored
-
PanZezhong1725 authored
-
- 05 Dec, 2025 1 commit
-
-
PanZezhong authored
-
- 04 Dec, 2025 8 commits
-
-
PanZezhong1725 authored
issue/103 新版分布式推理基建
-
pengcheng888 authored
issue/89 在python的llama中使用matmul函数、以及减少Tensor对象创建次数
-
PanZezhong authored
-
pengcheng888 authored
-
PanZezhong1725 authored
issue/97 Attention 和 KVCache 支持 batch 维度
-
PanZezhong1725 authored
issue/99 - relocated pybind contents
-
wooway777 authored
-
PanZezhong authored
-
- 03 Dec, 2025 4 commits
-
-
PanZezhong authored
-
PanZezhong1725 authored
issue/95 将pybind target命名为_infinilm
-
PanZezhong authored
-
PanZezhong1725 authored
Issue/74 基于InfiniCore::nn::module适配Llama模型
-
- 02 Dec, 2025 1 commit
-
-
Ceng23333 authored
Signed-off-by:Ceng23333 <441651826@qq.com>
-
- 26 Nov, 2025 5 commits
-
-
PanZezhong1725 authored
issue/86 - 添加计算结束前的同步函数
-
pengcheng888 authored
-
PanZezhong1725 authored
removed std::move
-
PanZezhong1725 authored
issue/83 - 添加AutoLlama类,支持创建不同backend的模型
-
pengcheng888 authored
-
- 22 Nov, 2025 1 commit
-
-
PanZezhong1725 authored
修复attention prefill计时方式,重构目录
-