- 06 Dec, 2025 1 commit
-
-
PanZezhong1725 authored
-
- 05 Dec, 2025 1 commit
-
-
PanZezhong authored
-
- 04 Dec, 2025 8 commits
-
-
PanZezhong1725 authored
issue/103 New distributed inference infrastructure
-
pengcheng888 authored
issue/89 Use the matmul function in the Python llama and reduce the number of Tensor objects created
-
PanZezhong authored
-
pengcheng888 authored
-
PanZezhong1725 authored
issue/97 Support the batch dimension in Attention and KVCache
-
PanZezhong1725 authored
issue/99 - relocated pybind contents
-
wooway777 authored
-
PanZezhong authored
-
- 03 Dec, 2025 4 commits
-
-
PanZezhong authored
-
PanZezhong1725 authored
issue/95 Name the pybind target _infinilm
-
PanZezhong authored
-
PanZezhong1725 authored
Issue/74 Adapt the Llama model based on InfiniCore::nn::module
-
- 02 Dec, 2025 1 commit
-
-
Ceng23333 authored
Signed-off-by: Ceng23333 <441651826@qq.com>
-
- 26 Nov, 2025 5 commits
-
-
PanZezhong1725 authored
issue/86 - Add a synchronization function that runs before computation ends
-
pengcheng888 authored
-
PanZezhong1725 authored
removed std::move
-
PanZezhong1725 authored
issue/83 - Add the AutoLlama class to support creating models with different backends
-
pengcheng888 authored
-
- 22 Nov, 2025 7 commits
-
-
PanZezhong1725 authored
Fix the attention prefill timing method and restructure directories
-
PanZezhong authored
-
PanZezhong authored
-
pengcheng888 authored
issue/80 Rename the model folder name to model_path and add llama model parameters for the moore and iluvatar platforms
-
pengcheng888 authored
-
pengcheng888 authored
issue/78 - Add timing test scripts for qwen3 MoE and attention in the test folder
-
pengcheng888 authored
-
- 21 Nov, 2025 2 commits
-
-
pengcheng888 authored
issue/76 - Add a Python implementation of the llama model
-
pengcheng888 authored
-
- 20 Nov, 2025 1 commit
-
-
pengcheng888 authored
-
- 06 Nov, 2025 1 commit
-
-
spike-zhu authored
-
- 03 Nov, 2025 1 commit
-
-
wooway777 authored
-
- 29 Oct, 2025 1 commit
-
-
PanZezhong1725 authored
issue/64 - jiuge.py verbose output
-
- 28 Oct, 2025 1 commit
-
-
wooway777 authored
-
- 22 Oct, 2025 1 commit
-
-
pengcheng888 authored
-
- 10 Oct, 2025 2 commits
-
-
PanZezhong authored
-
PanZezhong authored
-
- 29 Sep, 2025 2 commits
-
-
PanZezhong1725 authored
Compatible with Hygon DCU.
-
zhuyue authored
-
- 24 Sep, 2025 1 commit
-
-
PanZezhong1725 authored
issue/53: update AWQ-dequantize op name to match infinicore op dequantizeAWQ
-