- 02 Dec, 2025 1 commit
-
-
Ceng23333 authored
Signed-off-by: Ceng23333 <441651826@qq.com>
-
- 26 Nov, 2025 5 commits
-
-
PanZezhong1725 authored
issue/86 - Add a synchronization function to call before computation ends
-
pengcheng888 authored
-
PanZezhong1725 authored
removed std::move
-
PanZezhong1725 authored
issue/83 - Add AutoLlama class to support creating models for different backends
-
pengcheng888 authored
-
- 22 Nov, 2025 7 commits
-
-
PanZezhong1725 authored
Fix attention prefill timing method, restructure directories
-
PanZezhong authored
-
PanZezhong authored
-
pengcheng888 authored
issue/80 Rename the model folder name to model_path, add llama model parameters for the moore and iluvatar platforms
-
pengcheng888 authored
-
pengcheng888 authored
issue/78 - Add timing test scripts for qwen3 moe and attention in the test folder
-
pengcheng888 authored
-
- 21 Nov, 2025 2 commits
-
-
pengcheng888 authored
issue/76 - Add Python implementation of the llama model
-
pengcheng888 authored
-
- 20 Nov, 2025 1 commit
-
-
pengcheng888 authored
-
- 06 Nov, 2025 1 commit
-
-
spike-zhu authored
-
- 03 Nov, 2025 1 commit
-
-
wooway777 authored
-
- 29 Oct, 2025 1 commit
-
-
PanZezhong1725 authored
issue/64 - jiuge.py verbose output
-
- 28 Oct, 2025 1 commit
-
-
wooway777 authored
-
- 22 Oct, 2025 1 commit
-
-
pengcheng888 authored
-
- 10 Oct, 2025 2 commits
-
-
PanZezhong authored
-
PanZezhong authored
-
- 29 Sep, 2025 2 commits
-
-
PanZezhong1725 authored
Compatible with Hygon DCU.
-
zhuyue authored
-
- 24 Sep, 2025 1 commit
-
-
PanZezhong1725 authored
issue/53: update AWQ-dequantize op name to match infinicore op dequantizeAWQ
-
- 23 Sep, 2025 1 commit
-
-
zhushuang authored
-
- 19 Sep, 2025 1 commit
-
-
PanZezhong1725 authored
issue50: update infiniopDequantize args to match InfiniCore-infiniopDequantize
-
- 18 Sep, 2025 2 commits
- 16 Sep, 2025 2 commits
-
-
PanZezhong1725 authored
-
PanZezhong1725 authored
-
- 08 Sep, 2025 1 commit
-
-
PanZezhong1725 authored
-
- 04 Sep, 2025 1 commit
-
-
thatPepe authored
Model scripts modularization
-
- 03 Sep, 2025 3 commits
-
-
PanZezhong1725 authored
-
PanZezhong1725 authored
-
PanZezhong1725 authored
-
- 02 Sep, 2025 3 commits
-
-
wooway777 authored
-
blkmjsian authored
- deepseek - jiuge 4B awq
-
PanZezhong1725 authored
issue/37 - fixed an inappropriate qk buffer slicing
-