- 04 Feb, 2026 1 commit
-
-
gongchensu authored
Issue/170 - Add HYGON support and improve device type handling.
-
- 30 Jan, 2026 2 commits
-
-
PanZezhong1725 authored
issue/210 support random sampling with random number
-
PanZezhong authored
-
- 23 Jan, 2026 3 commits
-
-
gongchensu authored
-
PanZezhong1725 authored
issue/199 - 支持qwen3模型
-
wangpengcheng authored
-
- 22 Jan, 2026 1 commit
-
-
wangpengcheng authored
-
- 20 Jan, 2026 1 commit
-
-
PanZezhong1725 authored
-
- 19 Jan, 2026 1 commit
-
-
MaYuhang authored
-
- 16 Jan, 2026 1 commit
-
-
Haojie Wang authored
issue/186 support longrope
-
- 14 Jan, 2026 2 commits
-
-
Haojie Wang authored
Issue/168 InfiniLM接入paged attention接口
-
PanZezhong authored
-
- 13 Jan, 2026 1 commit
-
-
PanZezhong authored
-
- 09 Jan, 2026 1 commit
-
-
PanZezhong authored
-
- 08 Jan, 2026 1 commit
-
-
PanZezhong authored
-
- 06 Jan, 2026 1 commit
-
-
PanZezhong authored
-
- 04 Jan, 2026 1 commit
-
-
PanZezhong authored
-
- 30 Dec, 2025 1 commit
-
-
PanZezhong authored
-
- 29 Dec, 2025 1 commit
-
-
Jiacheng Huang authored
* 将 `cpp.LlamaForCausalLM` 提出,变为 `infinilm.infer_engine.InferEngine` * 将 `Config` 构造逻辑拆分至 `AutoConfig` 中 * 在 `examples` 脚本中直接构造 `InferEngine` * 将 `random_sample` 计算放入模型中 * 为 `InferEngine` 单独实现 `generate` * 允许通过 `GenerationConfig` 传递 `temperature`、`top_k`、`top_p` * 将 `random_sample` 处理从 `LlamaForCausalLM` 中转移到 `RankWorker` 里 * 在 `InferEngine.generate` 中直接 `append(output_id)` * 修复 commit `13aa90c57de369f9985593c0066b6b06a7508b24` 引入的分布式卡死问题 * 将 `InferEngine.forward` 的接口与 C++ 层的 `InferEngine.Input` 对齐 * 提供了 `_measure_and_log_time` 参数来开启之前的 `generate` 内部计时功能
-
- 26 Dec, 2025 4 commits
-
-
PanZezhong1725 authored
issue/125 add pybind for paged attention cache config
-
PanZezhong authored
-
PanZezhong1725 authored
issue/125 添加Paged KV Cache接口
-
PanZezhong authored
-
- 24 Dec, 2025 3 commits
-
-
PanZezhong1725 authored
issue/157 - add/adjust cambricon support in scripts
-
PanZezhong1725 authored
issue/125 统一Cache接口
-
wooway777 authored
-
- 23 Dec, 2025 2 commits
-
-
PanZezhong authored
-
Jiacheng Huang authored
-
- 22 Dec, 2025 2 commits
-
-
qinyiqun authored
Issue/147:增加QY机器编译选项
-
xgqdut2016 authored
-
- 19 Dec, 2025 1 commit
-
-
Jiacheng Huang authored
-
- 18 Dec, 2025 3 commits
-
-
PanZezhong1725 authored
-
PanZezhong authored
-
Ceng authored
* issue/122 :更新benchmark脚本和README.md Signed-off-by:
Ceng23333 <441651826@qq.com> * . Signed-off-by:
Ceng23333 <441651826@qq.com> * fix input_ids Signed-off-by:
Ceng23333 <441651826@qq.com> * explicitly split mmul all subject Signed-off-by:
Ceng23333 <441651826@qq.com> --------- Signed-off-by:
Ceng23333 <441651826@qq.com>
-
- 17 Dec, 2025 1 commit
-
-
Jiacheng Huang authored
-
- 11 Dec, 2025 3 commits
-
-
pengcheng888 authored
issue/115 完善bench.py文件
-
pengcheng888 authored
-
thatPepe authored
Issue/121 - cache managements
-
- 10 Dec, 2025 2 commits
-
-
PanZezhong1725 authored
issue/116 - using batched rope
-
pengcheng888 authored
issue/115 - 添加 bench.py
-