- 08 Aug, 2025 4 commits
- 31 Jul, 2025 1 commit
-
-
wooway777 authored
-
- 30 Jul, 2025 1 commit
-
-
wooway777 authored
-
- 23 Jul, 2025 3 commits
- 22 Jul, 2025 1 commit
-
-
wooway777 authored
-
- 21 Jul, 2025 1 commit
-
-
wooway777 authored
-
- 18 Jul, 2025 1 commit
-
-
wooway777 authored
-
- 10 Jul, 2025 4 commits
-
-
PanZezhong1725 authored
fix: assign scale_* as class members instead of local variables
-
mxCynic authored
-
PanZezhong1725 authored
fix: fix 9G4B model by adding coefficient to weight tensor when loading
-
mxCynic authored
-
- 09 Jul, 2025 2 commits
-
-
mxCynic authored
-
PanZezhong authored
-
- 07 Jul, 2025 2 commits
-
-
PanZezhong1725 authored
issue/14: Add compatibility with the OpenAI protocol
-
Catheriany authored
-
- 04 Jul, 2025 1 commit
-
-
Catheriany authored
-
- 03 Jul, 2025 1 commit
-
-
PanZezhong1725 authored
issue/12: Fix multi-GPU inference hang on MetaX (沐曦)
-
- 02 Jul, 2025 1 commit
-
-
Catheriany authored
-
- 01 Jul, 2025 1 commit
-
-
PanZezhong authored
-
- 27 Jun, 2025 2 commits
-
-
PanZezhong authored
-
Pan Zezhong authored
-
- 26 Jun, 2025 5 commits
-
-
PanZezhong authored
-
PanZezhong1725 authored
Added memory alignment in the memory pool
-
wooway777 authored
-
wooway777 authored
-
wooway777 authored
-
- 25 Jun, 2025 5 commits
-
-
PanZezhong1725 authored
Fixed a double free bug
-
wooway777 authored
Fixed a bug where allocating a zero-size workspace caused overlapping memory in the memory pool, which then resulted in a double-free error.
-
PanZezhong1725 authored
Support continuous batching of multiple requests and a KV cache pool
-
Pan Zezhong authored
-
Pan Zezhong authored
-
- 24 Jun, 2025 1 commit
-
-
Pan Zezhong authored
-
- 23 Jun, 2025 2 commits
-
-
Pan Zezhong authored
-
Pan Zezhong authored
-
- 20 Jun, 2025 1 commit
-
-
thatPepe authored
* Tensor memory pool - initial commit
* Adjusted workspace management to use the memory pool.
* Re-differentiated create and createAsync in Storage
* Set initial memory pool size to 0
* Separated creatAsync and createFromPool
* Removed the redundant WorkspaceHandle
* Adjusted naming
* Set default mempool size to 0 in declaration
* Adjusted format
-