- 28 Aug, 2024 4 commits
-
-
UnicornChan authored
[feature] release 0.1.3
-
chenxl authored
-
UnicornChan authored
Update README.md
-
_HYX_ authored
Set the reminder to set CUDA_HOME and CUDA_PATH in the README to the "Quick Start" section under "Install CUDA".
-
- 22 Aug, 2024 4 commits
-
-
UnicornChan authored
Fix: None for load config
-
chenxl authored
-
UnicornChan authored
[fix] f16 dequantize device ignored
-
molamooo authored
-
- 21 Aug, 2024 2 commits
-
-
UnicornChan authored
[fix] Fix bugs about static cache and server param;
-
TangJingqi authored
-
- 16 Aug, 2024 6 commits
-
-
Azure authored
Update README
-
TangJingqi authored
-
Azure authored
-
Azure authored
[fix] fix broken link in tutorial.
-
TangJingqi authored
-
TangJingqi authored
-
- 15 Aug, 2024 8 commits
-
-
Azure authored
-
Azure authored
-
UnicornChan authored
Release v0.1.2
-
UnicornChan authored
[update] Update readme; Add tutorial
-
TangJingqi authored
-
UnicornChan authored
[fix] format classes and files name
-
TangJingqi authored
-
TangJingqi authored
-
- 14 Aug, 2024 2 commits
- 12 Aug, 2024 6 commits
-
-
UnicornChan authored
[feature] support q2_k & q3_k dequantize on gpu
-
BITcyman authored
-
chenxl authored
-
UnicornChan authored
Update task_queue.h
-
Atream authored
-
chenxl authored
-
- 09 Aug, 2024 3 commits
-
-
UnicornChan authored
[Feature] towards 0.1.2
-
UnicornChan authored
[fix] linux and windows can all find CPUInfer in current Directory
-
chenxl authored
-
- 08 Aug, 2024 5 commits
-
-
chenht2022 authored
1) Linear and MLP operators support qlen>1; 2) All operators now share a single memory buffer; 3) Refactor CPUInfer submit/sync logic.
-
Atream authored
-
UnicornChan authored
Windows Support
-
Atream authored
-
chenxl authored
-