- 24 Feb, 2025 1 commit
-
-
Azure authored
-
- 22 Feb, 2025 1 commit
-
-
Azure authored
Add fp8 linear kernel; add empty cache to fit in 16G VRAM; by 'wkGCaSS - Zhihu https://zhuanlan.zhihu.com/p/25491611225'
-
- 19 Feb, 2025 9 commits
-
-
Atream authored
Necessary tips for Node.js related issues
-
Zhoneym authored
-
Atream authored
Add notes to DeepSeek-R1 tutorial documentation
-
Atream authored
clean PR code and disable flashinfer
-
Atream authored
-
Atream authored
fix server and add prefix cache for server
-
Azure authored
Correct inaccurate content and add missing content in `install.md`
-
Zhoneym authored
-
Zhoneym authored
-
- 18 Feb, 2025 20 commits
-
-
ceerrep authored
-
ceerrep authored
-
ZiWei Yuan authored
🔖 release v0.2.1.post1
-
liam authored
-
Azure authored
Fix cmake error caused by lack of environment variables in Windows environment
-
ceerrep authored
-
Azure authored
Update README_ZH.md
-
Atream authored
Fix precision mla
-
liam authored
-
Atream authored
Add files via upload
-
Atream authored
-
Azure authored
Fix typo in DeepseekR1_V3_tutorial.md Line 174
-
UnicornChan authored
Fix `GLIBCXX_3.4.30' not found in Anaconda
-
liam authored
-
ceerrep authored
Merge branch 'fix_precision_MLA' of https://github.com/kvcache-ai/ktransformers into server-prefix-cache
-
ceerrep authored
-
Xie Weiyu authored
-
Zhoneym authored
-
ceerrep authored
-
dedfaf authored
-
- 17 Feb, 2025 8 commits
-
-
Xie Weiyu authored
-
Xie Weiyu authored
-
ceerrep authored
-
ceerrep authored
Merge branch 'fix_precision_MLA' of https://github.com/kvcache-ai/ktransformers into server-prefix-cache
-
Atream authored
-
xubo authored
-
hoshinohikari authored
-
ceerrep authored
-
- 16 Feb, 2025 1 commit
-
-
John W. Leimgruber III authored
-