- 03 Mar, 2025 7 commits
-
-
Yi Pan authored
-
Atream authored
Update __init__.py
-
Atream authored
-
Atream authored
Introduce Testing Jobs for kTransformers Setup on Self-Hosted Runner
-
Jiaqi Liao authored
-
Jiaqi Liao authored
-
Jiaqi Liao authored
-
- 02 Mar, 2025 3 commits
-
-
wang jiahao authored
fix typo for top_p
-
wang jiahao authored
fix ollama api temperature bug
-
1668068727@qq.com authored
-
- 01 Mar, 2025 10 commits
-
-
Wix Woo authored
-
Atream authored
Update: fix use of the deprecated `torch.backends.cuda.sdp_kernel()`.
-
Atream authored
-
Atream authored
Update local_chat.py
-
宁鹏涛 authored
Fix `config.architectures[0] == "DeepseekV2ForCausalLM" or "DeepseekV3ForCausalLM"` always evaluating to true
-
Atream authored
Update DeepseekR1_V3_tutorial.md
-
Atream authored
-
Atream authored
Support chunk prefill. Support 139K context for DeepSeek-R1 within 24G VRAM.
-
Atream authored
-
Atream authored
-
- 28 Feb, 2025 11 commits
-
-
ZiWei Yuan authored
fix cache_lens bug in server and rm test prompt.txt
-
-
liam authored
-
Atream authored
Delete duplicate code
-
liam authored
-
ZiWei Yuan authored
⚡ fix docker build
-
liam authored
-
Azure authored
[fix] Fix template name
-
Azure authored
-
Azure authored
[UPDATE] Update ZH/EN issue template
-
Azure authored
-
- 27 Feb, 2025 9 commits
-
-
Shuaiyi authored
-
wang jiahao authored
fix temperature
-
qiyuxinlin authored
-
Atream authored
use generation config from json file in official repo
-
Atream authored
-
wang jiahao authored
Allow temperature and top_p from /v1/chat/completions
-
lazymio authored
-
wang jiahao authored
-
Azure authored
Update issue templates
-