- 28 Apr, 2025 3 commits
- 25 Apr, 2025 3 commits
  - chenht2022 authored
  - wang jiahao authored: Fix loading the default max_new_tokens
  - qiyuxinlin authored
- 24 Apr, 2025 2 commits
- 23 Apr, 2025 3 commits
  - wang jiahao authored: Add check-para
  - Alisehen authored
  - Alisehen authored
- 22 Apr, 2025 9 commits
  - wang jiahao authored: Change test
  - qiyuxinlin authored
  - Alisehen authored
  - wang jiahao authored: Make killing serve also kill sched and engine
  - qiyuxinlin authored
  - wang jiahao authored: Update speed test
  - qiyuxinlin authored
  - wang jiahao authored: Update param
  - qiyuxinlin authored
- 21 Apr, 2025 1 commit
  - qiyuxinlin authored
- 19 Apr, 2025 2 commits
  - wang jiahao authored: Update function call
  - Creeper-MZ authored: Optimize prompts to resolve some Deepseek r1 compatibility issues; fix non-streaming mode
- 18 Apr, 2025 9 commits
  - Atream authored: Fix cmake config error
  - wang jiahao authored: Move KV cache creation to balance_serve
  - qiyuxinlin authored
  - mykg authored: Signed-off-by: onepick <jiajuku12@163.com>
  - Atream authored: Enh: Make Ollama perf data more accurate, consistent with OpenAI's implementation
  - Atream authored: Remove hard-coded max_length
  - Atream authored
  - Jianwei Dong authored: Update llama4 tutorial
  - djw authored
- 17 Apr, 2025 8 commits
  - Creeper-MZ authored
  - Yuhao Tsui authored
  - Creeper-MZ authored
  - ZiWei Yuan authored: Fix some build errors for ROCm
  - mykg authored: Signed-off-by: onepick <jiajuku12@163.com>
  - Yuhao Tsui authored: Change the performance data calculation module from estimation to retrieving values from `raw_usage`
  - wang jiahao authored: Feat: Support non-streaming chat in the Ollama backend
  - wang jiahao authored: Fix the error caused by the client not passing temperature and top_p (leaving them empty)