Commits · 1264f9407b997d97e8a921c658f54ac17d30c21c · OpenDAS / ktransformers

28 Feb, 2025 6 commits
- Merge pull request #732 from KMSorSMS/main · 1264f940
  ZiWei Yuan authored Feb 28, 2025
```
⚡ fox docker build
```
  1264f940
- ⚡ fox docker build · a0e7afa4
  liam authored Feb 28, 2025
  
  a0e7afa4
- Merge pull request #731 from Azure-Tang/update-template · add41512
  Azure authored Feb 28, 2025
```
[fix] Fix template name
```
  add41512
- fix name · bc529699
  Azure authored Feb 28, 2025
  
  bc529699
- Merge pull request #730 from Azure-Tang/update-template · 0439cb36
  Azure authored Feb 28, 2025
```
[UPDATE] Update ZH/EN issue template
```
  0439cb36
- update ZH/EN template · 31b01f5b
  Azure authored Feb 28, 2025
  
  31b01f5b
27 Feb, 2025 16 commits
- Merge pull request #721 from kvcache-ai/fix_temperature · 7a19f3b7
  wang jiahao authored Feb 27, 2025
```
fix temperature
```
  7a19f3b7
- fix temperature · 22df52e9
  qiyuxinlin authored Feb 27, 2025
  
  22df52e9
- Merge pull request #719 from kvcache-ai/fix-use-generation-json · 85e2cc7b
  Atream authored Feb 27, 2025
```
use generation config from json file in official repo
```
  85e2cc7b
- use generation config from json file in official repo · e645d847
  Atream authored Feb 27, 2025
  
  e645d847
- Merge pull request #644 from wtdcode/temperature_top_p_from_request · 5e3c6b4f
  wang jiahao authored Feb 27, 2025
```
Allow temperature and top_p from /v1/chat/completions
```
  5e3c6b4f
- Fix according to upstream changes · b121ca4d
  lazymio authored Feb 27, 2025
  
  b121ca4d
- Merge branch 'main' into temperature_top_p_from_request · 26f7b4af
  wang jiahao authored Feb 27, 2025
  
  26f7b4af
- Merge pull request #717 from kvcache-ai/issue-template · 1f28f75f
  Azure authored Feb 27, 2025
```
Update issue templates
```
  1f28f75f
- Update issue templates · c61805dd
  Azure authored Feb 27, 2025
  
  c61805dd
- Merge pull request #622 from akemimadoka/fix-msvc · 50c69129
  Atream authored Feb 27, 2025
```
Fix missing macro definition for KTRANSFORMERS_USE_CUDA and <chrono> includes on MSVC
```
  50c69129
- Merge pull request #670 from akemimadoka/fix-win · 0422152c
  Atream authored Feb 27, 2025
```
Fix RuntimeError on Windows caused by integer overflow in np.prod
```
  0422152c
- Merge pull request #532 from xv44586/fix-sse-formatting · 798e1d0c
  Atream authored Feb 27, 2025
```
fix: fix SSE formatting
```
  798e1d0c
- Merge pull request #650 from ceerRep/main · f403cde6
  Atream authored Feb 27, 2025
```
feat: basic api key support
```
  f403cde6
- Merge pull request #626 from cyhasuka/main · 1d5d5fae
  Atream authored Feb 27, 2025
```
Feat: Clear cache during weight loading to prevent OOM on GPUs with <=8GB VRAM
```
  1d5d5fae
- Merge branch 'main' into main · 8db6a4d4
  Atream authored Feb 27, 2025
  
  8db6a4d4
- Merge pull request #691 from swu-hyk/ollama_api_chat · 3c8c5805
  wang jiahao authored Feb 27, 2025
```
feat:implementation of chat routing for Ollama
```
  3c8c5805
26 Feb, 2025 13 commits
- Merge pull request #702 from Azure-Tang/update-readme · ca93cf75
  Azure authored Feb 26, 2025
```
[UPDATE] Update documents.
```
  ca93cf75
- Update fp8 doc; Update install.md broken link · c05ebb74
  Azure authored Feb 26, 2025
  
  c05ebb74
- Merge pull request #699 from kvcache-ai/Atream-patch-1 · 3ebe17eb
  Atream authored Feb 26, 2025
```
Update DeepseekR1_V3_tutorial.md
```
  3ebe17eb
- Update DeepseekR1_V3_tutorial.md · 369f4d91
  Atream authored Feb 26, 2025
  
  369f4d91
- Merge pull request #697 from kvcache-ai/fix-yaml · 9650893a
  Atream authored Feb 26, 2025
```
Update DeepSeek-V3-Chat-multi-gpu-marlin.yaml
```
  9650893a
- Update DeepSeek-V3-Chat-multi-gpu-marlin.yaml · 90eb87b3
  Atream authored Feb 26, 2025
  
  90eb87b3
- modify · ec7e912f
  swu-hyk authored Feb 26, 2025
  
  ec7e912f
- implementation of chat routing for Ollama · 68e7df3a
  swu-hyk authored Feb 26, 2025
  
  68e7df3a
- Merge pull request #685 from vproxy-tools/main · 9660b2cc
  Chen Hongtao authored Feb 26, 2025
```
fix numa cpu distribution
```
  9660b2cc
- Merge pull request #684 from KMSorSMS/main · e7ebb263
  ZiWei Yuan authored Feb 26, 2025
```
fix dockerfile in devcontainer and fix expert torch
```
  e7ebb263
- ⚡ fix experts torch · ffb86c66
  liam authored Feb 26, 2025
  
  ffb86c66
- ⚡ fix cd error · de082f14
  liam authored Feb 26, 2025
  
  de082f14
- fix numa cpu distribution · b2bff177
  wkgcass authored Feb 26, 2025
```
The numa node location would be calculated based on the total number
of worker threads.
So we should always use the actual number of threads instead of using a min() op.
```
  b2bff177
25 Feb, 2025 5 commits
- Fix RuntimeError on Windows caused by integer overflow in np.prod · 8817777e
  akemimadoka authored Feb 26, 2025
  
  8817777e
- Merge pull request #668 from KMSorSMS/main · 99f6e421
  Azure authored Feb 26, 2025
```
📝 update benchmark.md
```
  99f6e421
- 📝 update more detail and fix typo · 3ad12751
  liam authored Feb 26, 2025
  
  3ad12751
- Merge pull request #667 from Azure-Tang/update-readme · 31bc9906
  Azure authored Feb 26, 2025
```
[update] Update doc.
```
  31bc9906
- 📝 update benchmark.md · 05339ad0
  liam authored Feb 25, 2025
  
  05339ad0