Commits · c2b4dc805c71f05bd61e47be75a8aa44b76bd38f · OpenDAS / ktransformers

04 Nov, 2024 7 commits
- 🚑️:roll back transformer.py and find that it's multiple chat hsitory have minor accurate error · c2b4dc80
  liam authored Nov 01, 2024
  
  c2b4dc80
- ✨: rm sensitive info in config.yaml, add readme of makefile. support old model_path config · a148da2c
  liam authored Oct 31, 2024
  
  a148da2c
- wjh fix change · 9a2e7057
  anyanqilin authored Oct 30, 2024
  
  9a2e7057
- wjh change · a72dc6ed
  anyanqilin authored Oct 30, 2024
  
  a72dc6ed
- wjh-change · 2d67016d
  anyanqilin authored Oct 29, 2024
  
  2d67016d
- 🚑️: back transformer.py bugs version, and fix typo error in local_chat.py · 7c94df4b
  liam authored Oct 28, 2024
  
  7c94df4b
- ✨: refactor local_chat and fix message slice bug in server · dd1d8667
  liam authored Oct 21, 2024
  
  dd1d8667
09 Oct, 2024 4 commits
- Merge pull request #99 from chenht2022/main · 43fc7f44
  Chen Hongtao authored Oct 09, 2024
```
Adapt Windows
```
  43fc7f44
- Adapt Windows · 14869b55
  chenht2022 authored Oct 09, 2024
  
  14869b55
- Merge pull request #77 from TKONIY/fix-prefill-and-generate · a81a7ffe
  UnicornChan authored Oct 09, 2024
```
Fix: Wrong type of token list returned by prefill_and_generate
```
  a81a7ffe
- Merge pull request #83 from sayap/task-queue-cond-var · b4904537
  Chen Hongtao authored Oct 09, 2024
```
Use cond var to avoid busy loop
```
  b4904537
19 Sep, 2024 1 commit
- Merge pull request #86 from xhedit/main · 43e8848d
  UnicornChan authored Sep 19, 2024
```
typo fix: KMisrtal -> KMistral
```
  43e8848d
15 Sep, 2024 1 commit
- Merge pull request #88 from Azure-Tang/main · 49539ac4
  UnicornChan authored Sep 15, 2024
```
[fix] Fix some gpu dequant function doesn't support multi gpu bug
```
  49539ac4
13 Sep, 2024 2 commits
- update readme · 7953bd9c
  Azure authored Sep 13, 2024
  
  7953bd9c
- fix some dequant function dosen't support multi gpu bug · 3758afb5
  Azure authored Sep 13, 2024
  
  3758afb5
12 Sep, 2024 1 commit
- typo fix: KMisrtal -> KMistral · 234faf79
  xhedit authored Sep 12, 2024
  
  234faf79
11 Sep, 2024 1 commit
- Use cond var to avoid busy loop · 6666d622
  Yap Sok Ann authored Sep 10, 2024
  
  6666d622
06 Sep, 2024 1 commit
- Merge pull request #72 from sayap/dequantize-iq4-xs · 3ed8a043
  UnicornChan authored Sep 06, 2024
```
Support IQ4_XS dequantize
```
  3ed8a043
05 Sep, 2024 1 commit
- Fix: the tokens return by prefill_and_generate · ee72cee0
  yangshen authored Sep 05, 2024
  
  ee72cee0
02 Sep, 2024 3 commits
- Merge pull request #71 from Azure-Tang/main · be81269e
  UnicornChan authored Sep 02, 2024
```
[fix] Fix qlen > chunk_size mask is none error
```
  be81269e
- fix qlen > 1000 mask is none error · c55de02f
  Azure authored Sep 02, 2024
  
  c55de02f
- Support IQ4_XS dequantize · be356c1b
  Yap Sok Ann authored Sep 02, 2024
  
  be356c1b
30 Aug, 2024 3 commits
- Merge pull request #67 from UnicornChan/main · 022b8938
  UnicornChan authored Aug 30, 2024
```
[fix] fix bugs about Qwen2-57B, install requirement, DockerFile
```
  022b8938
- [fix] bugs about Qwen57B, install requirement, Dockerfile · 49cce0c4
  chenxl authored Aug 30, 2024
  
  49cce0c4
- Merge pull request #64 from eltociear/patch-1 · 351698c3
  UnicornChan authored Aug 30, 2024
```
docs: update long_context_introduction.md
```
  351698c3
29 Aug, 2024 10 commits
- [fix] some bugs while package in github action · c80490a9
  chenxl authored Aug 29, 2024
  
  c80490a9
- docs: update long_context_introduction.md · e961adde
  Ikko Eltociear Ashimine authored Aug 30, 2024
```
accuary -> accuracy
```
  e961adde
- Merge pull request #62 from Azure-Tang/main · f536a708
  UnicornChan authored Aug 29, 2024
```
[Fix] Fix problem that ktransformers cannot offload whole layer in cpu
```
  f536a708
- update yaml example; update version idx; update docker file · 8747c099
  TangJingqi authored Aug 29, 2024
  
  8747c099
- Fix cannot offload whole layer in cpu · 6735beb5
  TangJingqi authored Aug 29, 2024
  
  6735beb5
- Merge pull request #60 from sammcj/patch-1 · 35d7aed2
  Atream authored Aug 29, 2024
```
fix(docs): fix broken link
```
  35d7aed2
- fix(docs): fix broken link · 0b57627b
  Sam authored Aug 29, 2024
  
  0b57627b
- Merge pull request #58 from Azure-Tang/main · 1dcb8dae
  Azure authored Aug 29, 2024
```
[fix]  Fix readme datas
```
  1dcb8dae
- update readme · 440d827e
  TangJingqi authored Aug 29, 2024
  
  440d827e
- fix readme; adjust param · abd4214b
  TangJingqi authored Aug 29, 2024
  
  abd4214b
28 Aug, 2024 4 commits
- Merge pull request #57 from UnicornChan/develop-0.1.3 · 233bbb8c
  UnicornChan authored Aug 29, 2024
```
[feature] release 0.1.3
```
  233bbb8c
- [feature] release 0.1.3 · 4d1d561d
  chenxl authored Aug 28, 2024
  
  4d1d561d
- Merge pull request #56 from hyx1999/patch-1 · 67f8b370
  UnicornChan authored Aug 28, 2024
```
Update README.md
```
  67f8b370
- Update README.md · ea1143e5
  _HYX_ authored Aug 28, 2024
```
Set the reminder to set CUDA_HOME and CUDA_PATH in the README to the "Quick Start" section under "Install CUDA".
```
  ea1143e5
22 Aug, 2024 1 commit
- Merge pull request #52 from UnicornChan/fix-bug-load-config · 0f054fe4
  UnicornChan authored Aug 22, 2024
```
Fix: None for load config
```
  0f054fe4