Commits · b6dc35fe8a29d0d229faa63d02e813a9d3afc905 · ModelZoo / Qwen_lmdeploy

12 Jul, 2023 1 commit
- [Improve] Add docstrings to pytorch submodule (#93) · b6dc35fe
  WRH authored Jul 12, 2023
```
* add some docstrings.

* update docstring.

fix

* ignore magic methods
```
  b6dc35fe
11 Jul, 2023 3 commits
- set chuk_size=1 andxport tp to config.ini (#94) · 69b6eabe
  lvhan028 authored Jul 11, 2023
  
  69b6eabe
- feat(deploy.py): support w pack qkv (#83) · ac638b37
  tpoisonooo authored Jul 11, 2023
```
* feat(deploy.py): support w pack qkv
```
  ac638b37
- [Fix] Remaining Issues in #19 (#75) · a6ac981d
  WRH authored Jul 11, 2023
```
* previous merged

* add chinese

* support torch<2

* add a docstring

* fix typo

* rename torch submodule

* rename to pytorch

* rename in readme
```
  a6ac981d
06 Jul, 2023 4 commits

[Feature] Add a torch client (#19) · 009075d8

WRH authored Jul 06, 2023

* draft torch client

* deal with space of tokenizer

* support tensor parallel

* fix

* fix

* move folder

* move instruction to readme

* move to torch/

* rename client to chat

* very bad response

* stash

* rename streamer

* support internlm

* change default args

* remove test

* improve instructions

* remove module docstring

* decrease header level of torch model

009075d8

Streaming output (#71) · 74a4f3c9

q.yao authored Jul 06, 2023



* streaming-output

* fix end

* fix profile

* support chinese streaming

* lint

* update chat

* lint

* fix benchmark

---------
Co-authored-by: grimoire <yaoqian@pjlab.org.cn>

74a4f3c9

fix(project): interlm run error (#69) · 22d403f5
tpoisonooo authored Jul 06, 2023

22d403f5
add internlm url (#67) · 7c6edc83
pppppM authored Jul 06, 2023

7c6edc83

05 Jul, 2023 5 commits

update internlm‘s chat template (#54) · 3de27ead

lvhan028 authored Jul 05, 2023

* update internlm model

* update

* update

* update

* update

* update temperature, topk and top_p

* update

* update

* loosen log level

3de27ead

fix(kv_qparams.py): zp use min (#59) · ec53d63f

tpoisonooo authored Jul 05, 2023

* fix(kv_qparams.py): zp use min

* revert(qparams.py): revert format

* fix(kv_qparams.py): update formula

ec53d63f

remove tokenizer_path from chat_example and move it to lmdeploy/turbomind (#55) · 61e8d2c6
q.yao authored Jul 05, 2023

61e8d2c6

[Feature] Stats Quantization Parameters for KV Cache (#45) · 3fff964d

pppppM authored Jul 05, 2023

* add cal qparams

* support offload inference

* add collect funtions (mod,weight)

* stats kv scales

* update init

* add user guide

* fix hints

* fix comments & support turbomind format

* update user guide

* fix slice kv cache error & support pileval dataset (used in llm-awq)

* fix wrong num heads slice

* update default dataset

* fix conflict

* fix hints

* fix hints

* add gitignore

3fff964d

Python ffi (#34) · 4fd6e710

q.yao authored Jul 05, 2023



* wip

* wip

* example finish

* fix include and namespace

* wtf

* install lib

* batchize

* update cmake install

* multithread

* fix comment

* fix

* add mmengine

* bind llamamodel

---------
Co-authored-by: grimoire <yaoqian@pjlab.org.cn>

4fd6e710

04 Jul, 2023 2 commits
- fix model conversion (#51) · 9bbd39b7
  Li Zhang authored Jul 04, 2023
  
  9bbd39b7
- export attn_bias as int type into config (#48) · 0d19a95d
  lvhan028 authored Jul 04, 2023
  
  0d19a95d
03 Jul, 2023 1 commit
- install triton_example and TransformerTritonBackend to runtime and lib respectively (#39) · bb6f8060
  lvhan028 authored Jul 03, 2023
  
  bb6f8060
01 Jul, 2023 1 commit

Change target tritonfastertransformerbackend to trtonturbomindbackend (#36) · 70e6ab26

lvhan028 authored Jul 01, 2023

* change target tritonfastertransformerbackend to tritonturbomindbackend

* install targets to backends/turbomind

* changge model_dir

70e6ab26

30 Jun, 2023 2 commits
- rename serve/fastertransformer to serve/turbomind (#31) · e8ab4ba3
  lvhan028 authored Jun 30, 2023
```
* rename lmdeploy/serve/fastertransformer to lmdeploy/serve/turbomind

* update

* update
```
  e8ab4ba3
- rename llmdeploy to lmdeploy (#30) · 46f4738c
  lvhan028 authored Jun 30, 2023
```
* change llmdeploy to lmdeploy

* update logo

* update readme
```
  46f4738c