Commit 59edee96 authored by weishb's avatar weishb
Browse files

新增transformers推理说明

parent 997d1621
...@@ -61,7 +61,7 @@ docker run -it \ ...@@ -61,7 +61,7 @@ docker run -it \
**以`Spark-Chemistry-X1-13B`为例** **以`Spark-Chemistry-X1-13B`为例**
```bash ```bash
cd /path/to/model/Spark-Chemistry-X1-13B #进入模型目录内 cd /path/to/model/Spark-Chemistry-X1-13B #进入模型目录内
python transformers_run.py python transformers_run.py
``` ```
**transformers_run.py参考:** **transformers_run.py参考:**
...@@ -109,6 +109,8 @@ response = tokenizer.decode( ...@@ -109,6 +109,8 @@ response = tokenizer.decode(
print(response) print(response)
``` ```
**如果用transformers推理Spark-Scilit-X1-13B,需要额外修改模型的config.json文件,将"_attn_implementation":"flash_attention_2"改成"_attn_implementation":"eager"**
### vLLM ### vLLM
#### 单机推理 #### 单机推理
```bash ```bash
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment