Commit 7a1647e3 authored by chenych's avatar chenych
Browse files

Add train loss

parent bc137ddf
...@@ -109,12 +109,21 @@ curl http://<your-node-url>:8000/v1/chat/completions \ ...@@ -109,12 +109,21 @@ curl http://<your-node-url>:8000/v1/chat/completions \
``` ```
## result ## result
Prompt: "Explain Machine Learning to me in a nutshell."
<div align=center> <div align=center>
<img src="./doc/results.pngcd ../"/> <img src="./doc/results.png"/>
</div> </div>
### 精度 ### 精度
DCU与GPU精度一致,推理框架:pytorch。 训练框架:[Llama-Factory](https://developer.sourcefind.cn/codes/OpenDAS/llama-factory)
训练脚本:[mistral_lora_sft.yaml](llama-factory/train_lora/mistral_lora_sft.yaml)
| device | train_loss |
|:----------:|:-------:|
| DCU K100AI | 0.8139 |
| GPU A800 | 0.497 |
## 应用场景 ## 应用场景
### 算法类别 ### 算法类别
......
...@@ -23,6 +23,7 @@ save_steps: 500 ...@@ -23,6 +23,7 @@ save_steps: 500
plot_loss: true plot_loss: true
overwrite_output_dir: true overwrite_output_dir: true
save_only_model: false save_only_model: false
report_to: none # choices: [none, wandb, tensorboard, swanlab, mlflow]
### train ### train
per_device_train_batch_size: 1 per_device_train_batch_size: 1
......
...@@ -26,6 +26,7 @@ save_steps: 500 ...@@ -26,6 +26,7 @@ save_steps: 500
plot_loss: true plot_loss: true
overwrite_output_dir: true overwrite_output_dir: true
save_only_model: false save_only_model: false
report_to: none # choices: [none, wandb, tensorboard, swanlab, mlflow]
### train ### train
per_device_train_batch_size: 1 per_device_train_batch_size: 1
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment