Add train loss

8a7ff08b · chenych · dc061ea0 · 8a7ff08b · 8a7ff08b · 8a7ff08b
Commit 8a7ff08b authored Jun 16, 2025 by chenych
4 changed files
--- a/README.md
+++ b/README.md
@@ -84,13 +84,32 @@ SFT训练脚本示例，参考`llama-factory/train_lora`下对应yaml文件。
 参数解释同[#全参微调](#全参微调)

 ## 推理
-暂无
+### vllm 推理
+#### offline
+需要先修改vllm代码：如图
+<div align=center>
+    <img src="./doc/img_v3_02n6_c32e8a36-8338-4dff-9aec-d710746a61fg.jpg"/>
+</div>
+修改完成后，执行下面的方法。
+```bash
+python infer_vllm.py --model_name_or_path /path_of/model
+```

 ## result
-暂无
+Prompt: "Give me 5 non-formal ways to say 'See you later' in French."
+<div align=center>
+    <img src="./doc/results.png"/>
+</div>

 ### 精度
-DCU与GPU精度一致，推理框架：pytorch。
+训练框架：[Llama-Factory](https://developer.sourcefind.cn/codes/OpenDAS/llama-factory)
+
+训练脚本：[mistral_small_lora_sft.yaml](llama-factory/train_lora/mistral_small_lora_sft.yaml)
+
+|   device   |   train_loss   |
+|:----------:|:-------:|
+| DCU K100AI | 0.7417 |
+|  GPU A800  | 0.7424 |

 ## 应用场景
 ### 算法类别
@@ -108,4 +127,3 @@ DCU与GPU精度一致，推理框架：pytorch。

 ## 参考资料
 - https://mistral.ai/news/mistral-small-3-1
- https://github.com/hiyouga/LLaMA-Factory/
--- a/doc/results.png
+++ b/doc/results.png
--- a/llama-factory/train_full/mistral_small_full_sft.yaml
+++ b/llama-factory/train_full/mistral_small_full_sft.yaml
@@ -23,6 +23,7 @@ save_steps: 500
 plot_loss: true
 overwrite_output_dir: true
 save_only_model: false
+report_to: none  # choices: [none, wandb, tensorboard, swanlab, mlflow]

 ### train
 per_device_train_batch_size: 1

--- a/llama-factory/train_lora/mistral_small_lora_sft.yaml
+++ b/llama-factory/train_lora/mistral_small_lora_sft.yaml
@@ -26,6 +26,7 @@ save_steps: 500
 plot_loss: true
 overwrite_output_dir: true
 save_only_model: false
+report_to: none  # choices: [none, wandb, tensorboard, swanlab, mlflow]

 ### train
 per_device_train_batch_size: 1