Commit dfc186ee authored by yuguo's avatar yuguo
Browse files

perf

parent c8e0c5a3
......@@ -50,17 +50,17 @@ train.dist.pipeline_parallel_size = 1
cd libai
bash tools/train.sh tools/train_net.py configs/bert_large_pretrain.py 4
### 性能和收敛性
### 精度
训练数据:[https://oneflow-static.oss-cn-beijing.aliyuncs.com/ci-files/dataset/libai/gpt_dataset](链接)
使用的GPGPU:4张DCU-Z100-16G。
模型性能及收敛性
模型精度
| 卡数 | 分布式工具 | 性能 | 收敛性 |
| :--: | :--------: | :--------------: | :----------------------------------------------------------: |
| 4 | Libai-main | 161.23 samples/s | total_loss: 6.555 lm_loss: 5.973 sop_loss: 0.583/10000 iters |
| 卡数 | 分布式工具 | 收敛性 |
| :--: | :--------: | :----------------------------------------------------------: |
| 4 | Libai-main | total_loss: 6.555 lm_loss: 5.973 sop_loss: 0.583/10000 iters |
## 参考
* https://libai.readthedocs.io/en/latest/tutorials/get_started/quick_run.html
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment