"...git@developer.sourcefind.cn:chenpangpang/transformers.git" did not exist on "4b26a61631b8fd30f845cf08ebcc5ed65fe83c9b"
[deepspeed docs] misc additions (#15585)
* [deepspeed docs] round_robin_gradients * training and/or eval/predict loss is * Update docs/source/main_classes/deepspeed.mdx Co-authored-by:Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Showing
Please register or sign in to comment