"...git@developer.sourcefind.cn:chenpangpang/transformers.git" did not exist on "2d184cb553ee20943b03b253f44300e466357871"
feat(model parallelism): moving the labels to the same device as the logits...
feat(model parallelism): moving the labels to the same device as the logits for gpt2 and bart (#22591)
Showing
Please register or sign in to comment