feat(model parallelism): moving the labels to the same device as the logits for gpt2 and bart (#22591)
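To illustrate the idea named in the commit title, here is a minimal sketch assuming PyTorch. The helper `causal_lm_loss` and the tensor shapes are hypothetical and are not taken from the commit itself. When a model is split across devices, the logits may end up on a different device than the labels, so the labels are moved to the logits' device before the loss is computed.

```python
import torch
import torch.nn.functional as F


def causal_lm_loss(logits: torch.Tensor, labels: torch.Tensor) -> torch.Tensor:
    """Hypothetical helper: next-token cross-entropy with device alignment."""
    # Under model parallelism the logits may live on the last device while the
    # labels are still on the input device; align them before the loss.
    labels = labels.to(logits.device)
    # Shift so that the prediction at position t is scored against token t + 1.
    shift_logits = logits[..., :-1, :].contiguous()
    shift_labels = labels[..., 1:].contiguous()
    return F.cross_entropy(
        shift_logits.view(-1, shift_logits.size(-1)),
        shift_labels.view(-1),
    )


# Toy inputs (assumed shapes): batch of 2, sequence length 5, vocabulary of 11.
logits = torch.randn(2, 5, 11)
labels = torch.randint(0, 11, (2, 5))
loss = causal_lm_loss(logits, labels)
```

On a single device the `.to(...)` call is a no-op, so the same code path should work with or without model parallelism.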