"...git@developer.sourcefind.cn:chenpangpang/transformers.git" did not exist on "b11386e158e86e62d4041eabd86d044cd1695737"
load and prepare CNN/Daily Mail data
We write a function to load an preprocess the CNN/Daily Mail dataset as provided by Li Dong et al. The issue is that this dataset has already been tokenized by the authors, so we actually need to find the original, plain-text dataset if we want to apply it to all models.
Showing
Please register or sign in to comment