Unverified Commit 37afce3b authored by Jonathan Tow's avatar Jonathan Tow Committed by GitHub
Browse files

Merge pull request #32 from bigscience-workshop/kkawamu1/gem_xsum

Remove stopping_criteria and set max_generation_length to 64
parents f4600a9f 08358031
""" """
Dont Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization
https://arxiv.org/pdf/1808.08745.pdf https://arxiv.org/pdf/1808.08745.pdf
The dataset is for the task of abstractive summarization in its extreme form, its about summarizing a document in a single sentence. It introduces extreme summarization, a new single-document summarization task which does not favor extractive strategies and calls for an abstractive modeling approach. The idea is to create a short, one-sentence news summary answering the question "What is the article about?". The dataset is for the task of abstractive summarization in its extreme form, its about summarizing a document in a single sentence. It introduces extreme summarization, a new single-document summarization task which does not favor extractive strategies and calls for an abstractive modeling approach. The idea is to create a short, one-sentence news summary answering the question "What is the article about?".
...@@ -59,6 +60,9 @@ class GEMXSUMBase(PromptSourceTask): ...@@ -59,6 +60,9 @@ class GEMXSUMBase(PromptSourceTask):
if self.has_test_docs(): if self.has_test_docs():
return self.dataset["test"] return self.dataset["test"]
def max_generation_length(self):
return 64
class GEMXSUM(GEMXSUMBase): class GEMXSUM(GEMXSUMBase):
'''this is for train/validation/test''' '''this is for train/validation/test'''
SPLIT = '' SPLIT = ''
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment