Merge pull request #562 from apappu97/roc_stories_lmlabels_fix

Small fix to remove shifting of lm labels during pre process of RocStories.

Merge pull request #562 from apappu97/roc_stories_lmlabels_fix
Small fix to remove shifting of lm labels during pre process of RocStories.
3ae8c8be · Thomas Wolf · GitHub · e8952017 · 365fb34c · 3ae8c8be
Unverified Commit 3ae8c8be authored May 01, 2019 by Thomas Wolf Committed by GitHub May 01, 2019
Hide whitespace changes
Inline Side-by-side

Showing with 2 additions and 2 deletions

examples/run_openai_gpt.py examples/run_openai_gpt.py +2 -2

No files found.
--- a/examples/run_openai_gpt.py
+++ b/examples/run_openai_gpt.py
@@ -83,8 +83,8 @@ def pre_process_datasets(encoded_datasets, input_len, cap_length, start_token, d
            input_ids[i, 1, :len(with_cont2)] = with_cont2
            mc_token_ids[i, 0] = len(with_cont1) - 1
            mc_token_ids[i, 1] = len(with_cont2) - 1
-            lm_labels[i, 0, :len(with_cont1)-1] = with_cont1[1:]
-            lm_labels[i, 1, :len(with_cont2)-1] = with_cont2[1:]
+            lm_labels[i, 0, :len(with_cont1)] = with_cont1
+            lm_labels[i, 1, :len(with_cont2)] = with_cont2
            mc_labels[i] = mc_label
        all_inputs = (input_ids, mc_token_ids, lm_labels, mc_labels)
        tensor_datasets.append(tuple(torch.tensor(t) for t in all_inputs))