Commit 96e83506 authored by Ethan Perez, committed by Lysandre Debut

Always use SequentialSampler during evaluation

When evaluating, shouldn't we always use SequentialSampler instead of DistributedSampler? Evaluation only runs on a single GPU no matter what, so if you use DistributedSampler with N GPUs, I think you'll only evaluate on 1/N of the evaluation set. That's at least what I'm finding when I run an older/modified version of this repo.
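To make the 1/N coverage concrete, here is a minimal standalone sketch (not part of the commit; the 100-example toy dataset, world size of 4, and rank 0 are illustrative assumptions) comparing how many examples each sampler yields:

import torch
from torch.utils.data import TensorDataset, SequentialSampler
from torch.utils.data.distributed import DistributedSampler

dataset = TensorDataset(torch.arange(100))  # toy eval set: 100 examples

# DistributedSampler partitions indices across ranks: with 4 replicas,
# rank 0 alone yields only a quarter of the dataset.
dist_sampler = DistributedSampler(dataset, num_replicas=4, rank=0)
print(len(list(dist_sampler)))  # 25 -> only 1/4 of the eval set

# SequentialSampler walks every index in order, which is what a
# single-process evaluation loop needs.
seq_sampler = SequentialSampler(dataset)
print(len(list(seq_sampler)))  # 100 -> the full eval set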
parent 3b48806f
@@ -216,7 +216,7 @@ def evaluate(args, model, tokenizer, prefix=""):
     args.eval_batch_size = args.per_gpu_eval_batch_size * max(1, args.n_gpu)
     # Note that DistributedSampler samples randomly
-    eval_sampler = SequentialSampler(dataset) if args.local_rank == -1 else DistributedSampler(dataset)
+    eval_sampler = SequentialSampler(dataset)
     eval_dataloader = DataLoader(dataset, sampler=eval_sampler, batch_size=args.eval_batch_size)
     # multi-gpu evaluate
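For reference, a self-contained sketch of the evaluation setup after this change (the toy dataset and batch size are stand-ins for the repo's dataset and args.eval_batch_size, not taken from the repo):

import torch
from torch.utils.data import DataLoader, SequentialSampler, TensorDataset

dataset = TensorDataset(torch.arange(10).float())  # stand-in eval set

eval_batch_size = 4  # stands in for args.eval_batch_size
# After this commit the sampler is always sequential, regardless of
# args.local_rank, so every example is evaluated exactly once.
eval_sampler = SequentialSampler(dataset)
eval_dataloader = DataLoader(dataset, sampler=eval_sampler, batch_size=eval_batch_size)

for (batch,) in eval_dataloader:
    print(batch)  # batches cover the full eval set, in order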