Commits · 0563d879245d3c6bb04f2302782c935794635924 · OpenDAS / Fairseq

12 Aug, 2019 5 commits

ignore files starting with . e.g. .ipynb_checkpoints (#819) · 0563d879

Ilia Kulikov authored Aug 12, 2019

Summary:
A `.ipynb_checkpoints` folder inside a models folder crashed the importlib-based import.
Entries whose names start with `.` are now skipped.
Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/819

Differential Revision: D16772192

Pulled By: myleott

fbshipit-source-id: 01c956aef4ed312bc7645c31c83dbf98af89d931
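
The check described in this commit can be sketched roughly as follows. This is a minimal illustration, not the actual fairseq code: the function name `importable_names` is hypothetical, and skipping entries that start with `_` is an assumption made here for the sketch.

```python
import os


def importable_names(directory):
    """Return sorted module/package names in `directory` that look importable."""
    names = []
    for entry in sorted(os.listdir(directory)):
        # Skip hidden entries such as .ipynb_checkpoints, and (by assumption)
        # private entries such as __init__.py or __pycache__.
        if entry.startswith(".") or entry.startswith("_"):
            continue
        path = os.path.join(directory, entry)
        if entry.endswith(".py"):
            # Strip the extension to get the module name.
            names.append(entry[: -len(".py")])
        elif os.path.isdir(path):
            # A plain directory is treated as a package.
            names.append(entry)
    return names
```

Each returned name could then be passed to `importlib.import_module` together with the appropriate package prefix.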


Minor fixes for RACE finetuning (#818) · d0036640

Myle Ott authored Aug 12, 2019

Summary:
- remove unnecessary extra spaces in RACE data in preprocessing
- fix finetuning instructions (add `--truncate-sequence` and add `--dropout` params)
- close file handle in SentenceRankingTask
Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/818

Differential Revision: D16770055

Pulled By: myleott

fbshipit-source-id: 2c80084e92cdf8692f2ea7e43f7c344c402b9e61


Lint · 2b68e91f

Myle Ott authored Aug 12, 2019

Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/817

Differential Revision: D16762905

Pulled By: myleott

fbshipit-source-id: d920595bec44ed26b72dfc6fbc15c0aa107b4e56


Remove LAMB optimizer (at least until we can test it more) · 969f4474

Myle Ott authored Aug 12, 2019

Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/1008

Differential Revision: D16763315

Pulled By: myleott

fbshipit-source-id: d4bad8384eec273f2d5de4ed29fb8d158ab9187c


Update --restore-file logic (partially fixes #999) · 3bbdc554

Myle Ott authored Aug 12, 2019

Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/1007

Differential Revision: D16762490

Pulled By: myleott

fbshipit-source-id: d67137bcf581887850323d188bb4ea643a35ac9e


10 Aug, 2019 3 commits

Fix torch.hub for MNLI · c0a5d29e

Myle Ott authored Aug 10, 2019

Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/1006

Differential Revision: D16753078

Pulled By: myleott

fbshipit-source-id: 970055632edffcce4e75931ed93b42a249120a4a


Add WSC task and criterion · 83249196

Myle Ott authored Aug 10, 2019

Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/1004

Differential Revision: D16751443

Pulled By: myleott

fbshipit-source-id: f70acd6c7be6d69da45b5b32fe4c4eff021539ab


Fix Python 3.5 compat · a00ce132

Myle Ott authored Aug 09, 2019

Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/1005

Differential Revision: D16751489

Pulled By: myleott

fbshipit-source-id: 6e372ac23643e32a3791044c13f4466bdc28f049


09 Aug, 2019 3 commits

added sentence ranking task and loss (#809) · b6c55b62

Jingfei Du authored Aug 09, 2019

Summary:
This task and loss are used for sentence ranking and multiple choice tasks such as RACE
Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/809

Reviewed By: myleott

Differential Revision: D16715745

Pulled By: jingfeidu

fbshipit-source-id: cb4d1c7b26ebb3e2382449ba51af5745ef56f30f


MacOS requires c++ flag (#1000) · 838e108a

Vincent Quenneville-Belair authored Aug 09, 2019

Summary:
To install on MacOS, `-stdlib=libc++` needs to be specified.
Pull Request resolved: https://github.com/pytorch/fairseq/pull/1000

Differential Revision: D16733819

Pulled By: myleott

fbshipit-source-id: 7a1ed11e2b4e1071e61c64c379c84f72e02ad2b5


added superglue dev set results to readme · 3563e59a

Naman Goyal authored Aug 09, 2019

Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/815

Differential Revision: D16733633

fbshipit-source-id: 0a5029e41b6dbb9fb28e9703ad057d939d489d90


08 Aug, 2019 3 commits

replace 'mkdir' with 'mkdir -p' (#997) · 6398aa9e

Hafiz Shafruddin authored Aug 08, 2019

Summary:
Allow the shell script to create subdirectories by using the `-p` flag. Also amends the README.
Pull Request resolved: https://github.com/pytorch/fairseq/pull/997

Differential Revision: D16710813

Pulled By: myleott

fbshipit-source-id: 89abefa27e8fac99d212fc9b7b0dbc3690043ba0
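
For comparison, the behaviour of `mkdir -p` (create missing parent directories, do not fail if the directory already exists) has a close analogue in Python's standard library. This is only an illustrative sketch; the helper name `ensure_dir` is hypothetical and is not part of the commit.

```python
import os


def ensure_dir(path):
    """Create `path` and any missing parents; do nothing if it already exists."""
    os.makedirs(path, exist_ok=True)
    return path
```

Calling the helper twice on the same nested path is safe, which is the property the shell script gains from the `-p` flag.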


Integrate with Apache Arrow/Plasma in-memory store for large datasets (#995) · 439ead5a

Myle Ott authored Aug 08, 2019

Summary:
Datasets with many examples can generate very large indexes in TokenBlockDataset (and possibly elsewhere). When using `--num-workers>0` these indexes are pickled and transferred via a multiprocessing pipe, which is slow and can fail if the index grows beyond 4GB (~0.5B examples). Apache Arrow has an in-memory store called Plasma that will offload these arrays to shared memory, which both reduces duplication of the data and avoids needing to pickle.
Pull Request resolved: https://github.com/pytorch/fairseq/pull/995

Differential Revision: D16697219

Pulled By: myleott

fbshipit-source-id: 1b679ee5b3d2726af54ff418f6159a3671173fb8
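
The general idea of placing a large index in shared memory and passing only a small handle between processes, rather than pickling the whole array, can be sketched with Python's standard library. Note that this swaps in `multiprocessing.shared_memory` (Python 3.8+) purely for illustration; it is not Plasma and not the code from this commit, and the helper names `put_index` and `get_index` are hypothetical.

```python
from array import array
from multiprocessing import shared_memory


def put_index(values):
    """Copy integers into a new shared-memory block; return (block, count)."""
    data = array("q", values)  # 'q' = signed 64-bit integers
    nbytes = len(data) * data.itemsize
    # A shared-memory block must have a positive size.
    shm = shared_memory.SharedMemory(create=True, size=max(1, nbytes))
    shm.buf[:nbytes] = data.tobytes()
    return shm, len(data)


def get_index(name, count):
    """Attach to an existing block by name and read `count` integers from it."""
    shm = shared_memory.SharedMemory(name=name)
    try:
        out = array("q")
        # The block may be larger than requested, so slice by the item count.
        out.frombytes(bytes(shm.buf[: count * out.itemsize]))
        return out.tolist()
    finally:
        shm.close()
```

Only the block name and the item count need to cross the process boundary. For simplicity the reader above copies the data out; a real consumer would more likely view the buffer in place to avoid duplication. The creator remains responsible for calling `close()` and `unlink()` on the block when it is no longer needed.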


Asr initial push (#810) · 72f9364c

Dmytro Okhonko authored Aug 08, 2019

Summary:
Initial code for the speech recognition task.
Only one ASR model has been added so far: https://arxiv.org/abs/1904.11660

Unit tests:
python -m unittest discover tests

Also ran model training with this code and obtained
5.0 test_clean | 13.4 test_other
on LibriSpeech with pytorch/audio features.
Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/810

Reviewed By: cpuhrsch

Differential Revision: D16706659

Pulled By: okhonko

fbshipit-source-id: 89a5f9883e50bc0e548234287aa0ea73f7402514


07 Aug, 2019 4 commits

fixed reloading from checkpoint (#811) · 9a1038f6

Naman Goyal authored Aug 07, 2019

Summary:
Tested by starting training from (a) `roberta.large`, (b) `roberta.large.mnli`, (c) `checkpoints/checkpoint_last.pt`
Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/811

Reviewed By: myleott

Differential Revision: D16689528

Pulled By: myleott

fbshipit-source-id: 849d72ede9d526c34b4753c1bffd689554d1f837


Added mask_fill api and some examples in README (#807) · a9eda736

Naman Goyal authored Aug 07, 2019

Summary:
1) This currently works only for a single `<mask>` token; for multiple masks, we might have to look more into the order of factorization.
2) This currently works only for a single BPE token.
Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/807

Differential Revision: D16674509

fbshipit-source-id: 0a020030ee5df6a5115e5f85d5a9ef52b1ad9e1c


Fix tests and GLUE finetuning (fixes #989) · 1e55bbdb

Myle Ott authored Aug 07, 2019

Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/991

Differential Revision: D16687970

Pulled By: myleott

fbshipit-source-id: d877fc16891a8ab97aec47a8d440baa56c2b5f46


Add code to realign RoBERTa features to word-level tokenizers · 2b7843da

Myle Ott authored Aug 07, 2019

Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/805

Differential Revision: D16670825

Pulled By: myleott

fbshipit-source-id: 872a1a0274681a34d54bda00bfcfcda2e94144c6


06 Aug, 2019 1 commit

Add back set_epoch functionality lost in RoBERTa merge · e40e4b21

Myle Ott authored Aug 06, 2019

Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/982

Differential Revision: D16668353

Pulled By: myleott

fbshipit-source-id: 699243d6c028c47cd0e3f801d89051b3f919b17e


05 Aug, 2019 1 commit

fixed roberta finetuning with --find-unused-parameters on multiGPU · 5d543f9b

Naman Goyal authored Aug 05, 2019

Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/806

Differential Revision: D16649933

fbshipit-source-id: 6eeda6e2caf8019228e3efc0c27ddfcc3c4d8674


04 Aug, 2019 1 commit

Add doc string for Roberta.encode function · 1684e166

Myle Ott authored Aug 04, 2019

Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/969

Differential Revision: D16642388

Pulled By: myleott

fbshipit-source-id: c5b1655dbddb697822feefa433f33f6bb08253ab


03 Aug, 2019 2 commits

remove default params from args so architecture works properly · c728b864

alexeib authored Aug 03, 2019

Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/798

Reviewed By: myleott

Differential Revision: D16619502

Pulled By: alexeib

fbshipit-source-id: af20c90c4522458850d8f42cab001259ef4293cc


Fix generating with a fixed prefix · 12258e57

Myle Ott authored Aug 03, 2019

Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/801

Differential Revision: D16628318

Pulled By: myleott

fbshipit-source-id: 50e93bb9108afd2ba90f1edd4f34306a7c9964a4


02 Aug, 2019 5 commits

Avoid cast in PositionalEmbeddings to fix BLEU drop in pytorch native export · 9012e87d

Ning Dong authored Aug 02, 2019

Summary:
Tracing mode doesn't generalize correctly in the positional embedding calculation, which caused a 5 BLEU drop at transformer export when using PyTorch native export.

Details: The original issue was that in ensemble_export, _to_tensor(x) in scripting mode turns an integer x into a 1-d tensor torch.tensor([x]), not the 0-d tensor (scalar x) that the embedding expects. So the return value of the embedding's forward() had the wrong shape: when self.weights is of size [x,y], the return value should be (bsz, y, 1) but it was (bsz, 1, y), which caused problems in downstream computation. Tracing only became an issue when I used pos = timestep.view(-1)[0] to fix the shape. Casting the scalar to a primitive int, to be used as an index, is then not generalizable in tracing mode. Thus I needed to convert everything to tensors and replace the advanced indexing with the index_select operator.

In summary, less well understood features on both the scripting and tracing sides caused the BLEU drop. :)

Reviewed By: myleott

Differential Revision: D16623025

fbshipit-source-id: 0c7a2c3eafbd774760a5c880c6034009ee084abb


Fewer torch.hub requirements (#959) · 3903f469

Myle Ott authored Aug 02, 2019

Summary:
We will raise exceptions if these are needed and aren't available. Only keep the minimum set of requirements.
Pull Request resolved: https://github.com/pytorch/fairseq/pull/959

Differential Revision: D16623304

Pulled By: myleott

fbshipit-source-id: 8e65253742e393b527e8396a9433e64ebec9bb55


Add single-models for WMT'19 for hub tutorial · f02f70cc

Myle Ott authored Aug 02, 2019

Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/800

Differential Revision: D16621509

Pulled By: myleott

fbshipit-source-id: d3e8e97d30bcafbc35c3f67cd8bbc657b6fa5fe7


Update READMEs for torch.hub · abb7ed4c

Myle Ott authored Aug 02, 2019

Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/795

Differential Revision: D16620488

Pulled By: myleott

fbshipit-source-id: 1998a9ccd8816fc7f590861fb4898f910a36bc1e


Update beam search code to support torch.bool change · 5f342527

Myle Ott authored Aug 02, 2019

Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/797

Differential Revision: D16617067

Pulled By: myleott

fbshipit-source-id: 52e3aeb98d6e3b55ff9154b784028bf13eabfe38


01 Aug, 2019 7 commits

Fix wmt19 links (#796) · ccb5dea5

Nathan Ng authored Aug 01, 2019

Summary:
fix links to .tar.gz vs .tar.bz2
Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/796

Reviewed By: myleott

Differential Revision: D16611740

Pulled By: nng555

fbshipit-source-id: 76210484225ed917ff14ef626845680d918948f5


Use ==/!= to compare str, bytes, and int literals (#948) · ea6cc1da

Christian Clauss authored Aug 01, 2019

Summary:
Identity is not the same thing as equality in Python.
Pull Request resolved: https://github.com/pytorch/fairseq/pull/948

Differential Revision: D16608269

Pulled By: myleott

fbshipit-source-id: be203d62e7824c96c59400d1b342196adb89a839
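
The distinction this commit relies on can be illustrated with a short sketch. The helper `is_pad` and the `"<pad>"` literal are hypothetical examples chosen here, not code taken from the pull request.

```python
def is_pad(token):
    """Compare by value; `token is "<pad>"` would test object identity instead."""
    return token == "<pad>"


first = [1, 2, 3]
second = [1, 2, 3]

values_equal = first == second  # True: both lists hold the same contents
same_object = first is second   # False: they are two distinct list objects
```

Whether two equal strings or small integers happen to be the same object is an implementation detail of the interpreter, which is why `==` and `!=` are the safe choice for comparing against literals.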


Add more details for bulk BPE encoding · 45f23f66

Myle Ott authored Aug 01, 2019

Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/793

Differential Revision: D16603930

Pulled By: myleott

fbshipit-source-id: b302db3743db4f36c14fb0dc7f3456fe8a0079dd


Changed tensor comparison return type from uint8 to bool (#21113) · 430905d7

Iurii Zdebskyi authored Aug 01, 2019

Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/21113
ghimport-source-id: 9c4ba63457a72bfc41894387e0b01be3fd9a9baf

Test Plan: Imported from OSS

Differential Revision: D15552204

Pulled By: izdeby

fbshipit-source-id: a608213668649d058e22b510d7755cb99e7d0037


Fix sampling with beam>1 · 4abadbdf

Myle Ott authored Aug 01, 2019

Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/792

Differential Revision: D16591987

Pulled By: myleott

fbshipit-source-id: d27c490ae75f80ded19226b8384f4776485dd694


Update PyTorch Hub interface · 5b2be870

Myle Ott authored Aug 01, 2019

Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/782

Differential Revision: D16542256

Pulled By: myleott

fbshipit-source-id: ea3279e7a1ce4687a5914f32b76787c419be1ffa


Fix small syntax error in hub_utils.py (fixes #942) · 3e0e5bec

Myle Ott authored Jul 31, 2019

Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/944

Differential Revision: D16593568

Pulled By: myleott

fbshipit-source-id: 611bccae2ad0b8dc704c47a8a3343161010c2356


31 Jul, 2019 5 commits

Fix citation errors (#791) · 94722a9f

Nathan Ng authored Jul 31, 2019

Summary:
Fixing booktitle in wmt19 citation
Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/791

Reviewed By: myleott

Differential Revision: D16589372

Pulled By: nng555

fbshipit-source-id: 28402784bb6ef0615e46b8d8383bfa52d79e46de


Roberta add classification finetuning example readme (#790) · fe8a1639

ngoyal2707 authored Jul 31, 2019

Summary:
Added a readme for IMDB classification as a tutorial for custom finetuning of RoBERTa
Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/790

Reviewed By: myleott

Differential Revision: D16587877

Pulled By: myleott

fbshipit-source-id: ed265b7254e6fa2fc8a899ba04c0d2bb45a7f5c4


Update language_model README.md (#941) · c5650bfc

Dongjin Na authored Jul 31, 2019

Summary:
Adding a backslash in the convolutional language model training usage.
Pull Request resolved: https://github.com/pytorch/fairseq/pull/941

Differential Revision: D16581388

Pulled By: myleott

fbshipit-source-id: 7e2e05ecf13e86cb844dc5200d49f560c63b12ff


Use commandline interface in preprocess_GLUE_tasks.sh (#937) · 37eb9f2b

Johannes Villmow authored Jul 31, 2019

Summary:
Just a small fix for issue https://github.com/pytorch/fairseq/issues/936
Pull Request resolved: https://github.com/pytorch/fairseq/pull/937

Differential Revision: D16580263

Pulled By: myleott

fbshipit-source-id: 1777e782491c63697726e95bd555892da3fed4ec


Wmt19 models (#767) · b651b000

Nathan Ng authored Jul 31, 2019

Summary:
Release of the WMT 19 pretrained models
Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/767

Reviewed By: edunov

Differential Revision: D16472717

Pulled By: nng555

fbshipit-source-id: acf0fa3548c33f2bf2b5f71e551c782ad8c31a42
