1. 12 Aug, 2019 5 commits
  2. 10 Aug, 2019 3 commits
  3. 09 Aug, 2019 3 commits
  4. 08 Aug, 2019 3 commits
    • Hafiz Shafruddin's avatar
      replace 'mkdir' with 'mkdir -p' (#997) · 6398aa9e
      Hafiz Shafruddin authored
      Summary:
      Allow shell script to create sub directories with -p flag. Amends readme file too.
      Pull Request resolved: https://github.com/pytorch/fairseq/pull/997
      
      Differential Revision: D16710813
      
      Pulled By: myleott
      
      fbshipit-source-id: 89abefa27e8fac99d212fc9b7b0dbc3690043ba0
      6398aa9e
    • Myle Ott's avatar
      Integrate with Apache Arrow/Plasma in-memory store for large datasets (#995) · 439ead5a
      Myle Ott authored
      Summary:
      Datasets with many examples can generate very large indexes in TokenBlockDataset (and possibly elsewhere). When using `--num-workers>0` these indexes are pickled and transferred via a multiprocessing pipe, which is slow and can fail if the index grows beyond 4GB (~0.5B examples). Apache Arrow has an in-memory store called Plasma that will offload these arrays to shared memory, which both reduces duplication of the data and avoids needing to pickle.
      Pull Request resolved: https://github.com/pytorch/fairseq/pull/995
      
      Differential Revision: D16697219
      
      Pulled By: myleott
      
      fbshipit-source-id: 1b679ee5b3d2726af54ff418f6159a3671173fb8
      439ead5a
    • Dmytro Okhonko's avatar
      Asr initial push (#810) · 72f9364c
      Dmytro Okhonko authored
      Summary:
      Initial code for speech recognition task.
      Right now only one ASR model added - https://arxiv.org/abs/1904.11660
      
      unit test testing:
      python -m unittest discover tests
      
      also run model training with this code and obtained
      5.0 test_clean | 13.4 test_other
      on librispeech with pytorch/audio features
      Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/810
      
      Reviewed By: cpuhrsch
      
      Differential Revision: D16706659
      
      Pulled By: okhonko
      
      fbshipit-source-id: 89a5f9883e50bc0e548234287aa0ea73f7402514
      72f9364c
  5. 07 Aug, 2019 4 commits
  6. 06 Aug, 2019 1 commit
  7. 05 Aug, 2019 1 commit
  8. 04 Aug, 2019 1 commit
  9. 03 Aug, 2019 2 commits
  10. 02 Aug, 2019 5 commits
    • Ning Dong's avatar
      Avoid cast in PositionalEmbeddings to fix BLEU drop in pytorch native export · 9012e87d
      Ning Dong authored
      Summary:
      Tracing mode doesn't generalize correctly in positional embedding calculation, which caused -5 BLEU at transformer export when using pytorch native.
      
      Details: The original issue was that in ensemble_export, _to_tensor(x) in scripting mode turns integer x into 1-d tensor torch.tensor([x]), not 0-d tensor (scalar x) which is expected in the embedding. So the return value in embedding forward() is actually of wrong shape. When self.weights is of size [x,y], the return value should be (bsz, y, 1) but it was (bsz, 1, y), which caused problem in downstream computation. Tracing only becomes an issue when I used pos = timestep.view(-1)[0] to fix the shape. Then casting the scalar to primary int, to be used as index is not generalizable by tracing mode. Thus I need to convert everything to tensor and replace the advanced indexing with index_select operator.
      
      In summary, less understood features in both scripting&tracing sides caused the bleu drop. :)
      
      Reviewed By: myleott
      
      Differential Revision: D16623025
      
      fbshipit-source-id: 0c7a2c3eafbd774760a5c880c6034009ee084abb
      9012e87d
    • Myle Ott's avatar
      Fewer torch.hub requirements (#959) · 3903f469
      Myle Ott authored
      Summary:
      We will raise exceptions if these are needed and aren't available. Only keep minimum set of reqs
      Pull Request resolved: https://github.com/pytorch/fairseq/pull/959
      
      Differential Revision: D16623304
      
      Pulled By: myleott
      
      fbshipit-source-id: 8e65253742e393b527e8396a9433e64ebec9bb55
      3903f469
    • Myle Ott's avatar
      Add single-models for WMT'19 for hub tutorial · f02f70cc
      Myle Ott authored
      Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/800
      
      Differential Revision: D16621509
      
      Pulled By: myleott
      
      fbshipit-source-id: d3e8e97d30bcafbc35c3f67cd8bbc657b6fa5fe7
      f02f70cc
    • Myle Ott's avatar
      Update READMEs for torch.hub · abb7ed4c
      Myle Ott authored
      Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/795
      
      Differential Revision: D16620488
      
      Pulled By: myleott
      
      fbshipit-source-id: 1998a9ccd8816fc7f590861fb4898f910a36bc1e
      abb7ed4c
    • Myle Ott's avatar
      Update beam search code to support torch.bool change · 5f342527
      Myle Ott authored
      Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/797
      
      Differential Revision: D16617067
      
      Pulled By: myleott
      
      fbshipit-source-id: 52e3aeb98d6e3b55ff9154b784028bf13eabfe38
      5f342527
  11. 01 Aug, 2019 7 commits
  12. 31 Jul, 2019 5 commits