1. 20 Aug, 2019 1 commit
  2. 19 Aug, 2019 6 commits
  3. 17 Aug, 2019 1 commit
  4. 16 Aug, 2019 2 commits
  5. 15 Aug, 2019 5 commits
  6. 14 Aug, 2019 5 commits
  7. 13 Aug, 2019 4 commits
  8. 12 Aug, 2019 5 commits
  9. 10 Aug, 2019 3 commits
  10. 09 Aug, 2019 3 commits
  11. 08 Aug, 2019 3 commits
    • Hafiz Shafruddin's avatar
      replace 'mkdir' with 'mkdir -p' (#997) · 6398aa9e
      Hafiz Shafruddin authored
      Summary:
      Allow shell script to create sub directories with -p flag. Amends readme file too.
      Pull Request resolved: https://github.com/pytorch/fairseq/pull/997
      
      Differential Revision: D16710813
      
      Pulled By: myleott
      
      fbshipit-source-id: 89abefa27e8fac99d212fc9b7b0dbc3690043ba0
      6398aa9e
    • Myle Ott's avatar
      Integrate with Apache Arrow/Plasma in-memory store for large datasets (#995) · 439ead5a
      Myle Ott authored
      Summary:
      Datasets with many examples can generate very large indexes in TokenBlockDataset (and possibly elsewhere). When using `--num-workers>0` these indexes are pickled and transferred via a multiprocessing pipe, which is slow and can fail if the index grows beyond 4GB (~0.5B examples). Apache Arrow has an in-memory store called Plasma that will offload these arrays to shared memory, which both reduces duplication of the data and avoids needing to pickle.
      Pull Request resolved: https://github.com/pytorch/fairseq/pull/995
      
      Differential Revision: D16697219
      
      Pulled By: myleott
      
      fbshipit-source-id: 1b679ee5b3d2726af54ff418f6159a3671173fb8
      439ead5a
    • Dmytro Okhonko's avatar
      Asr initial push (#810) · 72f9364c
      Dmytro Okhonko authored
      Summary:
      Initial code for speech recognition task.
      Right now only one ASR model added - https://arxiv.org/abs/1904.11660
      
      unit test testing:
      python -m unittest discover tests
      
      also run model training with this code and obtained
      5.0 test_clean | 13.4 test_other
      on librispeech with pytorch/audio features
      Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/810
      
      Reviewed By: cpuhrsch
      
      Differential Revision: D16706659
      
      Pulled By: okhonko
      
      fbshipit-source-id: 89a5f9883e50bc0e548234287aa0ea73f7402514
      72f9364c
  12. 07 Aug, 2019 2 commits