1. 21 Oct, 2020 19 commits
  2. 20 Oct, 2020 14 commits
  3. 19 Oct, 2020 7 commits
    • Patrick von Platen's avatar
      [EncoderDecoder] Fix Typo (#7915) · c912ba5f
      Patrick von Platen authored
      * fix encoder decoder models
      
      * add .gitignore
      c912ba5f
    • Bram Vanroy's avatar
      Raise error when using AMP on non-CUDA device (#7869) · 55bcd0cb
      Bram Vanroy authored
      * Raise error when using AMP on non-CUDA device
      
      * make style
      
      * make style
      55bcd0cb
    • Patrick von Platen's avatar
      fix t5 training docstring (#7911) · e3d2bee8
      Patrick von Platen authored
      e3d2bee8
    • Ayub Subhaniya's avatar
      `decoder_config` used before intialisation (#7903) · df1ddced
      Ayub Subhaniya authored
      Seeing error when sending `decoder_config` as a parameter while initializing a encoder-decoder model from pretrained. 
      fixed "UnboundLocalError: local variable 'decoder_config' referenced before assignment"
      df1ddced
    • Quentin Lhoest's avatar
      Allow Custom Dataset in RAG Retriever (#7763) · 033f29c6
      Quentin Lhoest authored
      * add CustomHFIndex
      
      * typo in config
      
      * update tests
      
      * add custom dataset example
      
      * clean script
      
      * update test data
      
      * minor in test
      
      * docs
      
      * docs
      
      * style
      
      * fix imports
      
      * allow to pass the indexed dataset directly
      
      * update tests
      
      * use multiset DPR
      
      * address thom and patrick's comments
      
      * style
      
      * update dpr tokenizer
      
      * add output_dir flag in use_own_knowledge_dataset.py
      
      * allow custom datasets in examples/rag/finetune.py
      
      * add test for custom dataset in distributed rag retriever
      033f29c6
    • Julien Rossi's avatar
      Trainer with Iterable Dataset (#7858) · a09fe140
      Julien Rossi authored
      * fix 5990
      
      * accomodate iterable dataset without predefined length
      * set it as 1 use case: provide max_steps, and NO num_epochs
      * Is a merge of master and PR 5995
      
      * fix trainer test under TF
      
      * fix only for torch
      * TF trainer untouched
      * trainer tests are skipped when no torch
      
      * address comments
      
      * fix quality checks
      
      * remove torch.dataset from test_trainer
      
      * unnecessary inheritance
      * RegressionDataset implements all needed methods __len__ and __getitem__
      
      * fix quality checks
      
      * restore RegressionDataset
      
      * was wrongly under is_torch_available()
      a09fe140
    • Weizhen's avatar
      ProphetNet (#7157) · 2422cda0
      Weizhen authored
      
      
      * add new model prophetnet
      
      prophetnet modified
      
      modify codes as suggested v1
      
      add prophetnet test files
      
      * still bugs, because of changed output formats of encoder and decoder
      
      * move prophetnet into the latest version
      
      * clean integration tests
      
      * clean tokenizers
      
      * add xlm config to init
      
      * correct typo in init
      
      * further refactoring
      
      * continue refactor
      
      * save parallel
      
      * add decoder_attention_mask
      
      * fix use_cache vs. past_key_values
      
      * fix common tests
      
      * change decoder output logits
      
      * fix xlm tests
      
      * make common tests pass
      
      * change model architecture
      
      * add tokenizer tests
      
      * finalize model structure
      
      * no weight mapping
      
      * correct n-gram stream attention mask as discussed with qweizhen
      
      * remove unused import
      
      * fix index.rst
      
      * fix tests
      
      * delete unnecessary code
      
      * add fast integration test
      
      * rename weights
      
      * final weight remapping
      
      * save intermediate
      
      * Descriptions for Prophetnet Config File
      
      * finish all models
      
      * finish new model outputs
      
      * delete unnecessary files
      
      * refactor encoder layer
      
      * add dummy docs
      
      * code quality
      
      * fix tests
      
      * add model pages to doctree
      
      * further refactor
      
      * more refactor, more tests
      
      * finish code refactor and tests
      
      * remove unnecessary files
      
      * further clean up
      
      * add docstring template
      
      * finish tokenizer doc
      
      * finish prophetnet
      
      * fix copies
      
      * fix typos
      
      * fix tf tests
      
      * fix fp16
      
      * fix tf test 2nd try
      
      * fix code quality
      
      * add test for each model
      
      * merge new tests to branch
      
      * Update model_cards/microsoft/prophetnet-large-uncased-cnndm/README.md
      Co-authored-by: default avatarSam Shleifer <sshleifer@gmail.com>
      
      * Update model_cards/microsoft/prophetnet-large-uncased-cnndm/README.md
      Co-authored-by: default avatarSam Shleifer <sshleifer@gmail.com>
      
      * Update src/transformers/modeling_prophetnet.py
      Co-authored-by: default avatarSam Shleifer <sshleifer@gmail.com>
      
      * Update utils/check_repo.py
      Co-authored-by: default avatarSam Shleifer <sshleifer@gmail.com>
      
      * apply sams and sylvains comments
      
      * make style
      
      * remove unnecessary code
      
      * Update README.md
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update README.md
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/configuration_prophetnet.py
      Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
      
      * implement lysandres comments
      
      * correct docs
      
      * fix isort
      
      * fix tokenizers
      
      * fix copies
      Co-authored-by: default avatarweizhen <weizhen@mail.ustc.edu.cn>
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: default avatarSam Shleifer <sshleifer@gmail.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
      2422cda0