  • ProphetNet (#7157) · 2422cda0
    Weizhen authored
    
    
    * add new model prophetnet
    
    prophetnet modified
    
    modify codes as suggested v1
    
    add prophetnet test files
    
    * still bugs, because of changed output formats of encoder and decoder
    
    * move prophetnet into the latest version
    
    * clean integration tests
    
    * clean tokenizers
    
    * add xlm config to init
    
    * correct typo in init
    
    * further refactoring
    
    * continue refactor
    
    * save parallel
    
    * add decoder_attention_mask
    
    * fix use_cache vs. past_key_values
    
    * fix common tests
    
    * change decoder output logits
    
    * fix xlm tests
    
    * make common tests pass
    
    * change model architecture
    
    * add tokenizer tests
    
    * finalize model structure
    
    * no weight mapping
    
    * correct n-gram stream attention mask as discussed with qweizhen
    
    * remove unused import
    
    * fix index.rst
    
    * fix tests
    
    * delete unnecessary code
    
    * add fast integration test
    
    * rename weights
    
    * final weight remapping
    
    * save intermediate
    
    * Descriptions for Prophetnet Config File
    
    * finish all models
    
    * finish new model outputs
    
    * delete unnecessary files
    
    * refactor encoder layer
    
    * add dummy docs
    
    * code quality
    
    * fix tests
    
    * add model pages to doctree
    
    * further refactor
    
    * more refactor, more tests
    
    * finish code refactor and tests
    
    * remove unnecessary files
    
    * further clean up
    
    * add docstring template
    
    * finish tokenizer doc
    
    * finish prophetnet
    
    * fix copies
    
    * fix typos
    
    * fix tf tests
    
    * fix fp16
    
    * fix tf test 2nd try
    
    * fix code quality
    
    * add test for each model
    
    * merge new tests to branch
    
    * Update model_cards/microsoft/prophetnet-large-uncased-cnndm/README.md
    Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
    
    * Update model_cards/microsoft/prophetnet-large-uncased-cnndm/README.md
    Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
    
    * Update src/transformers/modeling_prophetnet.py
    Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
    
    * Update utils/check_repo.py
    Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
    
    * apply sams and sylvains comments
    
    * make style
    
    * remove unnecessary code
    
    * Update README.md
    Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
    
    * Update README.md
    Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
    
    * Update src/transformers/configuration_prophetnet.py
    Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
    
    * implement lysandres comments
    
    * correct docs
    
    * fix isort
    
    * fix tokenizers
    
    * fix copies
    Co-authored-by: weizhen <weizhen@mail.ustc.edu.cn>
    Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
    Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
    Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
    Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
test_modeling_t5.py 45.4 KB