"include/ck/utility/utility.hpp" did not exist on "23f633cdc5eafb6f110046794ad1384d723f1157"
  1. 07 Jul, 2020 13 commits
    • Suraj Patil's avatar
      [examples] Add trainer support for question-answering (#4829) · e49393c3
      Suraj Patil authored
      
      
      * add SquadDataset
      
      * add DataCollatorForQuestionAnswering
      
      * update __init__
      
      * add run_squad with  trainer
      
      * add DataCollatorForQuestionAnswering in __init__
      
      * pass data_collator to trainer
      
      * doc tweak
      
      * Update run_squad_trainer.py
      
      * Update __init__.py
      
      * Update __init__.py
      Co-authored-by: default avatarJulien Chaumond <chaumond@gmail.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      e49393c3
    • Quentin Lhoest's avatar
      Add DPR model (#5279) · fbd87921
      Quentin Lhoest authored
      
      
      * beginning of dpr modeling
      
      * wip
      
      * implement forward
      
      * remove biencoder + better init weights
      
      * export dpr model to embed model for nlp lib
      
      * add new api
      
      * remove old code
      
      * make style
      
      * fix dumb typo
      
      * don't load bert weights
      
      * docs
      
      * docs
      
      * style
      
      * move the `k` parameter
      
      * fix init_weights
      
      * add pretrained configs
      
      * minor
      
      * update config names
      
      * style
      
      * better config
      
      * style
      
      * clean code based on PR comments
      
      * change Dpr to DPR
      
      * fix config
      
      * switch encoder config to a dict
      
      * style
      
      * inheritance -> composition
      
      * add messages in assert startements
      
      * add dpr reader tokenizer
      
      * one tokenizer per model
      
      * fix base_model_prefix
      
      * fix imports
      
      * typo
      
      * add convert script
      
      * docs
      
      * change tokenizers conf names
      
      * style
      
      * change tokenizers conf names
      
      * minor
      
      * minor
      
      * fix wrong names
      
      * minor
      
      * remove unused convert functions
      
      * rename convert script
      
      * use return_tensors in tokenizers
      
      * remove n_questions dim
      
      * move generate logic to tokenizer
      
      * style
      
      * add docs
      
      * docs
      
      * quality
      
      * docs
      
      * add tests
      
      * style
      
      * add tokenization tests
      
      * DPR full tests
      
      * Stay true to the attention mask building
      
      * update docs
      
      * missing param in bert input docs
      
      * docs
      
      * style
      Co-authored-by: default avatarLysandre <lysandre.debut@reseau.eseo.fr>
      fbd87921
    • Sava艧 Y谋ld谋r谋m's avatar
      Update model card (#5491) · d2a93991
      Sava艧 Y谋ld谋r谋m authored
      d2a93991
    • Sava艧 Y谋ld谋r谋m's avatar
      Update model card (#5492) · 2e653d89
      Sava艧 Y谋ld谋r谋m authored
      2e653d89
    • Sava艧 Y谋ld谋r谋m's avatar
    • Manuel Romero's avatar
      electra-small-finetuned-squadv1 model card (#5430) · e6eba841
      Manuel Romero authored
      * Create model card
      
      Create model card for electra-small-discriminator finetuned on SQUAD v1.1
      
      * Set right model path in code example
      e6eba841
    • Vitalii Radchenko's avatar
      ukr-roberta-base model card (#5514) · 43b7ad5d
      Vitalii Radchenko authored
      43b7ad5d
    • Manuel Romero's avatar
      87aa857d
    • Moseli Motsoehli's avatar
      zuBERTa model card (#5536) · c7d96b60
      Moseli Motsoehli authored
      
      
      * Create README
      
      * Update README.md
      Co-authored-by: default avatarKevin Canwen Xu <canwenxu@126.com>
      c7d96b60
    • Manuel Romero's avatar
      b95dfcf1
    • Abel's avatar
      Make T5 compatible with ONNX (#5518) · 69122657
      Abel authored
      
      
      * Default decoder inputs to encoder ones for T5 if neither are specified.
      
      * Fixing typo, now all tests are passing.
      
      * Changing einsum to operations supported by onnx
      
      * Adding a test to ensure T5 can be exported to onnx op>9
      
      * Modified test for onnx export to make it faster
      
      * Styling changes.
      
      * Styling changes.
      
      * Changing notation for matrix multiplication
      Co-authored-by: default avatarAbel Riboulot <tkai@protomail.com>
      69122657
    • Patrick von Platen's avatar
      [Reformer] Adapt Reformer MaskedLM Attn mask (#5560) · 989ae326
      Patrick von Platen authored
      * fix attention mask
      
      * fix slow test
      
      * refactor attn masks
      
      * fix fp16 generate test
      989ae326
    • Shashank Gupta's avatar
      Added data collator for permutation (XLNet) language modeling and related calls (#5522) · 3dcb748e
      Shashank Gupta authored
      * Added data collator for XLNet language modeling and related calls
      
      Added DataCollatorForXLNetLanguageModeling in data/data_collator.py
      to generate necessary inputs for language modeling training with
      XLNetLMHeadModel. Also added related arguments, logic and calls in
      examples/language-modeling/run_language_modeling.py.
      
      Resolves: #4739, #2008 (partially)
      
      * Changed name to `DataCollatorForPermutationLanguageModeling`
      
      Changed the name of `DataCollatorForXLNetLanguageModeling` to the more general `DataCollatorForPermutationLanguageModelling`.
      Removed the `--mlm` flag requirement for the new collator and defined a separate `--plm_probability` flag for its use.
      CTRL uses a CLM loss just like GPT and GPT-2, so should work out of the box with this script (provided `past` is taken care of
      similar to `mems` for XLNet).
      Changed calls and imports appropriately.
      
      * Added detailed comments, changed variable names
      
      Added more detailed comments to `DataCollatorForPermutationLanguageModeling` in `data/data_collator.py` to explain working. Also cleaned up variable names and made them more informative.
      
      * Added tests for new data collator
      
      Added tests in `tests/test_trainer.py` for DataCollatorForPermutationLanguageModeling based on those in DataCollatorForLanguageModeling. A specific test has been added to check for odd-length sequences.
      
      * Fixed styling issues
      3dcb748e
  2. 06 Jul, 2020 13 commits
  3. 03 Jul, 2020 10 commits
  4. 02 Jul, 2020 4 commits