"...git@developer.sourcefind.cn:lacacy/qwen_lmdeploy.git" did not exist on "5ed6bb5948b0c1cf8796758a1f64ce5c1297256a"
  • Suraj Patil's avatar
    Longformer for question answering (#4500) · 03d8527d
    Suraj Patil authored
    * added LongformerForQuestionAnswering
    
    * add LongformerForQuestionAnswering
    
    * fix import for LongformerForMaskedLM
    
    * add LongformerForQuestionAnswering
    
    * hardcoded sep_token_id
    
    * compute attention_mask if not provided
    
    * combine global_attention_mask with attention_mask when provided
    
    * update example in  docstring
    
    * add assert error messages, better attention combine
    
    * add test for longformerForQuestionAnswering
    
    * typo
    
    * cast gloabl_attention_mask to long
    
    * make style
    
    * Update src/transformers/configuration_longformer.py
    
    * Update src/transformers/configuration_longformer.py
    
    * fix the code quality
    
    * Merge branch 'longformer-for-question-answering' of https://github.com/patil-suraj/transformers
    
     into longformer-for-question-answering
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    03d8527d
test_modeling_longformer.py 12.1 KB