"git@developer.sourcefind.cn:chenpangpang/transformers.git" did not exist on "f38cd4373f36408105bab29e6851f62836b1452a"
  • Yih-Dar's avatar
    Make Flax GPT2 working with cross attention (#13008) · 2e20c0f3
    Yih-Dar authored
    
    
    * make flax gpt2 working with cross attention
    
    * Remove encoder->decoder projection layer
    
    * A draft (incomplete) for FlaxEncoderDecoderModel
    
    * Add the method from_encoder_decoder_pretrained + the docstrings
    
    * Fix the mistakes of using EncoderDecoderModel
    
    * Fix style
    
    * Add FlaxEncoderDecoderModel to the library
    
    * Fix cyclic imports
    
    * Add FlaxEncoderDecoderModel to modeling_flax_auto.py
    
    * Remove question comments
    
    * add tests for FlaxEncoderDecoderModel
    
    * add flax_encoder_decoder to the lists of ignored entries in check_repo.py
    
    * fix missing required positional arguments
    
    * Remove **kwargs when creating FlaxEncoderDecoderModel in from_encoder_decoder_pretrained()
    
    Also fix generation eos/pad tokens issue
    
    * Fix: Use sequences from the generated_output
    
    * Change a check from assert to raise ValueError
    
    * Fix examples and token ids issues
    
    * Fix missing all_cross_attentions when outputting tuple in modeling_gpt2
    
    * Remove the changes in configuration docstrings.
    
    * allow for bert 2 gpt2
    
    * make fix-copies
    
    * Apply suggestions from code review
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    
    * Change remaining examples to bert2gpt2
    
    * Change the test to Bert2GPT2
    
    * Fix examples
    
    * Fix import
    
    * Fix unpack bug
    
    * Rename to FlaxEncoderDecoderModelTest and change the test to bert2gpt2
    
    * Apply suggestions from code review
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    
    * Fix: NotImplentedError -> NotImplementedError
    
    * Apply suggestions from code review
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    
    * up
    
    * finalize
    Co-authored-by: default avatarydshieh <ydshieh@user.noreply>
    Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    2e20c0f3
encoderdecoder.rst 2.38 KB