"git@developer.sourcefind.cn:gaoqiong/migraphx.git" did not exist on "25b334314e57c465f45611d46cbc2582a877c0b5"
  • Sanchit Gandhi's avatar
    [FlaxBert] Add ForCausalLM (#16995) · cd9274d0
    Sanchit Gandhi authored
    * [FlaxBert] Add ForCausalLM
    
    * make style
    
    * fix output attentions
    
    * Add RobertaForCausalLM
    
    * remove comment
    
    * fix fx-to-pt model loading
    
    * remove comment
    
    * add modeling tests
    
    * add enc-dec model tests
    
    * add big_bird
    
    * add electra
    
    * make style
    
    * make repo-consitency
    
    * add to docs
    
    * remove roberta test
    
    * quality
    
    * amend cookiecutter
    
    * fix attention_mask bug in flax bert model tester
    
    * tighten pt-fx thresholds to 1e-5
    
    * add 'copied from' statements
    
    * amend 'copied from' statements
    
    * amend 'copied from' statements
    
    * quality
    cd9274d0
roberta.mdx 4.69 KB