"docs/vscode:/vscode.git/clone" did not exist on "0201f6420b186664095844666b565dd0c5c897b3"
  • Suraj Patil's avatar
    FlaxGPT2 (#11556) · ca33278f
    Suraj Patil authored
    
    
    * flax gpt2
    
    * combine masks
    
    * handle shared embeds
    
    * add causal LM sample
    
    * style
    
    * add tests
    
    * style
    
    * fix imports, docs, quality
    
    * don't use cache
    
    * add cache
    
    * add cache 1st version
    
    * make use cache work
    
    * start adding test for generation
    
    * finish generation loop compilation
    
    * rewrite test
    
    * finish
    
    * update
    
    * update
    
    * apply sylvains suggestions
    
    * update
    
    * refactor
    
    * fix typo
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    ca33278f
auto.rst 8.55 KB