• Shinya Otani's avatar
    Add support for Japanese GPT-NeoX-based model by ABEJA, Inc. (#18814) · f5f430e5
    Shinya Otani authored
    * add gpt-neox-japanese model and tokenizer as new model
    
    * Correction to PR's comment for GPT NeoX Japanese
    - Fix to be able to use gpu
    - Add comment # Copied... at the top of RotaryEmbedding
    - Implement nn.Linear instead of original linear class
    - Add generation test under @slow
    
    * fix bias treatment for gpt-neox-japanese
    
    * Modidy gpt-neox-japanese following PR
    - add doc for bias_dropout_add
    - style change following a PR comment
    
    * add document for gpt-neox-japanese
    
    * remove unused import from gpt-neox-japanese
    
    * fix README for gpt-neox-japanese
    f5f430e5
README.md 64.5 KB