• Kamal Raj Kanakarajan's avatar
    Add BioGPT (#20420) · 13e73668
    Kamal Raj Kanakarajan authored
    * biogpt initial commit
    
    * updated init
    
    * fix faster decoding with use_cache
    
    * 1. fix input_ids and input_embeds with correct device
    2. added _keys_to_ignore_on_load_missing
    3. updated prepare_inputs_for_generation
    
    * add activation_dropout and scale_embedding
    
    * replace fsmt attention with bart attention
    
    * added test
    
    * run make fix-copies
    
    * doc init and fix build
    
    * updated README with proper information
    
    * 1. added tips to docs
    2. updated BioGptTokenizer func
    
    * 1. added tokenizer test
    2. refactor tokenizer
    
    * make fixup
    
    * add biogpt fairseq to hf converter
    
    * updated layer names more
    similar to original checkpoints
    
    * config update doc string and set defaults
    
    * added "#copied" from bart model and
    updated doc strings
    
    * enable model_input_names in tokenizer
    
    * 1.  positionalembedding depending on attention_mask
    2. added attention mask to prepare for generation
    
    * added test to verify past and generation
    
    * BioGptLMHeadModel -> BioGptForCausalLM
    
    * fix typo
    
    * tokenization and test
    Copyright and updated assertion
    
    * updated Copyright and
    one func at time in line
    
    * Copyright updates and
    minor doc fix
    
    * replace assertion with ValueError
    
    * rm extra space
    
    * added code syntax
    
    * revert cmnt position change
    
    * add tokenizer to auto
    
    * updated doc string
    
    * tokenizer doc string update
    
    * biogpt hub model update to microsoft/biogpt
    
    * make fixup
    
    * rm cmnt to fix flake8 5.0.4 vs 6 error
    13e73668
README.md 72.1 KB