• Lysandre Debut's avatar
    ELECTRA (#3257) · d5d7d886
    Lysandre Debut authored
    * Electra wip
    
    * helpers
    
    * Electra wip
    
    * Electra v1
    
    * ELECTRA may be saved/loaded
    
    * Generator & Discriminator
    
    * Embedding size instead of halving the hidden size
    
    * ELECTRA Tokenizer
    
    * Revert BERT helpers
    
    * ELECTRA Conversion script
    
    * Archive maps
    
    * PyTorch tests
    
    * Start fixing tests
    
    * Tests pass
    
    * Same configuration for both models
    
    * Compatible with base + large
    
    * Simplification + weight tying
    
    * Archives
    
    * Auto + Renaming to standard names
    
    * ELECTRA is uncased
    
    * Tests
    
    * Slight API changes
    
    * Update tests
    
    * wip
    
    * ElectraForTokenClassification
    
    * temp
    
    * Simpler arch + tests
    
    Removed ElectraForPreTraining which will be in a script
    
    * Conversion script
    
    * Auto model
    
    * Update links to S3
    
    * Split ElectraForPreTraining and ElectraForTokenClassification
    
    * Actually test PreTraining model
    
    * Remove num_labels from configuration
    
    * wip
    
    * wip
    
    * From discriminator and generator to electra
    
    * Slight API changes
    
    * Better naming
    
    * TensorFlow ELECTRA tests
    
    * Accurate conversion script
    
    * Added to conversion script
    
    * Fast ELECTRA tokenizer
    
    * Style
    
    * Add ELECTRA to README
    
    * Modeling Pytorch Doc + Real style
    
    * TF Docs
    
    * Docs
    
    * Correct links
    
    * Correct model intialized
    
    * random fixes
    
    * style
    
    * Addressing Patrick's and Sam's comments
    
    * Correct links in docs
    d5d7d886
README.md 39.4 KB