    Implement head_mask for Flax BERT and other models copied from BERT (#14620) · ff066119
    Daniel Stancl authored
    * Implement head_mask for Flax BERT and other models copied from BERT
    
    * Remove `from jax._src.nn.functions import sigmoid`, unintentionally added by the IDE
    
    * Remove no-longer-valid copy statement
    
    * Apply patil-suraj's suggestions from code review
    
    * Apply suggestions from the code review
    
    * Update Flax template
    
    * Fix a typo
    
    * Also update template for CausalLM modules
test_modeling_flax_electra.py 4.76 KB