1. 03 Jun, 2021 1 commit
    • Nicholas Vadivelu's avatar
      Fix weight decay masking in `run_flax_glue.py` (#11964) · 4674061b
      Nicholas Vadivelu authored
      
      
      * Fix weight decay masking in `run_flax_glue.py`
      
      Issues with the previous implementation:
      - The `dict` from `traverse_util.flatten_dict` has keys which are tuples of strings, not one long string with the path separated by periods.
      - `optax.masked` applies the transformation wherever the mask is True, so the masks are flipped.
      - Flax's LayerNorm calls the scale parameter `scale` not `weight`
      
      * Fix formatting with black
      
      * adapt results
      Co-authored-by: default avatarPatrick von Platen <patrick@huggingface.co>
      4674061b
  2. 31 May, 2021 1 commit
    • Nicholas Vadivelu's avatar
      Remove redundant `nn.log_softmax` in `run_flax_glue.py` (#11920) · 1ab147d6
      Nicholas Vadivelu authored
      * Remove redundant `nn.log_softmax` in `run_flax_glue.py`
      
      `optax.softmax_cross_entropy` expects unnormalized logits, and so it already calls `nn.log_softmax`, so I believe it is not needed here. `nn.log_softmax` is idempotent so mathematically it shouldn't have made a difference.
      
      * Remove unused 'flax.linen' import
      1ab147d6
  3. 26 May, 2021 1 commit
  4. 24 May, 2021 1 commit
  5. 21 May, 2021 3 commits
  6. 19 May, 2021 1 commit
  7. 18 May, 2021 1 commit
  8. 17 May, 2021 1 commit
  9. 14 May, 2021 2 commits
  10. 12 May, 2021 1 commit
  11. 11 May, 2021 1 commit
  12. 04 May, 2021 1 commit
  13. 23 Apr, 2021 1 commit
  14. 21 Apr, 2021 1 commit