1. 17 Sep, 2021 2 commits
    • Patrick von Platen's avatar
      [Trainer] Add nan/inf logging filter (#13619) · 1f9dcfc1
      Patrick von Platen authored
      * finish
      
      * add test
      
      * push
      
      * remove unnecessary code
      
      * up
      
      * correct test
      
      * Update src/transformers/training_args.py
      1f9dcfc1
    • Ibraheem Moosa's avatar
      Optimize Token Classification models for TPU (#13096) · eae7a96b
      Ibraheem Moosa authored
      * Optimize Token Classification models for TPU
      
      As per the XLA document XLA cannot handle masked indexing well. So token classification
      models for BERT and others use an implementation based on `torch.where`. This implementation
      works well on TPU. 
      
      ALBERT token classification model uses the masked indexing which causes performance issues
      on TPU. This PR fixes this issue by following the BERT implementation.
      
      * Same fix for ELECTRA
      
      * Same fix for LayoutLM
      eae7a96b
  2. 16 Sep, 2021 11 commits
  3. 15 Sep, 2021 4 commits
  4. 14 Sep, 2021 8 commits
  5. 13 Sep, 2021 10 commits
  6. 10 Sep, 2021 5 commits