1. 06 Aug, 2024 1 commit
    • Ao Tang's avatar
      Add Nemotron HF Support (#31699) · 6a03942d
      Ao Tang authored
      * Add nemotron support
      
      * fix inference
      
      * add unit test
      
      * add layernorm1p as a class to avoid meta device mismatch
      
      * test fixed
      
      * Add copied_from statements
      
      * remove pretraining_tp args
      
      * remove nemotronlayernorm
      
      * force LN computation done in FP32
      
      * remove nemotrontokenizer and use llamatokenizer
      
      * license update
      
      * add option for kv_channels for minitron8b
      
      * remove assert
      
      * o_proj fixed
      
      * o_proj reshape
      
      * add gated_proj option
      
      * typo
      
      * remove todos
      
      * fix broken test after merging latest main
      
      * remove nezha/nat after meging main
      
      * chnage default config to 15b model
      
      * add nemo conversion script
      
      * rename conversion script
      
      * remove gate_proj option
      
      * pr comment resolved
      
      * fix unit test
      
      * rename kv_channels to head_dim
      
      * resolve PR issue
      
      * add nemotron md
      
      * fix broken tests
      
      * refactor rope for nemotron
      
      * test fix
      
      * remove linearscaling
      
      * whitespace and import
      
      * fix some copied-from
      
      * code style fix
      
      * reformatted
      
      * add position_embedding to nemotronattention
      
      * rope refactor to only use config, copied-from fix
      
      * format
      
      * Run make fix-copies
      
      * nemotron md with autodoc
      
      * doc  fix
      
      * fix order
      
      * pass check_config_docstrings.py
      
      * fix config_attributes
      
      * remove all llama BC related code
      
      * Use PreTrainedTokenizerFast
      
      * ruff check examples
      
      * conversion script update
      
      * add nemotron to toctree
      6a03942d
  2. 21 May, 2024 1 commit
  3. 11 Dec, 2020 1 commit
  4. 20 Jun, 2020 1 commit
    • Kevin Canwen Xu's avatar
      Add BERT Loses Patience (Patience-based Early Exit) (#5078) · 2fd28d43
      Kevin Canwen Xu authored
      * Add BERT Loses Patience (Patience-based Early Exit)
      
      * update model archive
      
      * update format
      
      * sort import
      
      * flake8
      
      * Add results
      
      * full results
      
      * align the table
      
      * refactor to inherit
      
      * default per gpu eval = 1
      
      * Formatting
      
      * Formatting
      
      * isort
      
      * modify readme
      
      * Add check
      
      * Fix format
      
      * Fix format
      
      * Doc strings
      
      * ALBERT & BERT for sequence classification don't inherit from the original anymore
      
      * Remove incorrect comments
      
      * Remove incorrect comments
      
      * Remove incorrect comments
      
      * Sync up with new code
      
      * Sync up with new code
      
      * Add a test
      
      * Add a test
      
      * Add a test
      
      * Add a test
      
      * Add a test
      
      * Add a test
      
      * Finishing up!
      2fd28d43
  5. 03 Mar, 2020 1 commit
    • Sam Shleifer's avatar
      Summarization Examples: add Bart CNN Evaluation (#3082) · 5b396457
      Sam Shleifer authored
      * Rename and improve example
      
      * Add test
      
      * slightly faster test
      
      * style
      
      * This breaks remy prolly
      
      * shorter test string
      
      * no slow
      
      * newdir structure
      
      * New tree
      
      * Style
      
      * shorter
      
      * docs
      
      * clean
      
      * Attempt future import
      
      * more import hax
      5b396457
  6. 06 Jan, 2020 2 commits
  7. 22 Dec, 2019 1 commit
  8. 26 Sep, 2019 1 commit
  9. 05 Jul, 2019 1 commit
  10. 02 Jul, 2019 1 commit