"vscode:/vscode.git/clone" did not exist on "0db5d911fc94604f9568b4b212e005ec4600d157"
  1. 25 Apr, 2022 1 commit
  2. 21 Apr, 2022 1 commit
  3. 20 Apr, 2022 1 commit
  4. 19 Apr, 2022 5 commits
    • Correct Logging of Eval metric to Tensorboard (#16825) · b5c6a63e
      Jeevesh Juneja authored
      * Correct Logging of Eval metric to Tensorboard
      
      An empty dictionary ``eval_metrics`` was being logged; it is replaced by ``eval_metric``, the output dictionary of ``metric.compute()``.
      
      * Remove unused variable
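The logging fix described in this commit can be illustrated with a minimal sketch. `ToyAccuracy`, `log_eval`, and the `sink` list are hypothetical stand-ins introduced only for illustration; they are not the script's actual metric object or TensorBoard writer. The point is only that the dictionary returned by `compute()` is the one that gets logged.

```python
class ToyAccuracy:
    """Hypothetical stand-in for a metric object with add_batch() and compute()."""

    def __init__(self):
        self.correct = 0
        self.total = 0

    def add_batch(self, predictions, references):
        # Accumulate counts batch by batch, as an evaluation loop would.
        self.correct += sum(int(p == r) for p, r in zip(predictions, references))
        self.total += len(references)

    def compute(self):
        # Returns the populated results dictionary.
        return {"accuracy": self.correct / self.total}


def log_eval(metric, step, sink):
    """Append one (tag, value, step) record per computed metric to `sink`."""
    eval_metric = metric.compute()  # log this, not a separate empty dictionary
    for name, value in eval_metric.items():
        sink.append((f"eval_{name}", value, step))
    return eval_metric


metric = ToyAccuracy()
metric.add_batch(predictions=[1, 0, 1, 1], references=[1, 1, 1, 0])
records = []
log_eval(metric, step=10, sink=records)
print(records)
```

In a real script the `sink.append` call would be replaced by whatever tracker the script uses.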
    • Add image classification script, no trainer (#16727) · b96e82c8
      NielsRogge authored
      * Add first draft
      
      * Improve README and run fixup
      
      * Make script aligned with other scripts, improve README
      
      * Improve script and add test
      
      * Remove print statement
      
      * Apply suggestions from code review
      
      * Add num_labels to make test pass
      
      * Improve README
    • Wonjae Kim's avatar
      b74a9553
    • [Flax] improve large model init and loading (#16148) · d3bd9ac7
      Suraj Patil authored
      
      
      * begin do_init
      
      * add params_shape_tree
      
      * raise error if params are accessed when do_init is False
      
      * don't allow do_init=False when keys are missing
      
      * make shape tree a property
      
      * assign self._params at the end
      
      * add test for do_init
      
      * add do_init arg to all flax models
      
      * fix param setting
      
      * disable do_init for composite models
      
      * update test
      
      * add do_init in FlaxBigBirdForMultipleChoice
      
      * better names and errors
      
      * improve test
      
      * style
      
      * add a warning when do_init=False
      
      * remove extra if
      
      * set params after _required_params
      
      * add test for from_pretrained
      
      * do_init => _do_init
      
      * change warning to info
      
      * fix typo
      
      * add params in init_weights
      
      * add params to gpt neo init
      
      * add params to init_weights
      
      * update do_init test
      
      * Trigger CI
      
      * Apply suggestions from code review
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * update template
      
      * trigger CI
      
      * style
      
      * style
      
      * fix template
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
    • Add semantic script no trainer, v2 (#16788) · 7db7aab4
      NielsRogge authored
      * Add first draft from previous PR
      
      * First draft
      
      * Improve README and remove num_labels
      
      * Make script more aligned with other scripts
      
      * Improve README and apply suggestion from code review
  5. 15 Apr, 2022 1 commit
  6. 14 Apr, 2022 1 commit
  7. 13 Apr, 2022 2 commits
    • Fixup no_trainer examples scripts and add more tests (#16765) · be752d12
      Zachary Mueller authored
      * Change tracking to store_true
      
      * Remove step param and use it in the log dictionary directly
      
      * use vars(args) when passing args to init_trackers
      
      * Include tracking tests since tensorboard is already a dep
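Two of the changes listed in this commit, a `store_true` tracking flag and passing `vars(args)` as the tracker configuration, can be sketched with the standard library alone. The flag names `--with_tracking` and `--learning_rate` are assumptions chosen for illustration, and the tracker call itself is omitted; the sketch only shows how the flag behaves and what `vars(args)` produces.

```python
import argparse


def build_parser():
    parser = argparse.ArgumentParser(description="Illustrative no_trainer-style arguments")
    # A store_true flag takes no value: present means True, absent means False.
    parser.add_argument("--with_tracking", action="store_true",
                        help="Enable experiment tracking (hypothetical flag name).")
    parser.add_argument("--learning_rate", type=float, default=5e-5)
    return parser


# Parse an explicit argument list so the sketch does not depend on sys.argv.
args = build_parser().parse_args(["--with_tracking"])

# vars() turns the Namespace into a plain dict, which is the shape a tracker
# configuration argument typically expects.
config = vars(args)
print(config)
```

Passing a plain dictionary rather than the `Namespace` keeps the configuration serializable by trackers that only understand basic types.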
    • Add self training code for text classification (#16738) · 34ef029d
      Tu Vu authored
      * Add self-training code for text-classification
      
      * Delete strata
  8. 12 Apr, 2022 2 commits
  9. 11 Apr, 2022 4 commits
    • Fix example logs repeating themselves (#16669) · 69233cf0
      Zachary Mueller authored
      Move declaration of log streams to before tests, so that results won't get compounded on top of each other
    • Don't push checkpoints to hub in `no_trainer` scripts (#16703) · d4b3e359
      Zachary Mueller authored
      Adds checkpoint prefixes to the gitignore if `push_to_hub` is used along with `checkpointing_steps`
    • Fix t5 shard on TPU Pods (#16527) · 5e686757
      Ahmed Elnaggar authored
      
      
      * Fix t5 shard on TPU Pods
      
      The current script doesn't work properly on a TPU pod because the global batch is not divided correctly per host.
      This pull request fixes this issue by dividing the global batch to each host before it is shared on each host.
      
      * fix style
      Co-authored-by: ahmed-elnaggar <ahmed.elnaggar@allianz.com>
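The per-host split described in this commit can be sketched in plain Python. This is only an illustration under stated assumptions: `split_for_host` and `shard_for_devices` are hypothetical helper names, the batch is a flat list rather than an array, and the host and device counts are passed in explicitly instead of being queried from the runtime.

```python
def split_for_host(global_batch, host_index, host_count):
    """Return the contiguous slice of the global batch owned by one host."""
    if len(global_batch) % host_count != 0:
        raise ValueError("global batch size must be divisible by the host count")
    per_host = len(global_batch) // host_count
    start = host_index * per_host
    return global_batch[start:start + per_host]


def shard_for_devices(host_batch, local_device_count):
    """Split one host's slice into equal shards, one per local device."""
    if len(host_batch) % local_device_count != 0:
        raise ValueError("host batch size must be divisible by the local device count")
    per_device = len(host_batch) // local_device_count
    return [host_batch[i * per_device:(i + 1) * per_device]
            for i in range(local_device_count)]


# Toy setup: 16 examples, 2 hosts, 4 devices per host.
global_batch = list(range(16))
host_batch = split_for_host(global_batch, host_index=1, host_count=2)
device_shards = shard_for_devices(host_batch, local_device_count=4)
print(device_shards)
```

Without the first step, every host would shard the full global batch across its own devices, so the same examples would be processed once per host.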
    • Jia multi gpu eval (#16428) · 4868a830
      Jia LI authored
      
      
      * add simple multi gpu completion
      
      * add human_eval_multi_gpu
      
      * use copy strategy to distribute across gpu, to avoid padding
      
      * add doc string
      
      * update code style
      
      * use task id to arrange output
      
      * truncate input to avoid zero pad
      
      * Stop the copy mechanism
      
      * update style
      
      * restore copies to scale better in distributed mode
      
      * update style
      
      * replace human eval
      
      * Apply suggestions from code review
      
      1. Tokenize all input at the same time
      2. use attention_mask to get the input length
      3. other small fixes
      Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>
      
      * correct typo and update docstring
      
      * update code style
      
      * remove num sample division constraint
      
      * remove max len calculation
      
      * use accelerator.gather once to speed up
      
      * use accelerate set_seed; update accelerate version
      
      * correct gather bug
      Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>
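Two of the review suggestions in this commit, tokenizing all inputs together and reading each input's length from the attention mask, can be sketched without any tokenizer library. `pad_batch` and `input_lengths` are hypothetical helpers, and the token ids are made up; a real tokenizer would produce the padded ids and the mask itself.

```python
def pad_batch(sequences, pad_id=0):
    """Right-pad token id lists to a common length and build the attention mask."""
    max_len = max(len(seq) for seq in sequences)
    input_ids = [seq + [pad_id] * (max_len - len(seq)) for seq in sequences]
    # 1 marks a real token, 0 marks padding.
    attention_mask = [[1] * len(seq) + [0] * (max_len - len(seq)) for seq in sequences]
    return input_ids, attention_mask


def input_lengths(attention_mask):
    """The number of real tokens in each row is the sum of that row's mask."""
    return [sum(row) for row in attention_mask]


# Toy batch of already-tokenized prompts (made-up ids).
batch = [[5, 6, 7], [8], [9, 10]]
input_ids, attention_mask = pad_batch(batch)
lengths = input_lengths(attention_mask)
print(lengths)
```

Summing the mask avoids comparing ids against the pad token, which would miscount if the pad id also appears as a real token.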
  10. 08 Apr, 2022 2 commits
    • Add tests for no_trainer and fix existing examples (#16656) · d57da992
      Zachary Mueller authored
      * Fixed some bugs involving saving during epochs
      * Added tests mimicking the existing examples tests
      * Added in json exporting to all `no_trainer` examples for consistency 
    • Add TAPEX (#16473) · 4ef0abb7
      NielsRogge authored
      
      
      * Add TapexTokenizer
      
      * Improve docstrings and provide option to provide answer
      
      * Remove option for pretokenized inputs
      
      * Add TAPEX to README
      
      * Fix copies
      
      * Remove option for pretokenized inputs
      
      * Initial commit: add tapex fine-tuning examples on both table-based question answering and table-based fact verification.
      
      * - Draft a README file for running the script and introducing some background.
      - Remove unused code lines in tabfact script.
      - Disable the default `pad_to_max_length` option which is memory-consuming.
      
      * * Support `as_target_tokenizer` function for TapexTokenizer.
      * Fix the do_lower_case behaviour of TapexTokenizer.
      * Add unit tests for target scenarios and cased/uncased scenarios for both source and target.
      
      * * Replace the label BartTokenizer with TapexTokenizer's as_target_tokenizer function.
      * Fix typos in tapex example README.
      
      * * fix the evaluation script - remove the property `task_name`
      
      * * Make the label space more clear for tabfact tasks
      
      * * Using a new fine-tuning script for tapex-base on tabfact.
      
      * * Remove the lowercase code outside the tokenizer - the tokenizer's do_lower_case option now controls lowercasing
      * Ensure the hyper-parameters run without out-of-memory errors on a 16GB card, and report the newly reproduced number on wikisql
      
      * * Remove the default tokenizer_name option.
      * Provide evaluation command.
      
      * * Support for WikiTableQuestion dataset.
      
      * Fix a typo in README.
      
      * * Fix the dataset's key name in WikiTableQuestions
      
      * Run make fixup and move test to folder
      
      * Fix quality
      
      * Apply suggestions from code review
      
      * Apply suggestions from code review
      Co-authored-by: Suraj Patil <surajp815@gmail.com>
      
      * Apply suggestions from code review
      
      * Apply suggestions from code review
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Apply some more suggestions from code review
      
      * Improve docstrings
      
      * Overwrite failing test
      
      * Improve comment in example scripts
      
      * Fix rebase
      
      * Add TAPEX to Auto mapping
      
      * Add TAPEX to auto config mappings
      
      * Put TAPEX higher than BART in auto mapping
      
      * Add TAPEX to doc tests
      Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>
      Co-authored-by: SivilTaram <qianlxc@outlook.com>
      Co-authored-by: Niels Rogge <nielsrogge@nielss-mbp.home>
      Co-authored-by: Suraj Patil <surajp815@gmail.com>
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
  11. 06 Apr, 2022 2 commits
  12. 04 Apr, 2022 1 commit
  13. 01 Apr, 2022 1 commit
  14. 31 Mar, 2022 2 commits
  15. 30 Mar, 2022 1 commit
  16. 29 Mar, 2022 2 commits
  17. 28 Mar, 2022 2 commits
  18. 25 Mar, 2022 2 commits
  19. 24 Mar, 2022 1 commit
  20. 23 Mar, 2022 3 commits
    • Decision transformer gym (#15845) · aff9bc40
      Edward Beeching authored
      
      
      * Created the Decision Transformer Model
      
      * updating tests, copy to other machine
      
      * Added last hidden size to Decision Transformer modelling outputs
      
      * Removed copy of original DT file
      
      * made a temporary change to gpt2 to have it conform with the Decision Transformer version
      
      * Updated tests
      
      * Ignoring a file used to test the DT model
      
      * added comments to config file
      
      * added comments and argument descriptions to decision transformer file
      
      * Updated doc
      
      * Ran "make style"
      
      * Remove old model imports
      
      * Removed unused imports, cleaned up init file
      
      * Update docs/source/model_doc/decision_transformer.mdx
      
      added my username
      Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
      
      * Reverted changes made to gpt2
      
      * Removed datasets submodule
      
      * Update the modeling outputs to include gpt2 attentions, hidden states and last hidden states
      
      * Added support for return of hidden states, attentions and return dict of gpt2 model.
      
      * Updated tests to include many of the ModelTesterMixin tests. 
      
      The following tests are skipped: test_generate_without_input_ids, test_pruning, test_resize_embeddings, test_head_masking, test_attention_outputs, test_hidden_states_output, test_inputs_embeds, test_model_common_attributes
      
      * Added missing line to the end of gpt2 file
      
      * Added an integration test for the Decision Transformer
      
      Test performs an autoregressive evaluation for two time steps
      
      * Set done and info to _ to fix failing test
      
      * Updated integration test to be deterministic and check expected outputs
      
      * Apply suggestions from code review
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Removed unnecessary config options
      
      * Cleaned up commented code and old comments.
      
      * Cleaned up commented code.
      
      * Changed DecisionTransformer to Decision Transformer
      
      * Added Decision Transformer to the main README file
      
      * Added copy of GPT2 called DecisionTransformerGPT2Model
      
      * isorted imports
      
      * isorted imports
      
      * Added model to non-English README files
      
      * Ran make fix-copies and corrected some cases.
      
      * Updated index file to include Decision Transformer
      
      * Added gpt2 model as copy inside the Decision Transformer model file
      
      * Added the unit test file to the list of TEST_FILES_WITH_NO_COMMON_TESTS
      
      * Deleted redundant checkpoint files (I don't know how these got committed)
      
      * Removed testing files. (These should have never been committed)
      
      * Removed accidentally committed files
      
      * Moved the Decision Transformer test to its own directory
      
      * Add type hints for Pegasus (#16324)
      
      * Funnel type hints (#16323)
      
      * add pt funnel type hints
      
      * add tf funnel type hints
      
      * Add type hints for ProphetNet PyTorch (#16272)
      
      * [GLPN] Improve docs (#16331)
      
      * Add link to notebook
      
      * Add link
      
      * Fix bug
      Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      
      * Added type hints for Pytorch Marian calls (#16200)
      
      * Added type hinting for forward functions in pytorch marian
      
      * typo correction
      
      * Removed type hints on functions from BART per Suraj Patil request
      
      * fix import pb
      
      * fix typo
      
      * corrected tuple call
      
      * ran black
      
      * after fix-copies
      Some optional tags on primitives were removed, past_key_values in MarianForCausalLM changed from Tuple of Tuple to List
      
      * Fixing copies to roformer and pegasus
      Co-authored-by: Clementine Fourrier <cfourrie@inria.fr>
      Co-authored-by: matt <rocketknight1@gmail.com>
      
      * Moved DecisionTransformOutput to modeling_decision_transformer
      
      * Moved the example usage to research project and cleaned comments
      
      * Made tests ignore the copy of gpt2 in Decision Transformer
      
      * Added module output to modelling decision transformer
      
      * removed copied gpt2 model from list of transformers models
      
      * Updated tests and created __init__ file for new test location
      
      * Update README.md
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/decision_transformer/configuration_decision_transformer.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Removed unneeded summary type from config file
      
      * Fixed copies
      
      * Updated pretrained config map to refer to hopper-medium checkpoint
      
      * done (#16340)
      
      * Added Decision transformer to model docs
      
      * Update src/transformers/models/decision_transformer/modeling_decision_transformer.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/decision_transformer/modeling_decision_transformer.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/decision_transformer/configuration_decision_transformer.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Add type annotations for Rembert/Splinter and copies (#16338)
      
      * undo black autoformat
      
      * minor fix to rembert forward with default
      
      * make fix-copies, make quality
      
      * Adding types to template model
      
      * Removing List from the template types
      
      * Remove `Optional` from a couple of types that don't accept `None`
      Co-authored-by: matt <rocketknight1@gmail.com>
      
      * [Bug template] Shift responsibilities for long-range (#16344)
      
      * Fix code repetition in serialization guide (#16346)
      
      * Adopt framework-specific blocks for content (#16342)
      
      * refactor code samples with framework-specific blocks

      * update training.mdx

      * 🖍 apply feedback
      
      * Updates the default branch from master to main (#16326)
      
      * Updates the default branch from master to main
      
      * Links from `master` to `main`
      
      * Typo
      
      * Update examples/flax/README.md
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Updated model with custom docstring example
      
      * Created the Decision Transformer Model
      
      * updating tests, copy to other machine
      
      * Added last hidden size to Decision Transformer modelling outputs
      
      * Removed copy of original DT file
      
      * made a temporary change to gpt2 to have it conform with the Decision Transformer version
      
      * Updated tests
      
      * Ignoring a file used to test the DT model
      
      * added comments to config file
      
      * added comments and argument descriptions to decision transformer file
      
      * Updated doc
      
      * Ran "make style"
      
      * Remove old model imports
      
      * Removed unused imports, cleaned up init file
      
      * Update docs/source/model_doc/decision_transformer.mdx
      
      added my username
      Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
      
      * Reverted changes made to gpt2
      
      * Removed datasets submodule
      
      * Update the modeling outputs to include gpt2 attentions, hidden states and last hidden states
      
      * Added support for return of hidden states, attentions and return dict of gpt2 model.
      
      * Updated tests to include many of the ModelTesterMixin tests. 
      
      The following tests are skipped: test_generate_without_input_ids, test_pruning, test_resize_embeddings, test_head_masking, test_attention_outputs, test_hidden_states_output, test_inputs_embeds, test_model_common_attributes
      
      * Added missing line to the end of gpt2 file
      
      * Added an integration test for the Decision Transformer
      
      Test performs an autoregressive evaluation for two time steps
      
      * Set done and info to _ to fix failing test
      
      * Updated integration test to be deterministic and check expected outputs
      
      * Apply suggestions from code review
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Removed unnecessary config options
      
      * Cleaned up commented code and old comments.
      
      * Cleaned up commented code.
      
      * Changed DecisionTransformer to Decision Transformer
      
      * Added Decision Transformer to the main README file
      
      * Added copy of GPT2 called DecisionTransformerGPT2Model
      
      * isorted imports
      
      * isorted imports
      
      * Added model to non-English README files
      
      * Ran make fix-copies and corrected some cases.
      
      * Updated index file to include Decision Transformer
      
      * Added gpt2 model as copy inside the Decision Transformer model file
      
      * Added the unit test file to the list of TEST_FILES_WITH_NO_COMMON_TESTS
      
      * Deleted redundant checkpoint files (I don't know how these got committed)
      
      * Removed testing files. (These should have never been committed)
      
      * Removed accidentally committed files
      
      * Moved the Decision Transformer test to its own directory
      
      * Moved DecisionTransformOutput to modeling_decision_transformer
      
      * Moved the example usage to research project and cleaned comments
      
      * Made tests ignore the copy of gpt2 in Decision Transformer
      
      * Added module output to modelling decision transformer
      
      * removed copied gpt2 model from list of transformers models
      
      * Updated tests and created __init__ file for new test location
      
      * Update README.md
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/decision_transformer/configuration_decision_transformer.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Removed unneeded summary type from config file
      
      * Fixed copies
      
      * Updated pretrained config map to refer to hopper-medium checkpoint
      
      * Added Decision transformer to model docs
      
      * Update src/transformers/models/decision_transformer/modeling_decision_transformer.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/decision_transformer/modeling_decision_transformer.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/decision_transformer/configuration_decision_transformer.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Updated model with custom docstring example
      
      * Updated copies, config auto, and readme files.
      Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: Dan Tegzes <48134725+Tegzes@users.noreply.github.com>
      Co-authored-by: Adam Montgomerie <adam@avanssion.com>
      Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
      Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      Co-authored-by: Clémentine Fourrier <22726840+clefourrier@users.noreply.github.com>
      Co-authored-by: Clementine Fourrier <cfourrie@inria.fr>
      Co-authored-by: matt <rocketknight1@gmail.com>
      Co-authored-by: Francesco Saverio Zuppichini <francesco.zuppichini@gmail.com>
      Co-authored-by: Jacob Dineen <54680234+jacobdineen@users.noreply.github.com>
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
    • Reorganize file utils (#16264) · 4975002d
      Sylvain Gugger authored
      * Split file_utils in several submodules
      
      * Fixes
      
      * Add back more objects
      
      * More fixes
      
      * Who exactly decided to import that from there?
      
      * Second suggestion to code with code review
      
      * Revert wrong move
      
      * Fix imports
      
      * Adapt all imports
      
      * Adapt all imports everywhere
      
      * Revert this import, will fix in a separate commit
    • Updates the default branch from master to main (#16326) · eca77f47
      Lysandre Debut authored
      
      
      * Updates the default branch from master to main
      
      * Links from `master` to `main`
      
      * Typo
      
      * Update examples/flax/README.md
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
  21. 21 Mar, 2022 1 commit
  22. 17 Mar, 2022 1 commit
  23. 16 Mar, 2022 1 commit