"...lm-evaluation-harness.git" did not exist on "0ba4ae157df91744e4bf10fe31d31200b37b706a"
  • Edward Beeching's avatar
    Decision transformer gym (#15845) · aff9bc40
    Edward Beeching authored
    
    
    * Created the Decision Transformer Modle
    
    * updating tests, copy to other machine
    
    * Added last hidden size to Decision Transformer modelling outputs
    
    * Removed copy of original DT file
    
    * made a temporary change to gpt2 to have it conform with the Decision Transformer version
    
    * Updated tests
    
    * Ignoring a file used to test the DT model
    
    * added comments to config file
    
    * added comments and argument descriptions to decision transformer file
    
    * Updated doc
    
    * Ran "make style"
    
    * Remove old model imports
    
    * Removed unused imports, cleaned up init file
    
    * Update docs/source/model_doc/decision_transformer.mdx
    
    added my username
    Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
    
    * Reverted changes made to gpt2
    
    * Removed datasets submodule
    
    * Update the modeling outputs to include gpt2 attentions, hidden states and last hidden states
    
    * Added support for return of hidden states, attentions and return dict of gpt2 model.
    
    * Updated tests to include many of the ModelTesterMixin tests. 
    
    The following tests are skipped: test_generate_without_input_ids, test_pruning, test_resize_embeddings, test_head_masking, test_attention_outputs, test_hidden_states_output, test_inputs_embeds, test_model_common_attributes
    
    * Added missing line to the end of gpt2 file
    
    * Added an integration test for the Decision Transformer
    
    Test performs and autoregressive evaluation for two time steps
    
    * Set done and info to _ to fix failing test
    
    * Updated integration test to be deterministic and check expected outputs
    
    * Apply suggestions from code review
    Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
    
    * Removed unnecessary config options
    
    * Cleaned up commented code and old comments.
    
    * Cleaned up commented code.
    
    * Changed DecisionTransformer to Decision Transformer
    
    * Added Decision Transformer to the main README file
    
    * Added copy of GTP2 called DecisionTranformerGPT2Model
    
    * isorted imports
    
    * isorted imports
    
    * Added model to non-English README files
    
    * Ran make fix-copies and corrected some cases.
    
    * Updated index file to include Decision Transformer
    
    * Added gpt2 model as copy inside the Decision Transformer model file
    
    * Added the unit test file to the list of TEST_FILES_WITH_NO_COMMON_TESTS
    
    * Deleted redundant checkpoint files (I don't know how these got committed)
    
    * Removed testing files. (These should have never been committed)
    
    * Removed accidentally committed files
    
    * Moved the Decision Transformer test to its own directory
    
    * Add type hints for Pegasus (#16324)
    
    * Funnel type hints (#16323)
    
    * add pt funnel type hints
    
    * add tf funnel type hints
    
    * Add type hints for ProphetNet PyTorch (#16272)
    
    * [GLPN] Improve docs (#16331)
    
    * Add link to notebook
    
    * Add link
    
    * Fix bug
    Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
    
    * Added type hints for Pytorch Marian calls (#16200)
    
    * Added type hinting for forward functions in pytorch marian
    
    * typo correction
    
    * Removed type hints on functions from BART per Suraj Patil request
    
    * fix import pb
    
    * fix typo
    
    * corrected tuple call
    
    * ran black
    
    * after fix-copies
    Some optional tags on primitives were removed, past_key_values in MarianForCausalLM changed from Tuple of Tuple to List
    
    * Fixing copies to roformer and pegasus
    Co-authored-by: default avatarClementine Fourrier <cfourrie@inria.fr>
    Co-authored-by: default avatarmatt <rocketknight1@gmail.com>
    
    * Moved DecisionTransformOutput to modeling_decision_transformer
    
    * Moved the example usage to research project and cleaned comments
    
    * Made tests ignore the copy of gpt2 in Decision Transformer
    
    * Added module output to modelling decision transformer
    
    * removed copied gpt2 model from list of transformers models
    
    * Updated tests and created __init__ file for new test location
    
    * Update README.md
    Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
    
    * Update src/transformers/models/decision_transformer/configuration_decision_transformer.py
    Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
    
    * Removed unneeded summary type from config file
    
    * Fixed copies
    
    * Updated pretrained config map to refer to hopper-medium checkpoint
    
    * done (#16340)
    
    * Added Decision transformer to model docs
    
    * Update src/transformers/models/decision_transformer/modeling_decision_transformer.py
    Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
    
    * Update src/transformers/models/decision_transformer/modeling_decision_transformer.py
    Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
    
    * Update src/transformers/models/decision_transformer/configuration_decision_transformer.py
    Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
    
    * Add type annotations for Rembert/Splinter and copies (#16338)
    
    * undo black autoformat
    
    * minor fix to rembert forward with default
    
    * make fix-copies, make quality
    
    * Adding types to template model
    
    * Removing List from the template types
    
    * Remove `Optional` from a couple of types that don't accept `None`
    Co-authored-by: default avatarmatt <rocketknight1@gmail.com>
    
    * [Bug template] Shift responsibilities for long-range (#16344)
    
    * Fix code repetition in serialization guide (#16346)
    
    * Adopt framework-specific blocks for content (#16342)
    
    *  refactor code samples with framework-specific blocks
    
    *  update training.mdx
    
    * 🖍
    
     apply feedback
    
    * Updates the default branch from master to main (#16326)
    
    * Updates the default branch from master to main
    
    * Links from `master` to `main`
    
    * Typo
    
    * Update examples/flax/README.md
    Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
    Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
    
    * Updated model with custom docstring example
    
    * Created the Decision Transformer Modle
    
    * updating tests, copy to other machine
    
    * Added last hidden size to Decision Transformer modelling outputs
    
    * Removed copy of original DT file
    
    * made a temporary change to gpt2 to have it conform with the Decision Transformer version
    
    * Updated tests
    
    * Ignoring a file used to test the DT model
    
    * added comments to config file
    
    * added comments and argument descriptions to decision transformer file
    
    * Updated doc
    
    * Ran "make style"
    
    * Remove old model imports
    
    * Removed unused imports, cleaned up init file
    
    * Update docs/source/model_doc/decision_transformer.mdx
    
    added my username
    Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
    
    * Reverted changes made to gpt2
    
    * Removed datasets submodule
    
    * Update the modeling outputs to include gpt2 attentions, hidden states and last hidden states
    
    * Added support for return of hidden states, attentions and return dict of gpt2 model.
    
    * Updated tests to include many of the ModelTesterMixin tests. 
    
    The following tests are skipped: test_generate_without_input_ids, test_pruning, test_resize_embeddings, test_head_masking, test_attention_outputs, test_hidden_states_output, test_inputs_embeds, test_model_common_attributes
    
    * Added missing line to the end of gpt2 file
    
    * Added an integration test for the Decision Transformer
    
    Test performs and autoregressive evaluation for two time steps
    
    * Set done and info to _ to fix failing test
    
    * Updated integration test to be deterministic and check expected outputs
    
    * Apply suggestions from code review
    Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
    
    * Removed unnecessary config options
    
    * Cleaned up commented code and old comments.
    
    * Cleaned up commented code.
    
    * Changed DecisionTransformer to Decision Transformer
    
    * Added Decision Transformer to the main README file
    
    * Added copy of GTP2 called DecisionTranformerGPT2Model
    
    * isorted imports
    
    * isorted imports
    
    * Added model to non-English README files
    
    * Ran make fix-copies and corrected some cases.
    
    * Updated index file to include Decision Transformer
    
    * Added gpt2 model as copy inside the Decision Transformer model file
    
    * Added the unit test file to the list of TEST_FILES_WITH_NO_COMMON_TESTS
    
    * Deleted redundant checkpoint files (I don't know how these got committed)
    
    * Removed testing files. (These should have never been committed)
    
    * Removed accidentally committed files
    
    * Moved the Decision Transformer test to its own directory
    
    * Moved DecisionTransformOutput to modeling_decision_transformer
    
    * Moved the example usage to research project and cleaned comments
    
    * Made tests ignore the copy of gpt2 in Decision Transformer
    
    * Added module output to modelling decision transformer
    
    * removed copied gpt2 model from list of transformers models
    
    * Updated tests and created __init__ file for new test location
    
    * Update README.md
    Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
    
    * Update src/transformers/models/decision_transformer/configuration_decision_transformer.py
    Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
    
    * Removed unneeded summary type from config file
    
    * Fixed copies
    
    * Updated pretrained config map to refer to hopper-medium checkpoint
    
    * Added Decision transformer to model docs
    
    * Update src/transformers/models/decision_transformer/modeling_decision_transformer.py
    Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
    
    * Update src/transformers/models/decision_transformer/modeling_decision_transformer.py
    Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
    
    * Update src/transformers/models/decision_transformer/configuration_decision_transformer.py
    Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
    
    * Updated model with custom docstring example
    
    * Updated copies, config auto, and readme files.
    Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
    Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
    Co-authored-by: default avatarDan Tegzes <48134725+Tegzes@users.noreply.github.com>
    Co-authored-by: default avatarAdam Montgomerie <adam@avanssion.com>
    Co-authored-by: default avatarNielsRogge <48327001+NielsRogge@users.noreply.github.com>
    Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
    Co-authored-by: default avatarClémentine Fourrier <22726840+clefourrier@users.noreply.github.com>
    Co-authored-by: default avatarClementine Fourrier <cfourrie@inria.fr>
    Co-authored-by: default avatarmatt <rocketknight1@gmail.com>
    Co-authored-by: default avatarFrancesco Saverio Zuppichini <francesco.zuppichini@gmail.com>
    Co-authored-by: default avatarJacob Dineen <54680234+jacobdineen@users.noreply.github.com>
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    Co-authored-by: default avatarOmar Sanseviero <osanseviero@gmail.com>
    Co-authored-by: default avatarSteven Liu <59462357+stevhliu@users.noreply.github.com>
    Co-authored-by: default avatarLysandre Debut <lysandre.debut@reseau.eseo.fr>
    aff9bc40
README.md 53.1 KB