1. 06 Jul, 2023 3 commits
    • Sourab Mangrulkar's avatar
      DeepSpeed/FSDP ckpt saving utils fixes and FSDP training args fixes (#24591) · 66a37842
      Sourab Mangrulkar authored
      * update ds and fsdp ckpt logic
      
      * refactoring
      
      * fix 🐛
      
      * resolve comment
      
      * fix issue with overriding of the fsdp config set by accelerate
      66a37842
    • Zhao Tianyu's avatar
      Add dropouts to GPT-NeoX (#24680) · 39274045
      Zhao Tianyu authored
      * add attention dropout, post attention dropout, post mlp dropout to gpt-neox
      
      * fix typo
      
      * add documentation
      
      * fix too long line
      
      * ran Checking/fixing src/transformers/models/gpt_neox/configuration_gpt_neox.py src/transformers/models/gpt_neox/modeling_gpt_neox.py
      python utils/custom_init_isort.py
      python utils/sort_auto_mappings.py
      doc-builder style src/transformers docs/source --max_len 119 --path_to_docs docs/source
      python utils/check_doc_toc.py --fix_and_overwrite
      running deps_table_update
      updating src/transformers/dependency_versions_table.py
      python utils/check_copies.py
      python utils/check_table.py
      python utils/check_dummies.py
      python utils/check_repo.py
      Checking all models are included.
      Checking all models are public.
      Checking all models are properly tested.
      Checking all objects are properly documented.
      Checking all models are in at least one auto class.
      Checking all names in auto name mappings are defined.
      Checking all keys in auto name mappings are defined in `CONFIG_MAPPING_NAMES`.
      Checking all auto mappings could be imported.
      Checking all objects are equally (across frameworks) in the main __init__.
      python utils/check_inits.py
      python utils/check_config_docstrings.py
      python utils/check_config_attributes.py
      python utils/check_doctest_list.py
      python utils/update_metadata.py --check-only
      python utils/check_task_guides.py
      39274045
    • Yuchao Dai's avatar
      LlamaTokenizer should be picklable (#24681) · fb3b22c3
      Yuchao Dai authored
      * LlamaTokenizer should be picklable
      
      * make fixup
      fb3b22c3
  2. 05 Jul, 2023 6 commits
  3. 04 Jul, 2023 7 commits
  4. 03 Jul, 2023 7 commits
  5. 01 Jul, 2023 1 commit
  6. 30 Jun, 2023 8 commits
  7. 29 Jun, 2023 8 commits
    • Sourab Mangrulkar's avatar
      fix peft ckpts not being pushed to hub (#24578) · 9e287502
      Sourab Mangrulkar authored
      * fix push to hub for peft ckpts
      
      * oops
      9e287502
    • MS Kim(tony9402)'s avatar
      Fix annotations (#24582) · 232c898f
      MS Kim(tony9402) authored
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      
      * fix annotations
      232c898f
    • Yih-Dar's avatar
      c817bc44
    • Sylvain Gugger's avatar
      Fix ESM models buffers (#24576) · 8c4471d1
      Sylvain Gugger authored
      * Fix ESM models buffers
      
      * Remove modifs
      
      * Tied weights keys are needed silly
      
      * quality
      8c4471d1
    • amyeroberts's avatar
      Removal of deprecated vision methods and specify deprecation versions (#24570) · b324557a
      amyeroberts authored
      * Removal of deprecated methods and specify versions
      
      * Fix tests
      b324557a
    • Yih-Dar's avatar
      Update some torchscript tests after #24505 (#24566) · 77db28dc
      Yih-Dar authored
      
      
      * fix
      
      * fix
      
      ---------
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      77db28dc
    • Sanchit Gandhi's avatar
      Add Musicgen (#24109) · 1c1c9075
      Sanchit Gandhi authored
      
      
      * Add Audiocraft
      
      * add cross attention
      
      * style
      
      * add for lm
      
      * convert and verify
      
      * introduce t5
      
      * split configs
      
      * load t5 + lm
      
      * clean conversion
      
      * copy from t5
      
      * style
      
      * start pattern provider
      
      * make generation work
      
      * style
      
      * fix pos embs
      
      * propagate shape changes
      
      * propagate shape changes
      
      * style
      
      * delay pattern: pad tokens at end
      
      * audiocraft -> musicgen
      
      * fix inits
      
      * add mdx
      
      * style
      
      * fix pad token in processor
      
      * override generate and add todos
      
      * add init to test
      
      * undo pattern delay mask after gen
      
      * remove cfg logits processor
      
      * remove cfg logits processor
      
      * remove logits processor in favour of mask
      
      * clean pos embs
      
      * make fix copies
      
      * update readmes
      
      * clean pos emb
      
      * refactor encoder/decoder
      
      * make fix copies
      
      * update conversion
      
      * fix config imports
      
      * update config docs
      
      * make style
      
      * send pattern mask to device
      
      * pattern mask with delay
      
      * recover prompted audio tokens
      
      * fix docstrings
      
      * laydown test file
      
      * pattern edge case
      
      * remove t5 ref
      
      * add processing class
      
      * config refactor
      
      * better pattern comment
      
      * check if mask is not present
      
      * check if mask is not present
      
      * refactor to auto class
      
      * remove encoder configs
      
      * fix processor
      
      * processor import
      
      * start updating conversion
      
      * start updating tests
      
      * make style
      
      * convert t5, encodec, lm
      
      * convert as composite
      
      * also convert processor
      
      * run generate
      
      * classifier free gen
      
      * comments and clean up
      
      * make style
      
      * docs for logit proc
      
      * docstring for uncond gen
      
      * start lm tests
      
      * work tests
      
      * let the lm generate
      
      * refactor: reshape inside forward
      
      * undo greedy loop changes
      
      * from_enc_dec -> from_sub_model
      
      * fix input id shapes in docstrings
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * undo generate changes
      
      * from sub model config
      
      * Update src/transformers/models/musicgen/modeling_musicgen.py
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * make generate work again
      
      * generate uncond -> get uncond inputs
      
      * remove prefix allowed tokens fn
      
      * better error message
      
      * logit proc checks
      
      * Apply suggestions from code review
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      
      * make decoder only tests work
      
      * composite fast tests
      
      * make style
      
      * uncond generation
      
      * feat extr padding
      
      * make audio prompt work
      
      * fix inputs docstrings
      
      * unconditional inputs: dict -> model output
      
      * clean up tests
      
      * more clean up tests
      
      * make style
      
      * t5 encoder -> auto text encoder
      
      * remove comments
      
      * deal with frames
      
      * fix auto text
      
      * slow tests
      
      * nice mdx
      
      * remove can generate
      
      * todo - hub id
      
      * convert m/l
      
      * make fix copies
      
      * only import generation with torch
      
      * ignore decoder from tests
      
      * don't wrap uncond inputs
      
      * make style
      
      * cleaner uncond inputs
      
      * add example to musicgen forward
      
      * fix docs
      
      * ignore MusicGen Model/ForConditionalGeneration in auto mapping
      
      * add doc section to toctree
      
      * add to doc tests
      
      * add processor tests
      
      * fix push to hub in conversion
      
      * tips for decoder only loading
      
      * Apply suggestions from code review
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * fix conversion for s / m / l checkpoints
      
      * import stopping criteria from module
      
      * remove from pipeline tests
      
      * fix uncond docstring
      
      * decode audio method
      
      * fix docs
      
      * org: sanchit-gandhi -> facebook
      
      * fix max pos embeddings
      
      * remove auto doc (not compatible with shapes)
      
      * bump max pos emb
      
      * make style
      
      * fix doc
      
      * fix config doc
      
      * fix config doc
      
      * ignore musicgen config from docstring
      
      * make style
      
      * fix config
      
      * fix config for doctest
      
      * consistent from_sub_models
      
      * don't automap decoder
      
      * fix mdx save audio file
      
      * fix mdx save audio file
      
      * processor batch decode for audio
      
      * remove keys to ignore
      
      * update doc md
      
      * update generation config
      
      * allow changes for default generation config
      
      * update tests
      
      * make style
      
      * fix docstring for uncond
      
      * fix processor test
      
      * fix processor test
      
      ---------
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      1c1c9075
    • Sylvain Gugger's avatar
      Revert "Fix typing annotations for FSDP and DeepSpeed in TrainingArguments" (#24574) · 2dc5e1a1
      Sylvain Gugger authored
      Revert "Fix typing annotations for FSDP and DeepSpeed in TrainingArguments (#24549)"
      
      This reverts commit c5e29d43.
      2dc5e1a1