- 27 Jun, 2023 1 commit
-
-
hukuda222 authored
* group_beam_search now works correctly * add argument descriptions * add a comment * format * make style * change comment * Update src/transformers/generation/beam_search.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> --------- Co-authored-by:
shogo.fujita <shogo.fujita@legalontech.jp> Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com>
-
- 26 Jun, 2023 15 commits
-
-
Gema Parre帽o authored
* fix link * new link --------- Co-authored-by:Gema <gema@mbp-de-gema-2.lan>
-
Yih-Dar authored
fix Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Tomoko Uchida authored
fix attribute error
-
Yih-Dar authored
* fix * fix * fix * fix * fix * fix --------- Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Younes Belkada authored
* add accelerate support for instructblip * add `_keep_in_fp32_modules` * dynamically adapt `_no_split_modules` * better fix * same logic for `_keep_in_fp32_modules`
-
Sylvain Gugger authored
Add support for for loops
-
condor-cp authored
Add link to pytorch CrossEntropyLoss so that one understand why '-100' is ignore by the loss function.
-
Yih-Dar authored
* fix * fix --------- Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Sourab Mangrulkar authored
* deepspeed z2/z1 state_dict bloating fix * update * version check
-
Wang, Yi authored
-
Younes Belkada authored
* fix str device issue * fixup * adapt from suggestions * forward contrib credits from suggestions * better fix * added backward compatibility for older PT versions * final fixes * oops * Attempting something with less branching. --------- Co-authored-by:
amyeroberts <amyeroberts@users.noreply.github.com> Co-authored-by:
Nicolas Patry <patry.nicolas@protonmail.com>
-
amyeroberts authored
Update type annotation
-
Meghan Cowan authored
fix for tpu metrics debugs string
-
Matthijs Hollemans authored
add missing alignment heads
-
NielsRogge authored
* Squash 88 commits * Use markdown * Remove mdx files due to bad rebase * Fix modeling files due to bad rebase * Fix style * Update comment * fix --------- Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
- 23 Jun, 2023 11 commits
-
-
Matt authored
* An end to accursed version-specific imports * No more K.is_keras_tensor() either * Update dependency tables * Use a cleaner call context function getter * Add a cap to <2.14 * Add cap to examples requirements too
-
Yih-Dar authored
fix Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Sanchit Gandhi authored
* Allow dict input for audio classification pipeline * make style * Empty commit to trigger CI * Empty commit to trigger CI * check for torchaudio * add pip instructions Co-authored-by:
Sylvain <sylvain.gugger@gmail.com> * Update src/transformers/pipelines/audio_classification.py Co-authored-by:
Nicolas Patry <patry.nicolas@protonmail.com> * asr -> audio class * asr -> audio class --------- Co-authored-by:
Sylvain <sylvain.gugger@gmail.com> Co-authored-by:
Nicolas Patry <patry.nicolas@protonmail.com>
-
Sourab Mangrulkar authored
-
Yih-Dar authored
* fix * fix * fix * fix * fix * fix * fix * fix * fix * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * fix * fix * fix --------- Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com> Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Moon Gi Cho authored
-
Bowen Bao authored
* Replace python random with torch.rand to enable dynamo.export * revert changes to flax model code * Remove unused random import * Fix torch template * Move torch.manual_seed(0) to right location
-
Sourab Mangrulkar authored
* fix the grad_acc issue at epoch boundaries Co-Authored-By:
Zach Mueller <7831895+muellerzr@users.noreply.github.com> * add contributors. Co-authored-by: sumpster * address comments --------- Co-authored-by:
Zach Mueller <7831895+muellerzr@users.noreply.github.com>
-
Younes Belkada authored
* fix `.to` call on 4bit models * better check
-
Sanchit Gandhi authored
* [AutoModel] Add AutoModelForTextEncoding * add mt5 * add other models * add to docs * fix tf imports * add tf to docs / init * up * fix inits * add to dummy objects
-
Weiming Zhao authored
Explain the reason to clone tensor
-
- 22 Jun, 2023 11 commits
-
-
Yih-Dar authored
* fix * fix * Upgrade complete! --------- Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
-
Alex Hall authored
* Refactor hyperparameter search backends * Simpler refactoring without abstract base class * black * review comments: specify name in class use methods instead of callable class attributes name constant better * review comments: safer bool checking, log multiple available backends * test ALL_HYPERPARAMETER_SEARCH_BACKENDS vs HPSearchBackend in unit test, not module. format with black. * copyright
-
Matt authored
Fix segformer so compilation can figure out the channel dim
-
Josh authored
Update outdated hyperlink hpo_train.md Link to RayTune search space API docs was outdated - have provided correct new link for docs. Co-authored-by:Joshua Samuel <66880119+Joshsamuel101@users.noreply.github.com>
-
Yih-Dar authored
fix Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Younes Belkada authored
Revert "Fix gradient checkpointing + fp16 autocast for most models (#24247)" This reverts commit 285a4801.
-
Younes Belkada authored
* fix bnb issue * fixup * revert and do simple patching instead * add more details
-
Yih-Dar authored
fix Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Matt authored
* Slight comment cleanup * Reduce peak mem usage when loading TF-format safetensor weights * Tweak the PyTorch loading code to support lazy loading from safetensors * Pass safe_open objects to the PyTorch loading function * Do GPU transposes for speed * One more tweak to reduce peak usage further * One-line hasattr * Fix bug when there's a shape mismatch * Rename state_dict in the loading code to be clearer * Use TF format everywhere for consistency
-
Sanchit Gandhi authored
* [ASR pipeline] Check for torchaudio * add pip instructions Co-authored-by:
Sylvain Gugger <sylvain.gugger@gmail.com> --------- Co-authored-by:
Sylvain Gugger <sylvain.gugger@gmail.com>
-
- 21 Jun, 2023 2 commits
-
-
Yih-Dar authored
* fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix --------- Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Zach Mueller authored
Remove redundant code
-