- 27 Jun, 2023 12 commits
-
-
Sylvain Gugger authored
* Find module name in an OS-agnostic fashion * address review comment
-
Yih-Dar authored
fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Wang, Yi authored
-
Sebastian authored
* Adding T5ForQuestionAnswering * Changed weight initialization that results in better initial loss when fine-tuning * Update to class variables * Running make fixup * Running make fix-copies * Remove model_parallel * Adding MT5ForQuestionAnswering * Adding docs * Fix wrong doc * Update src/transformers/models/mt5/modeling_mt5.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update src/transformers/models/t5/modeling_t5.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * File formatting * Undoing change --------- Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
-
Sourab Mangrulkar authored
* Update hyperparameter_search.py * resolve comments
-
Wang, Yi authored
use accelerate autocast in jit eval path, since mixed precision logic is in accelerator currently Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
-
Hyeonseo Yun authored
* docs: ko: tflite.mdx * feat: nmt and manual edit `tflite.mdx` * revised: resolve suggestions tflite.mdx Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> * revised: resolve suggestions and new line tflite.mdx Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com> Co-Authored-By: Kihoon Son <75935546+KIHOON71@users.noreply.github.com> Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com> Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com> Co-Authored-By: Jungnerd <46880056+jungnerd@users.noreply.github.com> --------- Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> Co-authored-by: Kihoon Son <75935546+KIHOON71@users.noreply.github.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com> Co-authored-by: Nayeon Han <nayeon2.han@gmail.com> Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
-
Yih-Dar authored
* fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Xiaoli Wang authored
* Fix TypeError: Object of type int64 is not JSON serializable * Convert numpy.float64 and numpy.int64 to float and int for json serialization * Black reformatted examples/pytorch/token-classification/run_ner_no_trainer.py * make style
-
Joao Gante authored
-
Joao Gante authored
* add exception * update docs
-
hukuda222 authored
* group_beam_search now works correctly * add argument descriptions * add a comment * format * make style * change comment * Update src/transformers/generation/beam_search.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> --------- Co-authored-by: shogo.fujita <shogo.fujita@legalontech.jp> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
-
- 26 Jun, 2023 15 commits
-
-
Gema Parreño authored
* fix link * new link --------- Co-authored-by: Gema <gema@mbp-de-gema-2.lan>
-
Yih-Dar authored
fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Tomoko Uchida authored
fix attribute error
-
Yih-Dar authored
* fix * fix * fix * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Younes Belkada authored
* add accelerate support for instructblip * add `_keep_in_fp32_modules` * dynamically adapt `_no_split_modules` * better fix * same logic for `_keep_in_fp32_modules`
-
Sylvain Gugger authored
Add support for for loops
-
condor-cp authored
Add link to PyTorch CrossEntropyLoss so that one understands why '-100' is ignored by the loss function.
-
Yih-Dar authored
* fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sourab Mangrulkar authored
* deepspeed z2/z1 state_dict bloating fix * update * version check
-
Wang, Yi authored
-
Younes Belkada authored
* fix str device issue * fixup * adapt from suggestions * forward contrib credits from suggestions * better fix * added backward compatibility for older PT versions * final fixes * oops * Attempting something with less branching. --------- Co-authored-by: amyeroberts <amyeroberts@users.noreply.github.com> Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
-
amyeroberts authored
Update type annotation
-
Meghan Cowan authored
fix for tpu metrics debugs string
-
Matthijs Hollemans authored
add missing alignment heads
-
NielsRogge authored
* Squash 88 commits * Use markdown * Remove mdx files due to bad rebase * Fix modeling files due to bad rebase * Fix style * Update comment * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 23 Jun, 2023 11 commits
-
-
Matt authored
* An end to accursed version-specific imports * No more K.is_keras_tensor() either * Update dependency tables * Use a cleaner call context function getter * Add a cap to <2.14 * Add cap to examples requirements too
-
Yih-Dar authored
fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sanchit Gandhi authored
* Allow dict input for audio classification pipeline * make style * Empty commit to trigger CI * Empty commit to trigger CI * check for torchaudio * add pip instructions Co-authored-by: Sylvain <sylvain.gugger@gmail.com> * Update src/transformers/pipelines/audio_classification.py Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> * asr -> audio class * asr -> audio class --------- Co-authored-by: Sylvain <sylvain.gugger@gmail.com> Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
-
Sourab Mangrulkar authored
-
Yih-Dar authored
* fix * fix * fix * fix * fix * fix * fix * fix * fix * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Moon Gi Cho authored
-
Bowen Bao authored
* Replace python random with torch.rand to enable dynamo.export * revert changes to flax model code * Remove unused random import * Fix torch template * Move torch.manual_seed(0) to right location
-
Sourab Mangrulkar authored
* fix the grad_acc issue at epoch boundaries Co-Authored-By: Zach Mueller <7831895+muellerzr@users.noreply.github.com> * add contributors. Co-authored-by: sumpster * address comments --------- Co-authored-by: Zach Mueller <7831895+muellerzr@users.noreply.github.com>
-
Younes Belkada authored
* fix `.to` call on 4bit models * better check
-
Sanchit Gandhi authored
* [AutoModel] Add AutoModelForTextEncoding * add mt5 * add other models * add to docs * fix tf imports * add tf to docs / init * up * fix inits * add to dummy objects
-
Weiming Zhao authored
Explain the reason to clone tensor
-
- 22 Jun, 2023 2 commits
-
-
Yih-Dar authored
* fix * fix * Upgrade complete! --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
-