- 13 Jun, 2023 7 commits
-
-
Sylvain Gugger authored
* First test * Add info for all models * style * Repo consistency * Fix last model and cleanup prints * Repo consistency * Use consistent function for detecting tied weights
-
Sourab Mangrulkar authored
-
Sebastian authored
* Porting changes from https://github.com/microsoft/DeBERTa/ that hopefully allows for fp16 training of mdeberta * Updates to deberta modeling from microsoft repo * Performing some cleanup * Undoing changes that weren't necessary * Undoing float calls * Minimally change the p2c block * Fix error * Minimally changing the c2p block * Switch to torch sqrt * Remove math * Adding back the to calls to scale * Undoing attention_scores change * Removing commented out code * Updating modeling_sew_d.py to satisfy utils/check_copies.py * Missed changed * Further reduce changes needed to get fp16 working * Reverting changes to modeling_sew_d.py * Make same change in TF
-
amyeroberts authored
-
Nicolas Patry authored
-
Yih-Dar authored
fix Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
yuanwu2017 authored
* fix: TextIteratorStreamer cannot work with pipeline Deepcopying the TextIteratorStreamer object causes the exception. Signed-off-by:
yuanwu <yuan.wu@intel.com> * Update src/transformers/pipelines/text_generation.py Got it. I will update the patch. Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/pipelines/text_generation.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update text_generation.py --------- Signed-off-by:
yuanwu <yuan.wu@intel.com> Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com>
-
- 12 Jun, 2023 19 commits
-
-
Sylvain Gugger authored
-
Yih-Dar authored
fix Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Zach Mueller authored
-
Yih-Dar authored
fix Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
fxmarty authored
* fix dtype init * fix copies * fix fixcopies mess * edit forward as well * copy
-
Yih-Dar authored
* fix * fix --------- Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Ethan authored
Fix step bugs in no trainer + load checkpoint + grad acc
-
Marc Sun authored
Fix test
-
Zach Mueller authored
* Working integration * Fix failing test * Revert label host logic * Bring it back!
-
Kihoon Son authored
* 🌐 [i18n-KO] Translated tasks_summary.mdx to Korean Co-Authored-By:Hyeonseo Yun <0525yhs@gmail.com> Co-Authored-By:
Wonhyeong Seo <wonhseo@kakao.com> Co-Authored-By:
Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-Authored-By:
Gabriel Yang <gabrielwithhappy@gmail.com> Co-Authored-By:
Jungnerd <46880056+jungnerd@users.noreply.github.com> Co-Authored-By:
Nayeon Han <nayeon2.han@gmail.com> * Apply suggestions from code review Co-authored-by:
Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-authored-by:
Hyeonseo Yun <0525yhs@gmail.com> * Update _toctree.yml * Delete generation_strategies.mdx * Delete tasks_explained.mdx --------- Co-authored-by:
Hyeonseo Yun <0525yhs@gmail.com> Co-authored-by:
Wonhyeong Seo <wonhseo@kakao.com> Co-authored-by:
Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-authored-by:
Gabriel Yang <gabrielwithhappy@gmail.com> Co-authored-by:
Jungnerd <46880056+jungnerd@users.noreply.github.com> Co-authored-by:
Nayeon Han <nayeon2.han@gmail.com>
-
Joao Gante authored
-
Jacob authored
* typo: fix typos in CONTRIBUTING.md and deepspeed.mdx * Update CONTRIBUTING.md Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Yih-Dar authored
* fix * fix * fix --------- Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
fix Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
-
Kihoon Son authored
* [i18n]Translated "attention.mdx" to korean Co-Authored-By:
Hyeonseo Yun <0525yhs@gmail.com> Co-Authored-By:
Wonhyeong Seo <wonhseo@kakao.com> Co-Authored-By:
Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-Authored-By:
Gabriel Yang <gabrielwithhappy@gmail.com> Co-Authored-By:
Nayeon Han <nayeon2.han@gmail.com> Co-Authored-By:
Jungnerd <46880056+jungnerd@users.noreply.github.com> * Apply suggestions from code review Co-authored-by:
Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-authored-by:
Hyeonseo Yun <0525yhs@gmail.com> * Update _toctree.yml --------- Co-authored-by:
Hyeonseo Yun <0525yhs@gmail.com> Co-authored-by:
Wonhyeong Seo <wonhseo@kakao.com> Co-authored-by:
Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-authored-by:
Gabriel Yang <gabrielwithhappy@gmail.com> Co-authored-by:
Nayeon Han <nayeon2.han@gmail.com> Co-authored-by:
Jungnerd <46880056+jungnerd@users.noreply.github.com>
-
AinL authored
* Change ProgressCallback to use dynamic_ncols=True * style: make style * Revert "style: make style" This reverts commit dee484904cd30a072d80e3be0a3d74a03cff30c6. * run make style only trainer_callback
-
NielsRogge authored
Add fix
-
Yih-Dar authored
fix Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
- 10 Jun, 2023 2 commits
-
-
Yih-Dar authored
* fix * fix --------- Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Stas Bekman authored
fix bitsandbytes import issue
-
- 09 Jun, 2023 12 commits
-
-
Lysandre Debut authored
* Tool types * Tests + fixes * Isolate types * Oops * Review comments + docs * Tests + docs * soundfile -> vision
-
Freddie Vargus authored
-
LiamSwayne authored
Update image_classification.mdx
-
Yih-Dar authored
* fix * fix * fix --------- Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Arthur authored
* small tokenizer uses `__start__` and `__end__` * fix PR doctest
-
Arthur authored
* Update documentation * nits
-
Younes Belkada authored
* fix sam test * update pipeline typehint
-
Yih-Dar authored
fix Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
fix Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
fix Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Sourab Mangrulkar authored
* fix the deepspeed test failures * apex fix * FSDP save ckpt fix * Update src/transformers/trainer.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Joao Gante authored
-