"vscode:/vscode.git/clone" did not exist on "502a10a6f89b2919444aba68cd0def51d5ba618c"
- 13 Jun, 2023 6 commits
-
-
Sylvain Gugger authored
* First test * Add info for all models * style * Repo consistency * Fix last model and cleanup prints * Repo consistency * Use consistent function for detecting tied weights
-
Sourab Mangrulkar authored
-
Sebastian authored
* Porting changes from https://github.com/microsoft/DeBERTa/ that hopefully allows for fp16 training of mdeberta * Updates to deberta modeling from microsoft repo * Performing some cleanup * Undoing changes that weren't necessary * Undoing float calls * Minimally change the p2c block * Fix error * Minimally changing the c2p block * Switch to torch sqrt * Remove math * Adding back the to calls to scale * Undoing attention_scores change * Removing commented out code * Updating modeling_sew_d.py to satisfy utils/check_copies.py * Missed changed * Further reduce changes needed to get fp16 working * Reverting changes to modeling_sew_d.py * Make same change in TF
-
amyeroberts authored
-
Nicolas Patry authored
-
yuanwu2017 authored
* fix: TextIteratorStreamer cannot work with pipeline Deepcopying the TextIteratorStreamer object causes the exception. Signed-off-by:
yuanwu <yuan.wu@intel.com> * Update src/transformers/pipelines/text_generation.py Got it. I will update the patch. Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/pipelines/text_generation.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update text_generation.py --------- Signed-off-by:
yuanwu <yuan.wu@intel.com> Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com>
-
- 12 Jun, 2023 11 commits
-
-
Zach Mueller authored
-
Yih-Dar authored
fix Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
fxmarty authored
* fix dtype init * fix copies * fix fixcopies mess * edit forward as well * copy
-
Yih-Dar authored
* fix * fix --------- Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Marc Sun authored
Fix test
-
Zach Mueller authored
* Working integration * Fix failing test * Revert label host logic * Bring it back!
-
Joao Gante authored
-
Yih-Dar authored
fix Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
-
AinL authored
* Change ProgressCallback to use dynamic_ncols=True * style: make style * Revert "style: make style" This reverts commit dee484904cd30a072d80e3be0a3d74a03cff30c6. * run make style only trainer_callback
-
NielsRogge authored
Add fix
-
- 10 Jun, 2023 1 commit
-
-
Yih-Dar authored
* fix * fix --------- Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
- 09 Jun, 2023 10 commits
-
-
Lysandre Debut authored
* Tool types * Tests + fixes * Isolate types * Oops * Review comments + docs * Tests + docs * soundfile -> vision
-
Freddie Vargus authored
-
Arthur authored
* small tokenizer uses `__start__` and `__end__` * fix PR doctest
-
Arthur authored
* Update documentation * nits
-
Younes Belkada authored
* fix sam test * update pipeline typehint
-
Sourab Mangrulkar authored
* fix the deepspeed test failures * apex fix * FSDP save ckpt fix * Update src/transformers/trainer.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Joao Gante authored
-
Matt authored
-
Younes Belkada authored
* fix bnb config json serialization * forward contrib credits from discussions --------- Co-authored-by:Andrechang <Andrechang@users.noreply.github.com>
-
Arthur authored
* preventllama fast from returning token type ids * remove type hints * normalised False
-
- 08 Jun, 2023 8 commits
-
-
Serge Panev authored
* Fix typo in Llama docstrings Signed-off-by:
Serge Panev <spanev@nvidia.com> * Update Signed-off-by:
Serge Panev <spanev@nvidia.com> * make style Signed-off-by:
Serge Panev <spanev@nvidia.com> --------- Signed-off-by:
Serge Panev <spanev@nvidia.com>
-
Radam茅s Ajna authored
* add trust_remote_code option * require_torch
-
Younes Belkada authored
[`GPT2`] Add correct keys on `_keys_to_ignore_on_load_unexpected` on all child classes of `GPT2PreTrainedModel` (#24113) * add correct keys on `_keys_to_ignore_on_load_unexpected` * oops
-
Marc Sun authored
* fix get_keys_to_not_convert funct * Fix style
-
Sylvain Gugger authored
-
Younes Belkada authored
* v1 * some refactor - add ST format as well * fix * add `ADAPTER_WEIGHTS_NAME` & `ADAPTER_SAFE_WEIGHTS_NAME`
-
Sourab Mangrulkar authored
-
Sadra Barikbin authored
-
- 07 Jun, 2023 4 commits
-
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Add AzureOpenAiAgent * quality * Update src/transformers/tools/agents.py Co-authored-by:
Lysandre Debut <lysandre.debut@reseau.eseo.fr> --------- Co-authored-by:
Lysandre Debut <lysandre.debut@reseau.eseo.fr>
-
Zachary Mueller authored
* Min accelerate * Also min version * Min accelerate * Also min version * To different minor version * Empty
-
Sourab Mangrulkar authored
* fix mixed precision prep during eval only mode * update to address comments * update to reflect the changes in accelerate
-