- 19 Apr, 2022 16 commits
-
-
Dahlbomii authored
* Type hints added * return hints added * Update src/transformers/models/speech_to_text/modeling_tf_speech_to_text.py Co-authored-by:Matt <Rocketknight1@users.noreply.github.com>
-
SaulLu authored
* replace `Speech2TextTokenizer` by `Speech2TextFeatureExtractor` in docstring * quality
-
Jeevesh Juneja authored
* Correct Logging of Eval metric to Tensorboard An empty dictionary ``eval_metrics`` was being logged, is replaced by ``eval_metric`` which is the output dictionary of ``metric.compute()``. * Remove unused variable
-
Joao Gante authored
-
wiio12 authored
* Add doc about `attention_mask` on gpt2 Add a simple sentence describing how `attention_mask` needs to be constructed when ``past_key_values` is used. * Add doc about attention_mask on gpt2_tf * clean up style * remove empty line white spaces * remove whitespace in empty line
-
NielsRogge authored
* Add first draft * Improve README and run fixup * Make script aligned with other scripts, improve README * Improve script and add test * Remove print statement * Apply suggestions from code review * Add num_labels to make test pass * Improve README
-
Patrick von Platen authored
* correct * up
-
Ella Charlaix authored
* Add export of models with a multiple-choice classification head
-
Wonjae Kim authored
-
Dahlbomii authored
* Type hints added * make style * Return type hints added * fixed typo Co-authored-by:matt <rocketknight1@gmail.com>
-
code-review-doctor authored
* Fix issue avoid-misusing-assert-true found at https://codereview.doctor * fix tests * fix tf Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
Suraj Patil authored
* begin do_init * add params_shape_tree * raise error if params are accessed when do_init is False * don't allow do_init=False when keys are missing * make shape tree a property * assign self._params at the end * add test for do_init * add do_init arg to all flax models * fix param setting * disbale do_init for composite models * update test * add do_init in FlaxBigBirdForMultipleChoice * better names and errors * improve test * style * add a warning when do_init=False * remove extra if * set params after _required_params * add test for from_pretrained * do_init => _do_init * chage warning to info * fix typo * add params in init_weights * add params to gpt neo init * add params to init_weights * update do_init test * Trigger CI * Apply suggestions from code review Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * update template * trigger CI * style * style * fix template Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
Arthur authored
* Solved href rendering issue in heading Markdown references in headings such as '####' don't render well. Replaced it with <h4>...<a></a></h> banners. * PhonemeTokenizer optimization using phonemizer lib The backend should only be initialized once, otherwise it is reloaded. Added `init_backend` function, intializes a backend attribute. Phonemize re-uses self.backend. Should give ~10 times faster phonemization. * formatted file with make style * Documentation suggestion Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update /tokenization_wav2vec2_phoneme.py based on PR suggestion Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update CONTRIBUTING.md Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Li-Huai (Allan) Lin authored
* Fix docstrings * Fix up * Fix
-
NielsRogge authored
* Add first draft from previous PR * First draft * Improve README and remove num_labels * Make script more aligned with other scripts * Improve README and apply suggestion from code review
-
NielsRogge authored
Co-authored-by:Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
- 18 Apr, 2022 13 commits
-
-
David Hall authored
* fix _setup_devices in case where there is not torch.distributed * in training_args_sm.py as well
-
Lysandre Debut authored
* Refactor issues with yaml * Update .github/ISSUE_TEMPLATE/bug-report.yml Co-authored-by:
Anton Lozhkov <aglozhkov@gmail.com> * Update .github/ISSUE_TEMPLATE/bug-report.yml Co-authored-by:
Anton Lozhkov <aglozhkov@gmail.com> * Update .github/ISSUE_TEMPLATE/feature-request.yml Co-authored-by:
Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * Update .github/ISSUE_TEMPLATE/bug-report.yml Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * Update .github/ISSUE_TEMPLATE/bug-report.yml Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * Address review comments Co-authored-by:
Anton Lozhkov <aglozhkov@gmail.com> Co-authored-by:
Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
jsnfly authored
* Add passing encoder_outputs as tuple to existing test * Add check for tuple * Add check for tuple also for speech and vision Co-authored-by:jsnfly <jsnfly@gmx.de>
-
Nicholas Broad authored
* use base_version * make is_torch_less_than_1_8 match 1_11 Co-authored-by:Nicholas Broad <nicholas@nmbroad.com>
-
Patrick von Platen authored
* save intermediate * add vision * add vision * save * finish models * finish models * continue * finish * up * up * up * tests all pass * clean up * up * up * fix bugs in beit * correct docs * finish * finish docs * make style * up * more fixes * fix type hint * make style * Apply suggestions from code review Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update tests/data2vec/test_modeling_data2vec_vision.py Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * fix test Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Zhengqiang Yin authored
-
Patrick von Platen authored
* [Quicktour Audio] Improve && remove ffmpeg dependency * final fix * final touches
-
NielsRogge authored
* Improve code * Fix bugs * Fix another bug * Clean up DTP as well * Update DPT model outputs Co-authored-by:Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
Sylvain Gugger authored
-
Joao Gante authored
-
Allan Jie authored
* optimizer issues related to saving * remove the "optimizer saving" option * reformat using make style
-
Joao Gante authored
-
Joao Gante authored
-
- 17 Apr, 2022 1 commit
-
-
Sylvain Gugger authored
* Pin Jax to last working release * Try lower * Try lower
-
- 15 Apr, 2022 4 commits
-
-
NielsRogge authored
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Stas Bekman authored
* add low_cpu_mem_usage tests * wip: revamping * wip * install /usr/bin/time * wip * cleanup * cleanup * cleanup * cleanup * cleanup * fix assert * put the wrapper back * cleanup; switch to bert-base-cased * Trigger CI * Trigger CI
-
Stas Bekman authored
* [trainer / deepspeed] fix hyperparameter_search * require optuna * style * oops * add dep in the right place * create deepspeed-testing dep group * Trigger CI
-
- 14 Apr, 2022 6 commits
-
-
-
Sanchit Gandhi authored
-
NielsRogge authored
* Improve README * Make dataset_name argument optional * Improve local data * Fix bug * Improve README some more * Apply suggestions from code review * Improve README Co-authored-by:Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
Sylvain Gugger authored
-
Stas Bekman authored
* [deepspeed / m2m_100] make deepspeed 3 work with layerdrop * fix * revert last
-
Zachary Mueller authored
-