- 18 Apr, 2022 7 commits
-
-
Patrick von Platen authored
* [Quicktour Audio] Improve && remove ffmpeg dependency * final fix * final touches
-
NielsRogge authored
* Improve code * Fix bugs * Fix another bug * Clean up DTP as well * Update DPT model outputs Co-authored-by:Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
Sylvain Gugger authored
-
Joao Gante authored
-
Allan Jie authored
* optimizer issues related to saving * remove the "optimizer saving" option * reformat using make style
-
Joao Gante authored
-
Joao Gante authored
-
- 17 Apr, 2022 1 commit
-
-
Sylvain Gugger authored
* Pin Jax to last working release * Try lower * Try lower
-
- 15 Apr, 2022 4 commits
-
-
NielsRogge authored
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Stas Bekman authored
* add low_cpu_mem_usage tests * wip: revamping * wip * install /usr/bin/time * wip * cleanup * cleanup * cleanup * cleanup * cleanup * fix assert * put the wrapper back * cleanup; switch to bert-base-cased * Trigger CI * Trigger CI
-
Stas Bekman authored
* [trainer / deepspeed] fix hyperparameter_search * require optuna * style * oops * add dep in the right place * create deepspeed-testing dep group * Trigger CI
-
- 14 Apr, 2022 9 commits
-
-
-
Sanchit Gandhi authored
-
NielsRogge authored
* Improve README * Make dataset_name argument optional * Improve local data * Fix bug * Improve README some more * Apply suggestions from code review * Improve README Co-authored-by:Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
Sylvain Gugger authored
-
Stas Bekman authored
* [deepspeed / m2m_100] make deepspeed 3 work with layerdrop * fix * revert last
-
Zachary Mueller authored
-
Sylvain Gugger authored
* Fix batch size in evaluation loop * remove debug statement
-
Sanchit Gandhi authored
* [Flax] Raise a warning if model weights are not in float32 * apply suggestions and few small changes * reorder wording for better readability
-
Nicolas Patry authored
* Enabling `Tapex` in table question answering pipeline. * Questions are independant for Tapex, making the test respect that. * Missing extra space.
-
- 13 Apr, 2022 14 commits
-
-
Bhadresh Savani authored
* added doctest changes for electra * fixed doctest tests * updated changes
-
Zachary Mueller authored
* Change tracking to store_true * Remove step param and use it in the log dictionary directly * use vars(args) when passing args to init_trackers * Include tracking tests since tensorboard is already a dep
-
Stas Bekman authored
-
Tu Vu authored
* Add self-training code for text-classification * Add self-training code for text-classification * Add self-training code for text-classification * Add self-training code for text-classification * Add self-training code for text-classification * Delete strata
-
Sylvain Gugger authored
* Add defensive check for config num_labels and id2label * Actually check value... * Only warning inside init plus better error message
-
Yih-Dar authored
* Make Funnel Test less flaky Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
* Setup-dependent pip cache * Do not restore from old versions
-
Stas Bekman authored
-
Jeremy Fisher authored
* Improve CTRL doctests * Fix `CTRLForSequenceClassification` flakiness with inconsistent losses * Remove unused * Fixup * Add CTRL to documentation_tests.txt * Fix control code not being first * Add output assertions * Change from sshleifer/tiny-ctrl -> ctrl * Run `make fixup` * apply `list` to output logits shape for clarity * Reduce output loss precision to make assertion more robust * Add assertion of control code being first * Fix docstyle * upper case sentence following control code * Weird bug fixes * Add a better generation example Co-authored-by:Yih-Dar <2521628+ydshieh@users.noreply.github.com>
-
Michael Chung authored
* Required the values GPTJ unfortunately cannot run the model =) * Added the file to the doc tests * Run Fixup and Style * Fixed with the test versions of gptj. Ran Style and Fixup. * Trigger ci * A Minor Change to License * Fixed spacing added to the benchmark_utils. Then refactored tests to const variables. * Removed strings that were included as default parameters anyways. Co-authored-by:ArEnSc <xx.mike.chung.xx@gmail.com>
-
Stas Bekman authored
-
davidleonfdez authored
* Fix setters of *_token_id properties of SpecialTokensMixin * Test setters of common tokens ids * Move to a separate test checks of setters of tokens ids * Add independent test for ByT5 * Add Canine test * Test speech to text
-
Patrick von Platen authored
* [Doctests] Fix all T5 doc tests * make style * Update docs/source/en/model_doc/t5.mdx Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply Sylvains comments * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Santiago Castro authored
* Normalize using a logits warper * Add a flag in `generate` to support the logit renormalization * Add in RAG
-
- 12 Apr, 2022 5 commits
-
-
Joao Gante authored
-
Minh Chien Vu authored
* add Bigbird ONNX config
-
Sanchit Gandhi authored
* [FlaxWav2Vec2Model] Fix bug in attention mask * more fixes * add (Flax)SpeechEncoderDecoderModel PT-FX cross-test
-
Sanchit Gandhi authored
* [FlaxSpeechEncoderDecoder] Fix input shape bug in weights init * make style
-
hiromu authored
* start working * fix: ReformerForQA doctest * fix: ReformerModelWithLMHead doctest * fix: ReformerModelForSC doctest * fix: ReformerModelForMLM doctest * add: documentation_tests.txt * make fixup * change: ReformerModelForSC doctest * change: checkpoint
-