"tests/models/ernie/test_modeling_ernie.py" did not exist on "f1fe18465d8c4ee3f5710cdfd7de387a1d136f6b"
- 24 Feb, 2023 1 commit
-
-
Joao Gante authored
Generate - update cookie cutters to not initialize cache with training and gradient checkpointing (#21759)
-
- 14 Feb, 2023 1 commit
-
-
Sylvain Gugger authored
Final cleanup of TOKENIZER_FOR_DOC
-
- 07 Feb, 2023 2 commits
-
-
Sylvain Gugger authored
* Remove mentions of flake8/isort * Clean up inits * Deal with all other inits * Last special rule for dummy files
-
Arthur authored
* fix past renamed to past_key_value * update more `past` that were skipped * fixup * remove changes made to rag * refactor `_reorder_cache` to use `past_key_values` * fix git `prepare_inputs_for_generation` to pass tests when false is needed in use_cache
-
- 19 Jan, 2023 1 commit
-
-
Joao Gante authored
-
- 09 Jan, 2023 1 commit
-
-
Arthur authored
* small patches, forgot a line * refactor PT * the actual fix
-
- 08 Jan, 2023 1 commit
-
-
Arthur authored
* start cleanup * more updates * more models are affected * more updates * update generation utils * style * revert change that removed reorder cache * update generation utils * style * style * remove reorder cache
-
- 03 Jan, 2023 1 commit
-
-
Joao Gante authored
-
- 27 Dec, 2022 1 commit
-
-
Eli Simhayev authored
fix Jupyter typos
-
- 08 Dec, 2022 1 commit
-
-
Yih-Dar authored
* fix 1 * fix 2 * fix 3 * fix 4 Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 05 Dec, 2022 1 commit
-
-
Yih-Dar authored
* Remove is_encoder_decoder from some vision models * cleanup more * cleanup more Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 30 Nov, 2022 1 commit
-
-
amyeroberts authored
* Update doc example feature extractor -> image processor * Apply suggestions from code review
-
- 28 Nov, 2022 1 commit
-
-
Matt authored
* Add a test to ensure int dummy inputs are int64 * Move the test into the existing int64 test and update a lot of existing dummies * Fix remaining dummies * Fix remaining dummies * Test for int64 serving sigs as well * Update core tests to use tf.int64 * Add better messages to the assertions * Update all serving sigs to int64 * More sneaky hiding tf.int32s * Add an optional int32 signature in save_pretrained * make fixup * Add Amy's suggestions * Switch all serving sigs back to tf.int32 * Switch all dummies to tf.int32 * Adjust tests to check for tf.int32 instead of tf.int64 * Fix base dummy_inputs dtype * Start casting to tf.int32 in input_processing * Change dtype for unpack_inputs test * Add proper tf.int32 test * Make the alternate serving signature int64
-
- 09 Nov, 2022 1 commit
-
-
Joao Gante authored
* move generation_*.py src files into generation/*.py * populate generation.__init__ with lazy loading * move imports and references from generation.xxx.object to generation.object
-
- 18 Oct, 2022 1 commit
-
-
Arthur authored
-
- 11 Oct, 2022 1 commit
-
-
Joao Gante authored
🚨 🚨 🚨 TF: Remove `TFWrappedEmbeddings` (breaking: TF embedding initialization updated for encoder-decoder models) (#19263) * added test * correct embedding init * some changes in blenderbot (incomplete) * update blenderbot (diff to be used as reference) * update blenderbot_small * update LED * update marian * update T5 and remove TFWrappedEmbeddings * nullcontext() -> ContextManagers() * fix embedding init
-
- 22 Sep, 2022 1 commit
-
-
Joao Gante authored
-
- 15 Sep, 2022 1 commit
-
-
Matt authored
* Override save() to use the serving signature as the default * Replace int32 with int64 in all our serving signatures * Remember one very important line so as not to break every test at once * Dtype fix for TFLED * dtype fix for shift_tokens_right in general * Dtype fixes in mBART and RAG * Fix dtypes for test_unpack_inputs * More dtype fixes * Yet more mBART + RAG dtype fixes * Yet more mBART + RAG dtype fixes * Add a check that the model actually has a serving method
-
- 14 Sep, 2022 2 commits
-
-
Joao Gante authored
-
Sylvain Gugger authored
-
- 12 Sep, 2022 1 commit
-
-
Matt authored
* Update our TF 2.0 input format tip across all models * make style
-
- 07 Sep, 2022 1 commit
-
-
Joao Gante authored
-
- 03 Aug, 2022 1 commit
-
-
LSinev authored
Comparisons like version.parse(torch.__version__) > version.parse("1.6") are True for torch==1.6.0+cu101 or torch==1.6.0+cpu. Comparisons using version.parse(version.parse(torch.__version__).base_version) are preferred (and available in pytorch_utils.py).
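A minimal sketch of the pitfall this commit message describes, assuming the third-party `packaging` library is installed. The string `local_build` is a hypothetical stand-in for `torch.__version__`, so the snippet runs without torch; it is an illustration, not the code in pytorch_utils.py.

```python
from packaging import version

# Hypothetical stand-in for torch.__version__ on a CUDA build.
local_build = "1.6.0+cu101"

# Naive comparison: the local segment "+cu101" sorts after the plain
# release, so 1.6.0+cu101 is treated as strictly newer than 1.6.
naive = version.parse(local_build) > version.parse("1.6")

# Comparing on base_version drops the local segment first.
base = version.parse(version.parse(local_build).base_version)
strict = base > version.parse("1.6")

print(naive)   # True
print(strict)  # False
```

With the base version, a `+cu101` or `+cpu` build of 1.6.0 is no longer reported as newer than 1.6.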
-
- 01 Aug, 2022 1 commit
-
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 11 Jul, 2022 1 commit
-
-
Yulv-git authored
* Fix some typos. Signed-off-by: Yulv-git <yulvchi@qq.com> * Fix typo. Signed-off-by: Yulv-git <yulvchi@qq.com> * make fixup.
-
- 01 Jul, 2022 1 commit
-
-
Sanchit Gandhi authored
* [Flax] Add remat (gradient checkpointing) * fix variable naming in test * flip: checkpoint using a method * fix naming * fix class naming * apply PVP's suggestions from code review * make fix-copies * fix big-bird, electra, roberta * cookie-cutter * fix flax big-bird * move test to common
-
- 29 Jun, 2022 1 commit
-
-
Leon Derczynski authored
-
- 20 Jun, 2022 2 commits
-
-
Yih-Dar authored
* Use torch.finfo(self.dtype).min * for GPTNeoX * for Albert * For Splinter * Update src/transformers/models/data2vec/modeling_data2vec_audio.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * fix -inf used in Bart-like models * Fix a few remaining -inf * more fix * clean up * For CLIP * For FSMT * clean up * fix test * Add dtype argument and use it for LayoutLMv3 * update FlaxLongT5Attention Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
Joao Gante authored
* Also propagate changes to blenderbot, blenderbot_small, marian, mbart, and pegasus
-
- 13 Jun, 2022 1 commit
-
-
Ayush Mangal authored
-
- 16 May, 2022 1 commit
-
-
cloudhan authored
Change config.encoder_ffn_dim -> config.decoder_ffn_dim for decoder.
-
- 12 May, 2022 1 commit
-
-
Suraj Patil authored
-
- 10 May, 2022 1 commit
-
-
Sylvain Gugger authored
-
- 09 May, 2022 1 commit
-
-
Dom Miketa authored
* rebase and isort * modify cookiecutter init * fix cookiecutter auto imports * fix clean_frameworks_in_init * fix add_model_to_main_init * blackify * replace unnecessary f-strings * update yolos imports * fix roberta import bug * fix yolos missing dependency * fix add_model_like and cookiecutter bug * fix repository consistency error * modify cookiecutter, fix add_new_model_like * remove stale line Co-authored-by: Dom Miketa <dmiketa@exscientia.co.uk>
-
- 03 May, 2022 3 commits
-
-
Pavel Belevich authored
-
Yih-Dar authored
* move test model folders (TODO: fix imports and others) * fix (potentially partially) imports (in model test modules) * fix (potentially partially) imports (in tokenization test modules) * fix (potentially partially) imports (in feature extraction test modules) * fix import utils.test_modeling_tf_core * fix path ../fixtures/ * fix imports about generation.test_generation_flax_utils * fix more imports * fix fixture path * fix get_test_dir * update module_to_test_file * fix get_tests_dir from wrong transformers.utils * update config.yml (CircleCI) * fix style * remove missing imports * update new model script * update check_repo * update SPECIAL_MODULE_TO_TEST_MAP * fix style * add __init__ * update self-scheduled * fix add_new_model scripts * check one way to get location back * python setup.py build install * fix import in test auto * update self-scheduled.yml * update slack notification script * Add comments about artifact names * fix for yolos Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sanchit Gandhi authored
* [FlaxBert] Add ForCausalLM * make style * fix output attentions * Add RobertaForCausalLM * remove comment * fix fx-to-pt model loading * remove comment * add modeling tests * add enc-dec model tests * add big_bird * add electra * make style * make repo-consistency * add to docs * remove roberta test * quality * amend cookiecutter * fix attention_mask bug in flax bert model tester * tighten pt-fx thresholds to 1e-5 * add 'copied from' statements * amend 'copied from' statements * amend 'copied from' statements * quality
-
- 25 Apr, 2022 1 commit
-
-
Joao Gante authored
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 19 Apr, 2022 1 commit
-
-
Suraj Patil authored
* begin do_init * add params_shape_tree * raise error if params are accessed when do_init is False * don't allow do_init=False when keys are missing * make shape tree a property * assign self._params at the end * add test for do_init * add do_init arg to all flax models * fix param setting * disable do_init for composite models * update test * add do_init in FlaxBigBirdForMultipleChoice * better names and errors * improve test * style * add a warning when do_init=False * remove extra if * set params after _required_params * add test for from_pretrained * do_init => _do_init * change warning to info * fix typo * add params in init_weights * add params to gpt neo init * add params to init_weights * update do_init test * Trigger CI * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * update template * trigger CI * style * style * fix template Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
- 12 Apr, 2022 1 commit
-
-
Anmol Joshi authored
* Moved functions to pytorch_utils.py * isort formatting * Reverted tf changes * isort, make fix-copies * documentation fix * Fixed Conv1D import * Reverted research examples file * backward compatibility for pytorch_utils * missing import * isort fix
-