- 20 Dec, 2021 6 commits
-
-
Kamal Raj authored
fix cmd typo
-
Chang Lan authored
* [trainer] Set bucket_cap_mb for DDP from arguments * Put find_unused_parameters into kwargs
-
Anton Lozhkov authored
* Add converted heads * Add dummies
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Patrick von Platen authored
* [Seq2SeqTrainer] Remove model input name hack * Update src/transformers/trainer_seq2seq.py * make style * finish
-
- 17 Dec, 2021 8 commits
-
-
Patrick von Platen authored
* [ImageGPT] Deprecate pixel_values input name to input_ids * up * Apply suggestions from code review Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * correct * finish Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com>
-
Patrick von Platen authored
* up * add tokenizer * improve more * finish tokenizer * finish * adapt speech recognition script * adapt convert * more fixes * more fixes * update phonemizer wav2vec2 * better naming * fix more tests * more fixes swedish * correct tests * finish * improve script * remove file * up * lets get those 100 model architectures until the end of the month * make fix-copies * correct more * correct script * more fixes * more fixes * add to docs * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * replace assert * fix copies * fix docs * new try docs * boom boom * update * add phonemizer to audio tests * make fix-copies * up * upload models * some changes * Update tests/test_tokenization_wav2vec2_phoneme.py Co-authored-by:
Anton Lozhkov <aglozhkov@gmail.com> * more fixes * remove @ Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Anton Lozhkov <aglozhkov@gmail.com>
-
Lysandre Debut authored
* BERT to mdx mdx :) c * Update docs/source/model_doc/bert.mdx Co-authored-by:
Julien Chaumond <julien@huggingface.co> * Remove all Co-authored-by:
sgugger <sylvain.gugger@gmail.com> Co-authored-by:
Julien Chaumond <julien@huggingface.co>
-
Sylvain Gugger authored
-
Daniel Stancl authored
* Implement head_mask for Flax BERT and other models copied from BERT * Remove `from jax._src.nn.functions import sigmoid` Remove `from jax._src.nn.functions import sigmoid` unintentionally added by IDE * Remove no more valid copy statement * Apply patil-suraj's suggestions from code review * Apply suggestions from the code review * Update Flax template * Fix a typo * Also update template for CausalLM modules
-
Patrick von Platen authored
* [Generate] Correct input_ids detection * correct
-
Patrick von Platen authored
* [WavLM] Layerdrop is not allowed for first layer * Apply suggestions from code review
-
NielsRogge authored
-
- 16 Dec, 2021 11 commits
-
-
Patrick von Platen authored
-
Lysandre Debut authored
-
Patrick von Platen authored
* first commit * fix some stuff * fix more readme * Apply suggestions from code review * update * correct * up * attn layer works * push code * make modedls work * Small change * more refactor * finish * up * fix convertsion * fix position bias * Fix style * fix conversion * make fix-copies * add * clean * fix docs * fix * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * apply final changes * make fix-copies Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Patrick von Platen authored
* finish refactor * refactor * add tests * add more tests * up * finish tests * finish * up * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * improve docstring * fix docs Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Anton Lozhkov authored
* Models * Squashed commit of the following: commit 72278e1e931a16d0879acc77f65762f3364833d0 Author: anton-l <aglozhkov@gmail.com> Date: Fri Dec 10 21:45:08 2021 +0300 * Add unispeech heads * Add sd/sv automodels * Docs cleanup * Fix docstrings * rename xvector classes * examples * Tests cleanup * Style * Better checkpoints for tests * leftover docs * apply review suggestions * Style + init tests * Update unispeech-sat tdnn downsampling
-
Matt authored
* Fix for TF train step when no "labels" key in input * make style
-
Kamal Raj authored
fix pip installation cmd
-
Kamal Raj authored
typo
-
Sylvain Gugger authored
* Fix the build documentation job * Fix install * Address review comment
-
Sylvain Gugger authored
* Clean up sphinx * Update contributing guide * Update docs README * No example title * Fix copies * Update CONTRIBUTING.md Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
-
Lysandre Debut authored
* First try * Update instructions
-
- 15 Dec, 2021 12 commits
-
-
Sylvain Gugger authored
* PoC for conserving old links * Do the same for other links * remap the redirects section * add instructions on how to move sections * improve Co-authored-by:Stas Bekman <stas@stason.org>
-
Sylvain Gugger authored
-
Lysandre authored
-
Lysandre authored
-
NielsRogge authored
* Fix docs * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Code quality Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Lysandre <lysandre.debut@reseau.eseo.fr>
-
NielsRogge authored
* Fix code examples * Fix code example
-
Matt authored
* Initial commit for Keras model cards * Revert accidental change * make style * make style * make style * Fix PR comments * Move repo creation to __init__ * Fixes to README.md creation * Partial progress for proper card creation on `push_to_hub` * Proper card creation from `push_to_hub` plus fixes for malformed model cards * Fixes for model card creation outside the callback * Adding a model card creation test * Putting the model card creation test in the right file. Good job, Matt. * make style * Fix model card test temp dir usage * Fix model card creation when no optimizer present * Fixes for when training history not present * Fix accidental edit to test_modeling_common
-
Xing Han Lu authored
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
* Fake new model * Fix doc-building test job * Is this the problem? * Another try * Typo * Clean up * Can we do without -e ? * Clean setup
-
Stas Bekman authored
-
Amit Chaudhary authored
* Fix broken links to distillation on index page of documentation * Fix broken link for distillation in main README * Run make fixup
-
- 14 Dec, 2021 3 commits
-
-
Nicolas Patry authored
* Adding support for multiple mask tokens. - Original implem: https://github.com/huggingface/transformers/pull/10222 Co-authored-by:
njafer <naveen.jafer@oracle.com> * In order to accomodate optionally multimodal models like Perceiver we add information to the tasks to specify tasks where we know for sure if we need the tokenizer/feature_extractor or not. * Adding info in the documentation about multi masks. + marked as experimental. * Add a copy() to prevent overriding the same tensor over and over. * Fixup. * Adding small test for multi mask with real values.. Co-authored-by:
njafer <naveen.jafer@oracle.com>
-
Benjamin Minixhofer authored
* use jax and jnp instead of numpy in data_loader * return batches as np.ndarray
-
Nicolas Patry authored
* Adding some slow test to check for perceiver at least from a high level. * Re-enabling fast tests for Perceiver ImageClassification. * Perceiver might try to run without Tokenizer (Fast doesn't exist) and with FeatureExtractor some text only pipelines. * Oops. * Adding a comment for `update_config_with_model_class`. * Remove `model_architecture` to get `tiny_config`. * Finalize rebase. * Smarter way to handle undefined FastTokenizer. * Remove old code. * Addressing some nits. * Don't instantiate `None`.
-