- 22 Nov, 2021 1 commit
-
-
Stas Bekman authored
* add test for --config_overrides * remove unneeded parts of the test
-
- 19 Nov, 2021 1 commit
-
-
Shang Zhang authored
* clean up branch for add-qdqbert-model * README update for QAT example; update docstrings in modeling_qdqbert.py * Update qdqbert.rst * Update README.md * Update README.md * calibration data using traning set; QAT example runs in fp32 * re-use BERTtokenizer for qdqbert * Update docs/source/model_doc/qdqbert.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/model_doc/qdqbert.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/model_doc/qdqbert.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * remove qdqbert tokenizer * Update qdqbert.rst * update evaluate-hf-trt-qa.py * update configuration_qdqbert.py * update modeling_qdqbert.py: add copied statement; replace assert with ValueError * update copied from statement * add is_quantization_available; run make fix-copies * unittest add require_quantization * add backend dependency to qdqbert model * update README; update evaluate script; make style * lint * docs qdqbert update * circleci build_doc add pytorch-quantization for qdqbert * update README * update example readme with instructions to upgrade TensorRT to 8.2 * Update src/transformers/models/qdqbert/configuration_qdqbert.py Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * Update src/transformers/models/qdqbert/configuration_qdqbert.py Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * Update src/transformers/models/qdqbert/configuration_qdqbert.py Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * Update src/transformers/models/qdqbert/configuration_qdqbert.py Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * change quantization to pytorch_quantization for backend requirement * feed_forward_chunking not supported in QDQBert * make style * update model docstrings and comments in testing scripts * rename example to quantization-qdqbert; rename example scripts from qat to quant * Update src/transformers/models/qdqbert/modeling_qdqbert.py Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * rm experimental functions in quant_trainer * qa cleanup * make fix-copies for docs index.rst * fix doctree; use post_init() for qdqbert * fix early device assignment for qdqbert * fix CI:Model templates runner Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
- 18 Nov, 2021 2 commits
-
-
Patrick von Platen authored
Add more XLS-R training runs to the official examples
-
William Held authored
-
- 17 Nov, 2021 1 commit
-
-
Antonio Carlos Falc茫o Petri authored
Co-authored-by:Stas Bekman <stas@stason.org>
-
- 15 Nov, 2021 2 commits
-
-
Eldar Kurtic authored
Running Movement pruning experiments with the newest HuggingFace would crash due to non-existing BertLayerNorm.
-
Matt authored
-
- 12 Nov, 2021 1 commit
-
-
Patrick von Platen authored
* improve some stuff * finish * correct last
-
- 11 Nov, 2021 3 commits
-
-
Stas Bekman authored
-
Matt authored
* Fixing requirements for TF LM models and use correct model mappings * make style
-
Suraj Patil authored
* fix inits * fix embed dtype * fix embed dtype * add test to check default dtype * quality * add type conversion methods for flax models * more robust casting * cast sinusoidal positions * update pegasus * update albert * update test * make sure dtype is passed to every module * style * fix electra dense * fix t5 * quality * add more tests * better name * use the dtype for lm head computation * fix albert * style * fix albert embed dtype * more tests * fix vision enc-dec * cleanup * fix embed dtype pegasus * fix default param test * doc * update template * fix final_logits_bias dtype * Apply suggestions from code review Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * fix doc * fix doc * add detailed docstring for dtype parameter * remove un-necessary import Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
- 09 Nov, 2021 2 commits
-
-
Suraj Patil authored
-
karthikrangasai authored
* Update postporcessing accordingly to use SQuAD metric. * Update assets accordingly based on SQuAD metrics. * Fix function naming error.
-
- 06 Nov, 2021 1 commit
-
-
Junbum Lee authored
-
- 05 Nov, 2021 1 commit
-
-
Sylvain Gugger authored
-
- 02 Nov, 2021 1 commit
-
-
Sylvain Gugger authored
* Update Transformers to huggingface_hub >= 0.1.0 * Forgot to save... * Style * Fix test
-
- 01 Nov, 2021 1 commit
-
-
NielsRogge authored
-
- 29 Oct, 2021 1 commit
-
-
Thomas Wang authored
* Remove n_ctx from configs * Fix GPTJ and OpenAIGPT, both are acceptable breaking changes as there are no configs such that it breaks * Remove unecessary n_positions from TFOpenAIGPT
-
- 28 Oct, 2021 5 commits
-
-
Patrick von Platen authored
-
Lysandre authored
-
Lysandre authored
-
Anton Lozhkov authored
-
Patrick von Platen authored
-
- 27 Oct, 2021 2 commits
-
-
Patrick von Platen authored
-
Patrick von Platen authored
* up * up * fix * up * Update examples/pytorch/test_xla_examples.py * correct labels * up * up * up * up * up * up
-
- 26 Oct, 2021 6 commits
-
-
Emanuel Huber authored
Updated masked-language modeling examples in pytorch with convention defined by #12789
-
Matthew Goldey authored
* specify the text column name in the error message * pluralize the word fields
-
Jangwon Park authored
-
Patrick von Platen authored
-
Patrick von Platen authored
[Speech Recognition] - Distributed training: Make sure vocab file removal and creation don't interfer (#14161) * up * better
-
Patrick von Platen authored
-
- 25 Oct, 2021 4 commits
-
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Reza Gharibi authored
-
karthikrangasai authored
* Add seq2seq example for QnA on SQuAD Dataset. * Changes from review - Fixing styling mistakes. * Added how to example in README, simplified the access to dataset's preprocess function. * Added tests for the seq2seq QA example. * Change dataset column name to fix tests. * Fix test command mistake. * Add missing argument 'ignore_pad_token_for_loss' from DataTrainingArguments. * Add missing argument 'num_beams' from DataTrainingArguments. * Fix processing of output predicted token ids so that tokenizer decode gets appropriate input. Updated assertion conditions on the tests.
-
- 22 Oct, 2021 1 commit
-
-
Antonio Carlos Falc茫o Petri authored
-
- 21 Oct, 2021 4 commits
-
-
lee1jun authored
last line: "# limitations under the License." is missing
-
Anton Lozhkov authored
* Update SEW integration test tolerance * Add audio classification notebooks
-
Christopher Akiki authored
-
Patrick von Platen authored
-