- 13 Apr, 2021 9 commits
-
-
Suraj Patil authored
* refactor GPT2 * fix mlp and head pruning * address Sylvains comments * apply suggestion from code review Co-authored-by:Lysandre Debut <lysandre@huggingface.co>
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Replace error by warning when loading an architecture in another * Style * Style again * Add a test * Adapt old test
-
Yusuke Mori authored
* Start writing BERT-Japanese doc * Fix typo, Update toctree * Modify model file to use comment for document, Add examples * Clean bert_japanese by make style * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Split a big code block into two * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Add prefix >>> to all lines in code blocks * Clean bert_japanese by make fixup Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Suraj Patil authored
-
Lysandre Debut authored
* Fix GPT-2 warnings * Update src/transformers/models/gpt2/modeling_gpt2.py Co-authored-by:
Stas Bekman <stas00@users.noreply.github.com> Co-authored-by:
Stas Bekman <stas00@users.noreply.github.com>
-
Lysandre Debut authored
-
Ceyda Cinarel authored
wav2vec2 converter: create the proper vocab.json while converting fairseq wav2vec2 finetuned model (#11041) * add vocab while converting wav2vec2 original finetuned model * check save directory exists * return_attention_mask fix * quality
-
calpt authored
-
- 12 Apr, 2021 9 commits
-
-
Philipp Schmid authored
* increased train_runtime for model parallelism * added documentation for framework upgrade
-
Lysandre Debut authored
-
cronoik authored
-
NielsRogge authored
* First draft of deit * More improvements * Remove DeiTTokenizerFast from init * Conversion script works * Add DeiT to ViT conversion script * Add tests, add head model, add support for deit in vit conversion script * Update model checkpoint names * Update image_mean and image_std, set resample to bicubic * Improve docs * Docs improvements * Add DeiTForImageClassificationWithTeacher to init * Address comments by @sgugger * Improve feature extractors * Make fix-copies * Minor fixes * Address comments by @patil-suraj * All models uploaded * Fix tests * Remove labels argument from DeiTForImageClassificationWithTeacher * Fix-copies, style and quality * Fix tests * Fix typo * Multiple docs improvements * More docs fixes
-
Takuya Makino authored
-
fghuman authored
* Added documentation for data collator. * Update docs/source/data_collator.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Added documentation for data collator. * Added documentation for the data collator. * Merge branch 'doc_DataCollator' of C:\Users\mahii\PycharmProjects\transformers with conflicts. * Update documentation for the data collator. * Update documentation for the data collator. Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Amna <A.A.Ahmad@student.tudelft.nl>
-
Masatoshi TSUCHIYA authored
* model_path is refered as the path of the trainer, and should be ignored as the checkpoint path. * Improved according to Sgugger's comment.
-
Sylvain Gugger authored
-
cronoik authored
-
- 09 Apr, 2021 13 commits
-
-
Sylvain Gugger authored
-
Lysandre authored
-
Philipp Schmid authored
* added json dump and extraction of train run time * make style happy
-
Stas Bekman authored
* fix _LazyModule hasher error * reword
-
Suraj Patil authored
* keep a list of multilingual tokenizers * add forced_bos_token argument
-
Kevin Canwen Xu authored
* Add a special tokenizer for CPM model * make style * fix * Add docs * styles * cpm doc * fix ci * fix the overview * add test * make style * typo * Custom tokenizer flag * Add REAMDE.md Co-authored-by:Lysandre <lysandre.debut@reseau.eseo.fr>
-
Sylvain Gugger authored
-
Saviour Owolabi authored
Corrected a typo ('Downlowd' to 'Download') -
Keisuke Hirota authored
* Change duplicated LogitsProcessor to LogitsWarper in LogitsProcessorList document * Write more detailed information about LogitsProcessor's scores argument * apply suggestion from review * style Co-authored-by:Suraj Patil <surajp815@gmail.com>
-
Niklas Muennighoff authored
* Add Wav2Vec Inference notebook * Update docs/source/community.md Co-authored-by:Suraj Patil <surajp815@gmail.com>
-
Stas Bekman authored
* typo * style
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
- 08 Apr, 2021 9 commits
-
-
Stas Bekman authored
* make fairscale and deepspeed setup extras * fix default * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * no reason not to ask for the good version * update the CIs Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sylvain Gugger authored
* Add support for multiple models for one config in auto classes * Use get_values everywhere * Prettier doc
-
Stas Bekman authored
* extras[doc] must include 'all' * fix * better * regroup
-
Stas Bekman authored
* relocate core integration tests * add sys.path context manager * cleanup * try * try2 * fix path * doc * style * add dep * add 2 more deps
-
Andrea Cappelli authored
* Add mlm collator pad to multiple option (#10627) * Use padding to 8x in run mlm (#10627)
-
Sylvain Gugger authored
-
Philipp Schmid authored
-
Lysandre Debut authored
* Add fairscale and deepspeed back to the CI * Add deepspeed to single GPU tests
-
Stas Bekman authored
* solve "scheduler before optimizer step" warning * style * correct the state evaluation test
-