Commits · 6a025487a63a206f2438b1dab426c5c8adc36144 · chenpangpang / transformers

12 Dec, 2021 1 commit
- [Flax examples] remove dependancy on pytorch training args (#14636) · 6a025487
  Suraj Patil authored Dec 12, 2021
```
* use custom training arguments

* update tests
```
  6a025487
09 Dec, 2021 2 commits
- Docs for v4.14.0dev0 · ab31b3e4
  Lysandre authored Dec 09, 2021
  
  ab31b3e4
- Release: v4.13.0 · 4da3a696
  Lysandre authored Dec 09, 2021
  
  4da3a696
08 Dec, 2021 1 commit

fix: verify jsonlines file in run_translation (#14660) (#14661) · 4ea19de8

Gaurang Tandon authored Dec 08, 2021

* fix: verify jsonl in run_translation (#14660)

* fix(run_translation.py): json/jsonl validation

Both json and jsonl are to be accepted as valid jsonlines file extension

* fix(run_translation.py): make black happy

* Ran make style

4ea19de8

06 Dec, 2021 6 commits

fix flax examples tests (#14646) · 75ae287a

Suraj Patil authored Dec 07, 2021

* make tensorboard optional

* update test_fetcher for flax examples

* make the tests slow

75ae287a

fix flax example tests (#14643) · cbe60265
Suraj Patil authored Dec 06, 2021

cbe60265

Update the example of exporting Bart + BeamSearch to ONNX module to resolve comments. (#14310) · 1ccc033c

Jay Zhang authored Dec 06, 2021



* Update code to resolve comments left in previous PR.

* Add README.md file for this example.

* Update examples/onnx/pytorch/translation/README.md
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update examples/onnx/pytorch/translation/README.md
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update examples/onnx/pytorch/translation/README.md
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update README.md file to resolve comments.

* Add a section name.

* Update examples/onnx/pytorch/translation/README.md
Co-authored-by: Gary Miguel <garymm@garymm.org>

* Add more comments for _convert_past_list_to_tuple().

* Change the default file name to a consistent one.

* Fix a format issue.

* Update examples/onnx/pytorch/translation/README.md
Co-authored-by: Gary Miguel <garymm@garymm.org>

* Update examples/onnx/pytorch/translation/run_onnx_exporter.py
Co-authored-by: Gary Miguel <garymm@garymm.org>

* Update examples/onnx/pytorch/translation/README.md
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

* Change the folder to summarization and address some other coments.

* Update the torch version.
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Gary Miguel <garymm@garymm.org>
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

1ccc033c

[urls to hub] Replace outdated model tags with their now-canonical pipeline types (#14617) · 6cdc3a78
Julien Chaumond authored Dec 06, 2021
```
* Replace outdated model tags with their now-canonical pipeline types

* spam the CI till it's green
```
6cdc3a78

Add Flax example tests (#14599) · c5bd732a

Suraj Patil authored Dec 06, 2021

* add test for glue

* add tests for clm

* fix clm test

* add summrization tests

* more tests

* fix few tests

* add test for t5 mlm

* fix t5 mlm test

* fix tests for multi device

* cleanup

* ci job

* fix metric file name

* make t5 more robust

c5bd732a

updated readme with proper arguments (#14624) · 803a8cd1
Kamal Raj authored Dec 06, 2021

803a8cd1

05 Dec, 2021 1 commit
- fix a typo (#14626) · 3977b584
  (Bill) Yuchen Lin authored Dec 04, 2021
  
  3977b584
02 Dec, 2021 1 commit

Add CodeParrot 🦜 codebase (#14536) · 43f953cc

Leandro von Werra authored Dec 02, 2021



* add readme skeleton

* update readme

* add initialization script

* add deduplication script

* add codeparrot training script

* add code generation evaluation

* add validation loss script

* add requirements

* update readme

* tweak readme

* make style

* add highlights to readme

* add CLIs to scripts

* add tokenizer training script

* add docstring to constant length dataset

* fix defaults in arguments

* update readme with cli

* move image to hub

* tweaks of readme

* fix cli commands

* add author

* explain env variables

* fix formatting

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

* Apply suggestions from code review
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

* replace generic with gpt2 tokenizer
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

43f953cc

01 Dec, 2021 1 commit

Doc new front (#14590) · 4df7d05a

Sylvain Gugger authored Dec 01, 2021



* Convert PretrainedConfig doc to Markdown

* Use syntax

* Add necessary doc files (#14496)

* Doc fixes (#14499)

* Fixes for the new front

* Convert DETR file for table

* Title is needed

* Simplify a bit

* Even simpler

* Remove imports

* Fix typo in toctree (#14516)

* Fix checkpoints badge

* Update versions.yml format (#14517)

* Doc new front github actions (#14512)

* Doc new front github actions

* Fix docstring

* Fix feature extraction utils import (#14515)

* Address Julien's comments

* Push to doc-builder

* Ready for merge

* Remove old build and deploy

* Doc misc fixes (#14583)

* Rm versions.yml from doc

* Fix converting.rst

* Rm pretrained_models from toctree

* Fix index links (#14567)

* Fix links in README

* Localized READMEs

* Fix copy script

* Fix find doc script

* Update README_ko.md
Co-authored-by: Julien Chaumond <julien@huggingface.co>
Co-authored-by: Julien Chaumond <julien@huggingface.co>

* Adapt build command to new CLI tools (#14578)

* Fix typo

* Fix doc interlinks (#14589)

* Convert PretrainedConfig doc to Markdown

* Use syntax

* Rm pattern <[a-z]+(.html).*>

* Rm huggingface.co/transformers/master

* Rm .html

* Rm .html from index.mdx

* Rm .html from model_summary.rst

* Update index.mdx rm html

* Update remove .html

* Fix inner doc links

* Fix interlink in preprocssing.rst

* Update pr_checks
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Convert PretrainedConfig doc to Markdown

* Use syntax

* Add necessary doc files (#14496)

* Doc fixes (#14499)

* Fixes for the new front

* Convert DETR file for table

* Title is needed

* Simplify a bit

* Even simpler

* Remove imports

* Fix checkpoints badge

* Fix typo in toctree (#14516)

* Update versions.yml format (#14517)

* Doc new front github actions (#14512)

* Doc new front github actions

* Fix docstring

* Fix feature extraction utils import (#14515)

* Address Julien's comments

* Push to doc-builder

* Ready for merge

* Remove old build and deploy

* Doc misc fixes (#14583)

* Rm versions.yml from doc

* Fix converting.rst

* Rm pretrained_models from toctree

* Fix index links (#14567)

* Fix links in README

* Localized READMEs

* Fix copy script

* Fix find doc script

* Update README_ko.md
Co-authored-by: Julien Chaumond <julien@huggingface.co>
Co-authored-by: Julien Chaumond <julien@huggingface.co>

* Adapt build command to new CLI tools (#14578)

* Fix typo

* Fix doc interlinks (#14589)

* Convert PretrainedConfig doc to Markdown

* Use syntax

* Rm pattern <[a-z]+(.html).*>

* Rm huggingface.co/transformers/master

* Rm .html

* Rm .html from index.mdx

* Rm .html from model_summary.rst

* Update index.mdx rm html

* Update remove .html

* Fix inner doc links

* Fix interlink in preprocssing.rst

* Update pr_checks
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Styling
Co-authored-by: Mishig Davaadorj <mishig.davaadorj@coloradocollege.edu>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Julien Chaumond <julien@huggingface.co>

4df7d05a

30 Nov, 2021 1 commit

use functional interface for softmax in attention (#14198) · 6ed9882d

Thomas Viehmann authored Nov 30, 2021

* use functional interface instead of instantiating module and immediately calling it

* fix torch.nn.functional to nn.functional. Thank you Stas!

6ed9882d

29 Nov, 2021 2 commits
- Fix sentinel token IDs in data collator for Flax T5 pretraining script (#14477) · 8332327d
  Rahul Nadkarni authored Nov 29, 2021
  
  8332327d
- [Flax] token-classification model steps enumerate start from 1 (#14547) · 2bd950ca
  Kamal Raj authored Nov 29, 2021
```
* step start from 1

* Updated cur_step calcualtion
```
  2bd950ca
22 Nov, 2021 2 commits

Switch from using sum for flattening lists of lists in group_texts (#14472) · 69e16abf

Nicholas Broad authored Nov 22, 2021



* remove sum for list flattening

* change to chain(*)

* make chain object a list

* delete empty lines

per sgugger's suggestions
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Nicholas Broad <nicholas@nmbroad.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

69e16abf

[test] add test for --config_overrides (#14466) · 11f65d41
Stas Bekman authored Nov 22, 2021
```
* add test for --config_overrides

* remove unneeded parts of the test
```
11f65d41

19 Nov, 2021 1 commit

Add QDQBert model and quantization examples of SQUAD task (#14066) · a59e7c1e

Shang Zhang authored Nov 19, 2021



* clean up branch for add-qdqbert-model

* README update for QAT example; update docstrings in modeling_qdqbert.py

* Update qdqbert.rst

* Update README.md

* Update README.md

* calibration data using traning set; QAT example runs in fp32

* re-use BERTtokenizer for qdqbert

* Update docs/source/model_doc/qdqbert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/model_doc/qdqbert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/model_doc/qdqbert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* remove qdqbert tokenizer

* Update qdqbert.rst

* update evaluate-hf-trt-qa.py

* update configuration_qdqbert.py

* update modeling_qdqbert.py: add copied statement; replace assert with ValueError

* update copied from statement

* add is_quantization_available; run make fix-copies

* unittest add require_quantization

* add backend dependency to qdqbert model

* update README; update evaluate script; make style

* lint

* docs qdqbert update

* circleci build_doc add pytorch-quantization for qdqbert

* update README

* update example readme with instructions to upgrade TensorRT to 8.2

* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* change quantization to pytorch_quantization for backend requirement

* feed_forward_chunking not supported in QDQBert

* make style

* update model docstrings and comments in testing scripts

* rename example to quantization-qdqbert; rename example scripts from qat to quant

* Update src/transformers/models/qdqbert/modeling_qdqbert.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* rm experimental functions in quant_trainer

* qa cleanup

* make fix-copies for docs index.rst

* fix doctree; use post_init() for qdqbert

* fix early device assignment for qdqbert

* fix CI:Model templates runner
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

a59e7c1e

18 Nov, 2021 2 commits
- [Speech Recognition] More examples · efea0f86
  Patrick von Platen authored Nov 18, 2021
```
Add more XLS-R training runs to the official examples
```
  efea0f86
- Recover Deleted XNLI Instructions (#14437) · 01f8e639
  William Held authored Nov 18, 2021
  
  01f8e639
17 Nov, 2021 1 commit
- [Gradient checkpoining] Update Wav2Vec scripts (#14036) · 7544efc9
  Antonio Carlos Falcão Petri authored Nov 17, 2021
```
Co-authored-by: Stas Bekman <stas@stason.org>
```
  7544efc9
15 Nov, 2021 2 commits
- Replace BertLayerNorm with LayerNorm (#14385) · 9fd937ea
  Eldar Kurtic authored Nov 15, 2021
```
Running Movement pruning experiments with the newest HuggingFace would crash due to non-existing BertLayerNorm.
```
  9fd937ea
- Quick fix to TF summarization example (#14401) · 267867e8
  Matt authored Nov 15, 2021
  
  267867e8
12 Nov, 2021 1 commit
- [Wav2Vec2 Example] Improve fine-tuning script (#14373) · 55f49c5f
  Patrick von Platen authored Nov 12, 2021
```
* improve some stuff

* finish

* correct last
```
  55f49c5f
11 Nov, 2021 3 commits

fix --gradient_checkpointing (#13964) · 77262ef7
Stas Bekman authored Nov 11, 2021

77262ef7
Fixing requirements for TF LM models and use correct model mappings (#14372) · 7f20bf0d
Matt authored Nov 11, 2021
```
* Fixing requirements for TF LM models and use correct model mappings

* make style
```
7f20bf0d

Fix Flax params dtype (#13098) · e92190c0

Suraj Patil authored Nov 11, 2021



* fix inits

* fix embed dtype

* fix embed dtype

* add test to check default dtype

* quality

* add type conversion methods for flax models

* more robust casting

* cast sinusoidal positions

* update pegasus

* update albert

* update test

* make sure dtype is passed to every module

* style

* fix electra dense

* fix t5

* quality

* add more tests

* better name

* use the dtype for lm head computation

* fix albert

* style

* fix albert embed dtype

* more tests

* fix vision enc-dec

* cleanup

* fix embed dtype pegasus

* fix default param test

* doc

* update template

* fix final_logits_bias dtype

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* fix doc

* fix doc

* add detailed docstring for dtype parameter

* remove un-necessary import
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

e92190c0

09 Nov, 2021 2 commits
- bump flax version (#14343) · 85a4bda4
  Suraj Patil authored Nov 09, 2021
  
  85a4bda4
- Update Seq2Seq QA example script to use SQuAD metric. (#14335) · 4f24058c
  karthikrangasai authored Nov 09, 2021
```
* Update postporcessing accordingly to use SQuAD metric.

* Update assets accordingly based on SQuAD metrics.

* Fix function naming error.
```
  4f24058c
06 Nov, 2021 1 commit
- Fix execution PATH for PPLM Example (#14287) · c016dbdb
  Junbum Lee authored Nov 06, 2021
  
  c016dbdb
05 Nov, 2021 1 commit
- Add new LFS prune API (#14294) · 08a5f575
  Sylvain Gugger authored Nov 05, 2021
  
  08a5f575
02 Nov, 2021 1 commit
- Update Transformers to huggingface_hub >= 0.1.0 (#14251) · 558f8543
  Sylvain Gugger authored Nov 02, 2021
```
* Update Transformers to huggingface_hub >= 0.1.0

* Forgot to save...

* Style

* Fix test
```
  558f8543
01 Nov, 2021 1 commit
- Update README of QA examples (#14172) · 7396095a
  NielsRogge authored Nov 01, 2021
  
  7396095a
29 Oct, 2021 1 commit

Remove n_ctx from configs (#14165) · 5b45422b

Thomas Wang authored Oct 29, 2021

* Remove n_ctx from configs

* Fix GPTJ and OpenAIGPT, both are acceptable breaking changes as there are no configs such that it breaks

* Remove unecessary n_positions from TFOpenAIGPT

5b45422b

28 Oct, 2021 5 commits
- Update README.md · ba71f1b5
  Patrick von Platen authored Oct 28, 2021
  
  ba71f1b5
- v4.13.0.dev0 · b8fad022
  Lysandre authored Oct 28, 2021
  
  b8fad022
- Release v4.12.0 · 62bf5366
  Lysandre authored Oct 28, 2021
  
  62bf5366
- Add audio-classification benchmarking results (#14192) · 78b6a2ec
  Anton Lozhkov authored Oct 28, 2021
  
  78b6a2ec
- Update README.md · 88cd82e8
  Patrick von Platen authored Oct 28, 2021
  
  88cd82e8