Commits · 7df4b90c760804295cd4c23a0840055b772898ee · chenpangpang / transformers

21 Dec, 2021 3 commits
- Skip failing test · e51c7b58
  Sylvain Gugger authored Dec 21, 2021
  
  e51c7b58
- [examples/summarization] deal with None in data records (#14816) · 033c3ed9
  Stas Bekman authored Dec 21, 2021
```
* [examples/summarization] deal with None in data records

* rewrite to use a simpler (slower) variant
```
  033c3ed9
- [ASR example] Improve example + add more examples (#14848) · 7ae6f070
  Patrick von Platen authored Dec 21, 2021
```
* up

* load up

* up
```
  7ae6f070
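
The summarization commit listed above (#14816) concerns data records whose text or summary field is `None`. A minimal sketch of one way to skip such pairs, assuming a batch shaped as a dict of column lists. The helper name `drop_none_pairs` and the column names are hypothetical and not taken from the commit itself:

```python
from typing import Dict, List, Optional, Tuple


def drop_none_pairs(
    examples: Dict[str, List[Optional[str]]],
    text_column: str,
    summary_column: str,
) -> Tuple[List[str], List[str]]:
    """Keep only the (text, summary) pairs where both fields are present."""
    inputs: List[str] = []
    targets: List[str] = []
    # Simple per-record loop: slower than a vectorized filter, but easy to read.
    for text, summary in zip(examples[text_column], examples[summary_column]):
        if text is not None and summary is not None:
            inputs.append(text)
            targets.append(summary)
    return inputs, targets


# Hypothetical batch: the second and third records each miss one field.
batch = {"document": ["a", None, "c"], "summary": ["x", "y", None]}
inputs, targets = drop_none_pairs(batch, "document", "summary")
```

Dropping a record only when either side is missing keeps inputs and targets aligned, which a tokenizer call on paired lists would require.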
17 Dec, 2021 1 commit

- Wav2Vec2 meets phonemes (#14353) · c4a96cec
  Patrick von Platen authored Dec 17, 2021

  * up

  * add tokenizer

  * improve more

  * finish tokenizer

  * finish

  * adapt speech recognition script

  * adapt convert

  * more fixes

  * more fixes

  * update phonemizer wav2vec2

  * better naming

  * fix more tests

  * more fixes swedish

  * correct tests

  * finish

  * improve script

  * remove file

  * up

  * lets get those 100 model architectures until the end of the month

  * make fix-copies

  * correct more

  * correct script

  * more fixes

  * more fixes

  * add to docs

  * Apply suggestions from code review
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

  * replace assert

  * fix copies

  * fix docs

  * new try docs

  * boom boom

  * update

  * add phonemizer to audio tests

  * make fix-copies

  * up

  * upload models

  * some changes

  * Update tests/test_tokenization_wav2vec2_phoneme.py
  Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

  * more fixes

  * remove @
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
  Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

  c4a96cec

15 Dec, 2021 2 commits
- Docs for v4.14.0 · 7c9c41f4
  Lysandre authored Dec 15, 2021
  
  7c9c41f4
- Release: v4.14.0 · 960d8cb4
  Lysandre authored Dec 15, 2021
  
  960d8cb4
13 Dec, 2021 1 commit
- Change how to load config of XLNetLMHeadModel (#14746) · 971e3666
  Josué Nascimento authored Dec 13, 2021
  
  971e3666
09 Dec, 2021 2 commits
- Docs for v4.14.0dev0 · ab31b3e4
  Lysandre authored Dec 09, 2021
  
  ab31b3e4
- Release: v4.13.0 · 4da3a696
  Lysandre authored Dec 09, 2021
  
  4da3a696
08 Dec, 2021 1 commit

- fix: verify jsonlines file in run_translation (#14660) (#14661) · 4ea19de8
  Gaurang Tandon authored Dec 08, 2021

  * fix: verify jsonl in run_translation (#14660)

  * fix(run_translation.py): json/jsonl validation

  Both json and jsonl are to be accepted as valid jsonlines file extension

  * fix(run_translation.py): make black happy

  * Ran make style

  4ea19de8
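
The commit above states that both `json` and `jsonl` should be accepted as valid jsonlines file extensions. A minimal sketch of such a check, assuming the file is identified by its path. The function name `check_jsonlines_extension` and the error wording are illustrative and not copied from `run_translation.py`:

```python
from pathlib import Path

# Extensions treated as jsonlines, per the commit message.
VALID_JSONLINES_EXTENSIONS = ("json", "jsonl")


def check_jsonlines_extension(path: str) -> str:
    """Return the file extension, or raise if it is not json/jsonl."""
    extension = Path(path).suffix.lstrip(".")
    if extension not in VALID_JSONLINES_EXTENSIONS:
        raise ValueError(
            f"`{path}` should be a jsonlines file with a .json or .jsonl extension."
        )
    return extension


# Both spellings pass; anything else raises ValueError.
train_ext = check_jsonlines_extension("data/train.jsonl")
valid_ext = check_jsonlines_extension("data/valid.json")
```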

06 Dec, 2021 2 commits
- [urls to hub] Replace outdated model tags with their now-canonical pipeline types (#14617) · 6cdc3a78
  Julien Chaumond authored Dec 06, 2021
```
* Replace outdated model tags with their now-canonical pipeline types

* spam the CI till it's green
```
  6cdc3a78
- updated readme with proper arguments (#14624) · 803a8cd1
  Kamal Raj authored Dec 06, 2021
  
  803a8cd1
05 Dec, 2021 1 commit
- fix a typo (#14626) · 3977b584
  (Bill) Yuchen Lin authored Dec 04, 2021
  
  3977b584
22 Nov, 2021 2 commits

- Switch from using sum for flattening lists of lists in group_texts (#14472) · 69e16abf
  Nicholas Broad authored Nov 22, 2021

  * remove sum for list flattening

  * change to chain(*)

  * make chain object a list

  * delete empty lines

  per sgugger's suggestions
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
  Co-authored-by: Nicholas Broad <nicholas@nmbroad.com>
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

  69e16abf

- [test] add test for --config_overrides (#14466) · 11f65d41
  Stas Bekman authored Nov 22, 2021
```
* add test for --config_overrides

* remove unneeded parts of the test
```
  11f65d41
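
The `group_texts` commit (#14472) replaces `sum` with `chain(*)` for flattening lists of lists and wraps the result in a list. A minimal sketch of the two variants, assuming a batch that maps column names to lists of token lists. The helper name `concatenate_columns` and the sample batch are hypothetical:

```python
from itertools import chain
from typing import Dict, List


def concatenate_columns(examples: Dict[str, List[List[int]]]) -> Dict[str, List[int]]:
    """Flatten each column's list of lists into one list.

    chain(*lists) walks every element once, whereas sum(lists, []) builds a
    new list on every addition and so grows quadratically with the input.
    """
    return {key: list(chain(*examples[key])) for key in examples.keys()}


batch = {
    "input_ids": [[1, 2], [3], [4, 5, 6]],
    "attention_mask": [[1, 1], [1], [1, 1, 1]],
}
concatenated = concatenate_columns(batch)

# The older sum-based form gives the same result, only more slowly on large inputs.
with_sum = {key: sum(batch[key], []) for key in batch.keys()}
```

Wrapping the `chain` object in `list(...)` matters because later code typically calls `len()` on or slices the result, which an iterator does not support.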

18 Nov, 2021 2 commits
- [Speech Recognition] More examples · efea0f86
  Patrick von Platen authored Nov 18, 2021
```
Add more XLS-R training runs to the official examples
```
  efea0f86
- Recover Deleted XNLI Instructions (#14437) · 01f8e639
  William Held authored Nov 18, 2021
  
  01f8e639
12 Nov, 2021 1 commit
- [Wav2Vec2 Example] Improve fine-tuning script (#14373) · 55f49c5f
  Patrick von Platen authored Nov 12, 2021
```
* improve some stuff

* finish

* correct last
```
  55f49c5f
09 Nov, 2021 1 commit

- Update Seq2Seq QA example script to use SQuAD metric. (#14335) · 4f24058c
  karthikrangasai authored Nov 09, 2021

  * Update postprocessing accordingly to use SQuAD metric.

  * Update assets accordingly based on SQuAD metrics.

  * Fix function naming error.

  4f24058c

05 Nov, 2021 1 commit
- Add new LFS prune API (#14294) · 08a5f575
  Sylvain Gugger authored Nov 05, 2021
  
  08a5f575
01 Nov, 2021 1 commit
- Update README of QA examples (#14172) · 7396095a
  NielsRogge authored Nov 01, 2021
  
  7396095a
28 Oct, 2021 5 commits
- Update README.md · ba71f1b5
  Patrick von Platen authored Oct 28, 2021
  
  ba71f1b5
- v4.13.0.dev0 · b8fad022
  Lysandre authored Oct 28, 2021
  
  b8fad022
- Release v4.12.0 · 62bf5366
  Lysandre authored Oct 28, 2021
  
  62bf5366
- Add audio-classification benchmarking results (#14192) · 78b6a2ec
  Anton Lozhkov authored Oct 28, 2021
  
  78b6a2ec
- Update README.md · 88cd82e8
  Patrick von Platen authored Oct 28, 2021
  
  88cd82e8
27 Oct, 2021 2 commits
- Update README.md · e118db15
  Patrick von Platen authored Oct 28, 2021
  
  e118db15
- [TPU tests] Enable first TPU examples pytorch (#14121) · 01b14669
  Patrick von Platen authored Oct 28, 2021
```
* up

* up

* fix

* up

* Update examples/pytorch/test_xla_examples.py

* correct labels

* up

* up

* up

* up

* up

* up
```
  01b14669
26 Oct, 2021 6 commits
- Replace assertions with ValueError exception (#14142) · ebd48c6d
  Emanuel Huber authored Oct 26, 2021
```
Updated masked-language modeling examples in pytorch
with convention defined by #12789
```
  ebd48c6d
- fix typos in error messages in speech recognition example and modelcard.py (#14166) · 42bfb83d
  Matthew Goldey authored Oct 26, 2021
```
* specify the text column name in the error message

* pluralize the word fields
```
  42bfb83d
- chore: typo on ner accelerate example code (#14150) · 41dad89f
  Jangwon Park authored Oct 27, 2021
  
  41dad89f
- Update README.md · 9799f4e1
  Patrick von Platen authored Oct 26, 2021
  
  9799f4e1
- [Speech Recognition] - Distributed training: Make sure vocab file removal and... · f5ed19f5
  Patrick von Platen authored Oct 26, 2021
```
[Speech Recognition] - Distributed training: Make sure vocab file removal and creation don't interfere (#14161)

* up

* better
```
  f5ed19f5
- up (#14154) · e248e9b0
  Patrick von Platen authored Oct 26, 2021
  
  e248e9b0
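
The "Replace assertions with ValueError exception" commit (#14142) in the list above follows a common pattern: input validation written as `assert` becomes an explicit `if ... raise ValueError`. A minimal sketch under assumptions: the function name `validate_train_file`, the accepted extensions, and the message text are illustrative and not copied from the masked-language modeling examples:

```python
ALLOWED_EXTENSIONS = ["csv", "json", "txt"]


def validate_train_file(train_file: str) -> str:
    """Check the training file extension and return it."""
    extension = train_file.split(".")[-1]
    # Assertion style that this kind of change replaces:
    #     assert extension in ALLOWED_EXTENSIONS, "..."
    # Explicit style: the check still runs under `python -O`, which strips asserts.
    if extension not in ALLOWED_EXTENSIONS:
        raise ValueError("`train_file` should be a csv, a json or a txt file.")
    return extension


extension = validate_train_file("corpus.txt")
```

A caller can catch `ValueError` specifically, which is harder to do cleanly with `AssertionError`.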
25 Oct, 2021 3 commits

- Update README.md · c99a2832
  Patrick von Platen authored Oct 25, 2021
  
  c99a2832
- Update README.md · 1a9381c6
  Patrick von Platen authored Oct 25, 2021
  
  1a9381c6
- Supporting Seq2Seq model for question answering task (#13432) · 1b871e09
  karthikrangasai authored Oct 25, 2021

  * Add seq2seq example for QnA on SQuAD Dataset.

  * Changes from review - Fixing styling mistakes.

  * Added how to example in README, simplified the access to dataset's preprocess function.

  * Added tests for the seq2seq QA example.

  * Change dataset column name to fix tests.

  * Fix test command mistake.

  * Add missing argument 'ignore_pad_token_for_loss' from DataTrainingArguments.

  * Add missing argument 'num_beams' from DataTrainingArguments.

  * Fix processing of output predicted token ids so that tokenizer decode gets appropriate input. Updated assertion conditions on the tests.

  1b871e09

21 Oct, 2021 3 commits
- fix typo in license docstring (#14094) · d432a654
  lee1jun authored Oct 22, 2021
```
last line: "# limitations under the License." is missing
```
  d432a654
- [Examples] Add audio classification notebooks (#14099) · e03544a1
  Anton Lozhkov authored Oct 21, 2021
```
* Update SEW integration test tolerance

* Add audio classification notebooks
```
  e03544a1
- up (#14093) · e9d2a639
  Patrick von Platen authored Oct 21, 2021
  
  e9d2a639