- 28 May, 2020 5 commits
-
Suraj Patil authored
-
Anthony MOI authored
-
Suraj Patil authored
-
Lavanya Shukla authored
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
Iz Beltagy authored
* adding freeze roberta models
* model cards
* lint
-
- 27 May, 2020 10 commits
-
Patrick von Platen authored
* improve memory benchmarking
* correct typo
* fix current memory
* check torch memory allocated
* better pytorch function
* add total cached gpu memory
* add total gpu required
* improve torch gpu usage
* update memory usage
* finalize memory tracing
* save intermediate benchmark class
* fix conflict
* improve benchmark
* improve benchmark
* finalize
* make style
* improve benchmarking
* correct typo
* make train function more flexible
* fix csv save
* better repr of bytes
* better print
* fix __repr__ bug
* finish plot script
* rename plot file
* delete csv and small improvements
* fix in plot
* fix in plot
* correct usage of timeit
* remove redundant line
* remove redundant line
* fix bug
* add hf parser tests
* add versioning and platform info
* make style
* add gpu information
* ensure backward compatibility
* finish adding all tests
* Update src/transformers/benchmark/benchmark_args.py
* Update src/transformers/benchmark/benchmark_args_utils.py
* delete csv files
* fix isort ordering
* add out of memory handling
* add better train memory handling

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
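For readers who want to try the reworked benchmarks, a minimal sketch under the assumption that the PyTorchBenchmark / PyTorchBenchmarkArguments names from this change are available; the model id and sizes are illustrative:

```python
from transformers import PyTorchBenchmark, PyTorchBenchmarkArguments

# Illustrative model id and sizes; any checkpoint works the same way.
args = PyTorchBenchmarkArguments(
    models=["bert-base-uncased"],
    batch_sizes=[8],
    sequence_lengths=[128, 512],
)
benchmark = PyTorchBenchmark(args)
results = benchmark.run()  # reports speed plus the new memory measurements
```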
-
Suraj Patil authored
* LongformerForSequenceClassification
* better naming x => hidden_states, fix typo in doc
* Update src/transformers/modeling_longformer.py
* Update src/transformers/modeling_longformer.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
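A hedged usage sketch of the new head; the checkpoint id is the public Longformer base model, and the tuple-style outputs reflect the library at this point:

```python
from transformers import LongformerTokenizer, LongformerForSequenceClassification

tokenizer = LongformerTokenizer.from_pretrained("allenai/longformer-base-4096")
model = LongformerForSequenceClassification.from_pretrained("allenai/longformer-base-4096")

# encode_plus was the standard encoding call in this era of the library
inputs = tokenizer.encode_plus("A long document ...", return_tensors="pt")
logits = model(**inputs)[0]  # models returned plain tuples at this point
```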
-
Suraj Patil authored
-
Lysandre Debut authored
* per_device instead of per_gpu/error thrown when argument unknown
* [docs] Restore examples.md symlink
* Correct absolute links so that symlink to the doc works correctly
* Update src/transformers/hf_argparser.py
* Warning + reorder
* Docs
* Style
* not for squad

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
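A minimal sketch of the renamed arguments; output_dir and the batch sizes below are illustrative values:

```python
from transformers import TrainingArguments

# per_device_* replaces the older per_gpu_* names; HfArgumentParser now
# raises on unknown arguments instead of silently ignoring them.
args = TrainingArguments(
    output_dir="./results",            # illustrative path
    per_device_train_batch_size=8,
    per_device_eval_batch_size=16,
)
```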
-
Mehrdad Farahani authored
HooshvareLab/bert-base-parsbert-uncased
-
Patrick von Platen authored
-
Darek Kłeczek authored
Co-authored-by: kldarek <darekmail>
-
Darek Kłeczek authored
Model card for cased model
-
Sam Shleifer authored
-
Hao Tan authored
The option `--do_lower_case` is currently required by the uncased models (i.e., bert-base-uncased, bert-large-uncased).

Results:
* BERT-BASE without --do_lower_case: 'exact': 73.83, 'f1': 82.22
* BERT-BASE with --do_lower_case: 'exact': 81.02, 'f1': 88.34
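To see why the flag matters, a small sketch assuming the standard BertTokenizer behaviour; the commented outputs are the expected behaviour, not verified here:

```python
from transformers import BertTokenizer

# The uncased checkpoints ship a lowercased vocabulary.
lower = BertTokenizer.from_pretrained("bert-base-uncased", do_lower_case=True)
print(lower.tokenize("Hello World"))  # ['hello', 'world'], matches the vocab

raw = BertTokenizer.from_pretrained("bert-base-uncased", do_lower_case=False)
print(raw.tokenize("Hello World"))    # cased tokens no longer line up with the
                                      # vocabulary, which is what hurts SQuAD F1
```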
-
- 26 May, 2020 11 commits
-
Bayartsogt Yadamsuren authored
Here I am uploading a Mongolian masked language model (ALBERT) to your platform. https://en.wikipedia.org/wiki/Mongolia
-
Wissam Antoun authored
* updated aubmindlab/bert-base-arabert model card
* updated aubmindlab/bert-base-arabertv01 model card
-
Oleksandr Bushkovskyi authored
Add language metadata, training and evaluation corpora details. Add example output. Fix inconsistent use of quotes.
-
Manuel Romero authored
-
Manuel Romero authored
-
Patrick von Platen authored
* revert convenience method
* clean docs a bit
-
ohmeow authored
* adding BART summarization how-to community notebook
* Update notebooks/README.md

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
Bram Vanroy authored
* make transformers-cli cross-platform

  Using "scripts" is a useful option in setup.py, particularly when you want access to non-Python scripts. However, in this case we want an entry point into some of our own Python scripts. To do this in a concise, cross-platform way, we can use entry_points.console_scripts. This change is necessary to provide the CLI on different platforms, which "scripts" does not ensure. Usage remains the same, but the transformers-cli script has to be moved (be part of the library) and renamed (underscore + extension).

* make style & quality
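A sketch of the mechanism described above; the entry-point module path is illustrative of the change, not quoted from the diff:

```python
from setuptools import setup

setup(
    name="transformers",
    # ... other metadata elided ...
    entry_points={
        "console_scripts": [
            # setuptools generates a transformers-cli executable on every platform
            "transformers-cli=transformers.commands.transformers_cli:main"
        ]
    },
)
```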
-
Patrick von Platen authored
* add new longformer for question answering model
* add new config as well
* fix links
* fix links part 2
-
ZhuBaohe authored
* fix
* fix1
-
ZhuBaohe authored
-
- 25 May, 2020 11 commits
-
Sam Shleifer authored
-
Julien Chaumond authored
-
Patrick von Platen authored
* fix reformer num buckets
* fix
* adapt docs
* set num buckets in config
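A minimal sketch of fixing the bucket count via the config, assuming the public Reformer checkpoint id; the value 64 is illustrative:

```python
from transformers import ReformerConfig, ReformerModelWithLMHead

config = ReformerConfig.from_pretrained("google/reformer-crime-and-punishment")
config.num_buckets = 64  # illustrative value; pins the LSH bucket count up front
model = ReformerModelWithLMHead(config)
```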
-
Elman Mansimov authored
-
Suraj Patil authored
-
Oliver Guhr authored
It looks like the conference has changed the link to the paper.
-
Sho Arora authored
-
Manuel Romero authored
-
Ali Safaya authored
-
Antonis Maronikolakis authored
-
Suraj Patil authored
* added LongformerForQuestionAnswering
* add LongformerForQuestionAnswering
* fix import for LongformerForMaskedLM
* add LongformerForQuestionAnswering
* hardcoded sep_token_id
* compute attention_mask if not provided
* combine global_attention_mask with attention_mask when provided
* update example in docstring
* add assert error messages, better attention combine
* add test for longformerForQuestionAnswering
* typo
* cast global_attention_mask to long
* make style
* Update src/transformers/configuration_longformer.py
* Update src/transformers/configuration_longformer.py
* fix the code quality
* Merge branch 'longformer-for-question-answering' of https://github.com/patil-suraj/transformers into longformer-for-question-answering

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
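A hedged end-to-end sketch of the new head; the fine-tuned TriviaQA checkpoint id is assumed to be available on the model hub, and the question/context strings are illustrative:

```python
import torch
from transformers import LongformerTokenizer, LongformerForQuestionAnswering

ckpt = "allenai/longformer-large-4096-finetuned-triviaqa"
tokenizer = LongformerTokenizer.from_pretrained(ckpt)
model = LongformerForQuestionAnswering.from_pretrained(ckpt)

question = "Who wrote the Longformer paper?"
context = "The Longformer paper was written by Iz Beltagy, Matthew E. Peters, and Arman Cohan."
encoding = tokenizer.encode_plus(question, context, return_tensors="pt")

# per the commit, attention_mask and global_attention_mask are computed and
# combined automatically when not supplied
start_scores, end_scores = model(**encoding)[:2]
start = torch.argmax(start_scores)
end = torch.argmax(end_scores) + 1
print(tokenizer.decode(encoding["input_ids"][0][start:end]))
```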
-
- 23 May, 2020 1 commit
-
Bharat Raghunathan authored
-
- 22 May, 2020 2 commits
-
Bijay Gurung authored
* Add Type Hints to modeling_utils.py

  Closes #3911. Adds type hints to methods in `modeling_utils.py`. Note: coverage isn't 100%; internal methods were mostly skipped.

* Reformat according to `black` and `isort`
* Use typing.Iterable instead of Sequence
* Parameterize Iterable by its generic type
* Use typing.Optional when None is the default value
* Adhere to style guideline
* Update src/transformers/modeling_utils.py
* Update src/transformers/modeling_utils.py

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
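An illustrative fragment showing the annotation conventions listed above; the helper below is hypothetical, not an actual modeling_utils method:

```python
from typing import Iterable, Optional

import torch

# Hypothetical helper, shown only to illustrate the conventions above:
# Iterable parameterized by its element type, and Optional wherever the
# default value is None.
def scale_tensors(tensors: Iterable[torch.Tensor],
                  factor: Optional[float] = None) -> None:
    for t in tensors:
        t.mul_(factor if factor is not None else 1.0)
```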
-
Funtowicz Morgan authored
* Warn the user that max_len is on the path to being deprecated.
* Ensure better backward compatibility when max_len is provided to a tokenizer.
* Make sure to override the parameter and not the actual instance value.
* Format & quality
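A sketch of the backward-compatible behaviour described above, assuming max_len still maps onto the tokenizer's max_len attribute; the warning text is paraphrased, not quoted:

```python
import warnings

from transformers import BertTokenizer

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    tokenizer = BertTokenizer.from_pretrained("bert-base-uncased", max_len=512)

print(tokenizer.max_len)                 # 512: the kwarg is still honoured
print([str(w.message) for w in caught])  # now includes a deprecation notice
```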
-