"...gpu/git@developer.sourcefind.cn:gaoqiong/migraphx.git" did not exist on "7668ef6b75a5befa9eb8516257aec8c00e632f56"
- 19 Mar, 2020 23 commits
-
-
Nitish Shirish Keskar authored
torch.cuda.empty_cache() was being called from a TF function (even when torch is unavailable). Not sure any replacement is needed if TF OOMs.
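A minimal sketch of the kind of guard this implies, assuming the `is_torch_available` helper exported by `transformers`; the wrapper function itself is hypothetical and only illustrates the idea:

```python
# Sketch: only touch torch from a TF code path when torch is actually installed.
# `is_torch_available` is the transformers utility; the wrapper is hypothetical.
from transformers import is_torch_available


def maybe_empty_torch_cache():
    if is_torch_available():
        import torch

        if torch.cuda.is_available():
            torch.cuda.empty_cache()
```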
-
Julien Chaumond authored
-
Kyeongpil Kang authored
I found two grammar errors / typos in the explanation of the encoding properties. The original sentences: "If your was made of multiple "parts" such as (question, context), then this would be a vector with for each token the segment it belongs to" and "If your has been truncated into multiple subparts because of a length limit (for BERT for example the sequence length is limited to 512), this will contain all the remaining overflowing parts." I think "input" should be inserted after the phrase "If your".
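For context, a minimal sketch of the first property being described, using a BERT checkpoint chosen only for illustration:

```python
# Minimal illustration of the property the docs describe; the model name is an
# example only.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

# An input made of multiple "parts" (question, context): token_type_ids marks,
# for each token, which segment it belongs to (0 = question, 1 = context).
encoding = tokenizer("Who wrote it?", "The book was written by Ada.")
print(encoding["token_type_ids"])
```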
-
Patrick von Platen authored
* fix conflicts * update bart max length test * correct spelling mistakes * implemented model specific encode function * fix merge conflicts * better naming * save intermediate state -> need to rethink structure a bit * leave tf problem as it is for now * current version * add layers.pop * remove ipdb * make style * clean return cut decoding * remove ipdbs * Fix restoring layers in the decoders that don't exist * push good intermediate solution for now * fix conflicts * always good to refuse to merge conflicts when rebasing * fix small bug * improve function calls * remove unused file * add correct scope behavior for t5_generate Co-authored-by: Morgan Funtowicz <funtowiczmo@gmail.com>
-
Julien Chaumond authored
-
husein zolkepli authored
-
Manuel Romero authored
-
Manuel Romero authored
Create card for BERT-Mini finetuned on SQuAD v2
-
Manuel Romero authored
- Only 17 MB of model weights!
-
Manuel Romero authored
-
Antti Virtanen authored
* Add a model card for FinBERT. This is a copy of https://github.com/TurkuNLP/FinBERT/blob/master/README.md. * Added a file for uncased. * Add metadata for cased. * Added metadata for uncased.
-
Lysandre Debut authored
-
Kyeongpil Kang authored
In the tutorial "How to generate text", the URL was wrong (it linked to the tutorial "How to train a language model"). I fixed the URL.
-
Serkan Karakulak authored
* added return_token_type_ids argument for tokenizers which do not return token_type_ids by default * fixed styling * Style Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>
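A minimal sketch of the flag in use; DistilBERT is chosen only as an example of a tokenizer that omits token_type_ids by default:

```python
# Sketch of the `return_token_type_ids` flag; the checkpoint is an example only.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")

# Without the flag, this tokenizer does not return token_type_ids.
default_encoding = tokenizer("A short sentence.")

# Asking for them explicitly adds the (all-zero) segment ids to the output.
forced_encoding = tokenizer("A short sentence.", return_token_type_ids=True)
print("default:", "token_type_ids" in default_encoding)
print("forced: ", "token_type_ids" in forced_encoding)
```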
-
Sam Shleifer authored
* config.activation_function
-
mataney authored
* solving a bug where, for small epochs and large gradient_accumulation_steps, we never train * black formatting * no need to change these files
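A hedged sketch of the arithmetic behind the bug; the variable names mirror the example scripts but are assumptions here, not the actual fix:

```python
# Sketch of the failure mode: with a tiny dataset and a large accumulation
# factor, the computed number of optimizer steps rounds down to zero.
len_dataloader = 3              # batches per epoch (small dataset)
gradient_accumulation_steps = 8
num_train_epochs = 2

naive_steps = len_dataloader // gradient_accumulation_steps * num_train_epochs
clamped_steps = max(1, len_dataloader // gradient_accumulation_steps) * num_train_epochs

print(naive_steps)    # 0 -> the loop never takes an optimizer step
print(clamped_steps)  # 2 -> at least one optimizer step per epoch
```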
-
Sam Shleifer authored
-
Mohamed El-Geish authored
`T5Tokenizer` instead of `XLNetTokenizer`
-
Matthew Goldey authored
-
Patrick von Platen authored
* fix issue 3289 * fix attention mask behavior when input_ids is None
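A hedged sketch of the default-mask logic this touches; the function, names, and shapes are illustrative, not the library's exact code:

```python
# Sketch: build a default attention mask whether the caller passed input_ids
# or only inputs_embeds. Names and shapes are illustrative only.
import torch


def default_attention_mask(input_ids=None, inputs_embeds=None):
    if input_ids is not None:
        batch_size, seq_len = input_ids.shape
    else:
        batch_size, seq_len = inputs_embeds.shape[:2]
    # Attend to every position when no explicit mask is given.
    return torch.ones(batch_size, seq_len, dtype=torch.long)


mask = default_attention_mask(inputs_embeds=torch.randn(2, 5, 16))
print(mask.shape)  # torch.Size([2, 5])
```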
-
Junyi_Li authored
-
Junyi_Li authored
roberta_chinese_base card
-
Junyi_Li authored
albert_chinese_tiny card
-
- 18 Mar, 2020 8 commits
-
-
Kyle Lo authored
* Create README.md * model card * add model card for cased
-
Morgan Funtowicz authored
Remove hardcoded mask_token and use the value provided by the tokenizer.
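A minimal sketch of why this matters: different tokenizers spell the mask differently, so it should be read from the tokenizer rather than hardcoded (the model names below are examples only):

```python
# Sketch: the mask token differs between checkpoints, so read it from the
# tokenizer instead of hardcoding it. Model names are examples only.
from transformers import AutoTokenizer

for name in ("bert-base-uncased", "roberta-base"):
    tokenizer = AutoTokenizer.from_pretrained(name)
    print(name, "->", tokenizer.mask_token)  # [MASK] vs. <mask>
```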
-
Branden Chan authored
-
Lysandre Debut authored
* XLM-R now passes common tests + integration tests * Correct mask index * Model input names * Style * Remove text preprocessing * Unnecessary import
-
Patrick von Platen authored
Adding LM Head to Transfo-XL and first step toward fixing the problem with Adaptive Embeddings in Transfo-XL (#3286) * first commit * work in progress * make language generation task pass * update to working version for LM * delete print * remove dead code * make style
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Junyi_Li authored
-
- 17 Mar, 2020 9 commits
-
-
Sam Shleifer authored
* passing * Undo stupid change * docs * undo rename * delete cruft * only import if you have torch * Don't rely on dict ordering * Fix dict ordering upstream * docstring link * docstring link * remove trailing comma for 3.5 compat * new name * delegate kwarging * Update kwargs
-
J.P Lee authored
* Update examples/ner/run_ner.py to use AutoModel * Fix missing code and apply `make style` command
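A hedged sketch of the Auto-class pattern the script moves to; the checkpoint name and label count are placeholders, not the script's actual values:

```python
# Sketch of loading a token-classification model via Auto classes; the
# checkpoint and num_labels are placeholders only.
from transformers import AutoConfig, AutoModelForTokenClassification, AutoTokenizer

model_name = "bert-base-cased"
config = AutoConfig.from_pretrained(model_name, num_labels=9)
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForTokenClassification.from_pretrained(model_name, config=config)
```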
-
Manuel Romero authored
-
Julien Chaumond authored
-
Nathan Raw authored
* ✨ Alter base pl transformer to use automodels * 🐛 Add batch size env variable to function call * 💄 Apply black code style from Makefile * 🚚 Move lightning base out of ner directory * ✨ Add lightning glue example * 💄 self * move _feature_file to base class * ✨ Move eval logging to custom callback * 💄 Apply black code style * 🐛 Add parent to pythonpath, remove copy command * 🐛 Add missing max_length kwarg
-
Patrick von Platen authored
* change do_sample back * None is a better default than a boolean * adapt do_sample to True in test example * make style
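A minimal sketch of the flag in question on `generate`; the checkpoint is an example only, and greedy vs. sampled decoding is all this illustrates:

```python
# Sketch of the `do_sample` flag on generate(); the checkpoint is an example.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

inputs = tokenizer("The quick brown fox", return_tensors="pt")
greedy = model.generate(**inputs, max_length=20, do_sample=False)
sampled = model.generate(**inputs, max_length=20, do_sample=True)
print(tokenizer.decode(greedy[0], skip_special_tokens=True))
print(tokenizer.decode(sampled[0], skip_special_tokens=True))
```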
-
Thomas Wolf authored
* memory benchmark rss * have both forward pass and line-by-line mem tracing * cleaned up tracing * refactored and cleaned up API * no f-strings yet... * add GPU mem logging * fix GPU memory monitoring * style and quality * clean up and doc * update with comments * Switching to Python 3.6+ * fix quality
-
Jannes authored
* Create README.md * Updated README.md
-
Julien Chaumond authored
-