- 18 Jun, 2019 2 commits
- 11 Jun, 2019 2 commits
Meet Pragnesh Shah authored
Oliver Guhr authored
- 10 Jun, 2019 1 commit
jeonsworld authored
Apply the Whole Word Masking technique, referencing [create_pretraining_data.py](https://github.com/google-research/bert/blob/master/create_pretraining_data.py).
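In Whole Word Masking, masking candidates are chosen at the word level rather than the WordPiece level: sub-tokens beginning with `##` stay grouped with the token that starts the word, so the whole word is masked or kept together. A minimal sketch of the grouping step, modeled on the referenced script (the function name is illustrative):

```python
def whole_word_candidates(tokens):
    """Group WordPiece sub-tokens into whole-word candidate spans.

    A sub-token starting with '##' continues the previous word, so it is
    appended to the previous candidate span; masking one index of a span
    then implies masking the whole span (the whole word).
    """
    cand_indexes = []
    for i, token in enumerate(tokens):
        if token in ("[CLS]", "[SEP]"):
            continue
        if cand_indexes and token.startswith("##"):
            cand_indexes[-1].append(i)
        else:
            cand_indexes.append([i])
    return cand_indexes

# e.g. ["[CLS]", "phil", "##am", "##mon", "sings", "[SEP]"]
#  ->  [[1, 2, 3], [4]]: choosing index 1 masks "phil ##am ##mon" together
```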
- 27 May, 2019 1 commit
Ahmad Barqawi authored
Fix an issue with bert-base-multilingual and add support for the uncased multilingual model.
- 22 May, 2019 1 commit
tguens authored
Indentation change so that the output "nbest_predictions.json" is not empty.
- 13 May, 2019 1 commit
samuelbroscheit authored
- 11 May, 2019 2 commits
samuel.broscheit authored
https://github.com/huggingface/pytorch-pretrained-BERT/issues/556
samuel.broscheit authored
The reason for the issue was that the number of optimization steps was computed from the number of examples, which differs from the actual size of the dataloader when an example is chunked into multiple instances. The solution in this pull request is to compute num_optimization_steps directly from len(data_loader).
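A minimal sketch of the described fix, assuming a PyTorch DataLoader; the function and argument names here are illustrative rather than the exact ones in the pull request:

```python
from torch.utils.data import DataLoader

def num_optimization_steps(data_loader: DataLoader,
                           gradient_accumulation_steps: int,
                           num_train_epochs: int) -> int:
    # len(data_loader) counts batches over *training instances*, so examples
    # that were chunked into several instances are counted correctly; deriving
    # the count from len(examples) would underestimate the real step count.
    return len(data_loader) // gradient_accumulation_steps * num_train_epochs
```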
- 09 May, 2019 2 commits
burcturkoglu authored
burcturkoglu authored
- 02 May, 2019 2 commits
- 30 Apr, 2019 1 commit
Aneesh Pappu authored
Small fix to remove the shifting of LM labels during preprocessing of ROCStories, as this shifting happens internally in the model.
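For context, the model performs the shift itself when computing the language-modeling loss, so preprocessing must pass unshifted labels; shifting twice would misalign inputs and targets. A sketch of the internal shift, assuming padding positions are marked with -1 as elsewhere in this repository:

```python
import torch.nn.functional as F

def lm_loss(lm_logits, lm_labels):
    # Predict token t+1 from position t: drop the last logit and the first
    # label inside the model, so callers pass labels aligned with the input.
    shift_logits = lm_logits[..., :-1, :].contiguous()
    shift_labels = lm_labels[..., 1:].contiguous()
    return F.cross_entropy(shift_logits.view(-1, shift_logits.size(-1)),
                           shift_labels.view(-1), ignore_index=-1)
```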
- 29 Apr, 2019 1 commit
Mathieu Prouveur authored
- 24 Apr, 2019 1 commit
Mathieu Prouveur authored
- 23 Apr, 2019 1 commit
thomwolf authored
- 22 Apr, 2019 1 commit
Matthew Carrigan authored
- 21 Apr, 2019 1 commit
Sangwhan Moon authored
- 16 Apr, 2019 2 commits
Ben Mann authored
Abhi Sharma authored
- 15 Apr, 2019 7 commits
- 12 Apr, 2019 1 commit
Matthew Carrigan authored
Added an error for users whose corpus is just one giant document.
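The check matters because next-sentence pretraining samples negative pairs from other documents, so a corpus parsed as a single document leaves nothing to sample from. A hypothetical sketch of such a guard (`docs` is assumed to be the list of parsed documents; the message text is illustrative):

```python
def check_corpus(docs):
    # Negative next-sentence pairs are drawn from *other* documents, so a
    # corpus read as one giant document cannot produce any.
    if len(docs) <= 1:
        raise ValueError(
            "Your corpus contains only one document! Separate documents "
            "with blank lines so negative sentence pairs can be sampled."
        )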
- 11 Apr, 2019 2 commits
- 09 Apr, 2019 2 commits
Yaroslav Bulatov authored
Fix for:
```
04/09/2019 21:39:38 - INFO - __main__ - device: cuda n_gpu: 1, distributed training: False, 16-bits training: False
Traceback (most recent call last):
  File "/home/ubuntu/pytorch-pretrained-BERT/examples/lm_finetuning/simple_lm_finetuning.py", line 642, in <module>
    main()
  File "/home/ubuntu/pytorch-pretrained-BERT/examples/lm_finetuning/simple_lm_finetuning.py", line 502, in main
    raise ValueError("Training is currently the only implemented execution option. Please set `do_train`.")
ValueError: Training is currently the only implemented execution option. Please set `do_train`.
```
Benjamin Mann authored
- 07 Apr, 2019 2 commits
Dhanajit Brahma authored
dhanajitb authored
Updated `while not args.unconditional:` to `if not args.unconditional:`.
- 03 Apr, 2019 1 commit
thomwolf authored
- 01 Apr, 2019 1 commit
Mike Arpaia authored
- 30 Mar, 2019 2 commits
Weixin Wang authored
Modify 'unambigiously' to 'unambiguously'
jeonsworld authored
If the value of rand_end is returned from the randint function, np.searchsorted returns a sampled_doc_index equal to current_idx.

Example: cumsum_max = 30, doc_cumsum = [5, 7, 11, 19, 30], doc_lengths = [5, 2, 4, 8, 11]. If current_idx = 1, then rand_start = 7 and rand_end = 35, so sentence_index = randint(7, 35) % cumsum_max. If randint returns 35, sentence_index becomes 5, and np.searchsorted returns 1, which equals current_idx.
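A small reproduction of the off-by-one using the numbers from the message, assuming an inclusive upper bound as in Python's random.randint and np.searchsorted with side='right':

```python
import numpy as np

doc_lengths = [5, 2, 4, 8, 11]
doc_cumsum = np.cumsum(doc_lengths)      # [ 5  7 11 19 30]
cumsum_max = int(doc_cumsum[-1])         # 30
current_idx = 1

rand_start = doc_cumsum[current_idx]                           # 7
rand_end = rand_start + cumsum_max - doc_lengths[current_idx]  # 35

# Worst case: randint(7, 35) returns its inclusive upper bound, 35.
sentence_index = 35 % cumsum_max                               # 5
sampled_doc_index = int(np.searchsorted(doc_cumsum, sentence_index,
                                        side='right'))
print(sampled_doc_index == current_idx)  # True: the "other" doc is the current one
```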