Commits · 2da8853775b61cde0894dee17c6c713aba711688 · chenpangpang / transformers

18 Sep, 2023 1 commit

[`Tokenizer`] attemp to fix add_token issues

(#23909) · 2da88537

Arthur authored Sep 18, 2023

* fix test for bart. Order is correct now let's skip BPEs

* ouf

* styling

* fix bert....

* slow refactoring

* current updates

* massive refactoring

* update

* NICE!

* update to see where I am at

* updates

* update

* update

* revert

* updates

* updates

* start supporting legacy_save

* styling

* big update

* revert some changes

* nits

* nniiiiiice

* small fixes

* kinda fix t5 with new behaviour

* major update

* fixup

* fix copies

* today's updates

* fix byt5

* upfate

* update

* update

* updates

* update vocab size test

* Barthez does not use not need the fairseq offset ids

* super calll must be after

* calll super

* move all super init

* move other super init

* fixup

* nits

* more fixes

* nits

* more fixes

* nits

* more fix

* remove useless files

* ouch all of them are affected
...

2da88537

07 Feb, 2023 1 commit

Cleanup quality (#21493) · 67d07487

Sylvain Gugger authored Feb 07, 2023

* Remove mentions of flake8/isort

* Clean up inits

* Deall with all other inits

* Last special rule for dummy files

67d07487

17 Feb, 2022 1 commit

Add SimMIM (#15586) · 57882177

NielsRogge authored Feb 17, 2022



* Add first draft

* Make model importable

* Make SwinForMaskedImageModeling importable

* Fix imports

* Add missing inits

* Add support for Swin

* Fix bug

* Fix bug

* Fix another bug

* Fix Swin MIM implementation

* Fix default encoder stride

* Fix Swin

* Add print statements for debugging

* Add image_size data argument

* Fix Swin

* Fix image_size

* Add print statements for debugging

* Fix print statement

* Remove print statements

* Improve reshaping of bool_masked_pos

* Add support for DeiT, fix tests

* Improve docstrings

* Apply new black version

* Improve script

* Fix bug

* Improve README

* Apply suggestions from code review

* Remove DS_Store and add to gitignore

* Apply suggestions from code review + fix BEiT Flax

* Revert BEiT changes

* Improve README

* Fix code quality

* Improve README
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>
Co-authored-by: Niels Rogge <nielsr...

57882177

06 Apr, 2021 1 commit

Auto feature extractor (#11097) · 403d530e

Sylvain Gugger authored Apr 06, 2021

* AutoFeatureExtractor

* Init and first tests

* Tests

* Damn you gitignore

* Quality

* Defensive test for when not all backends are here

* Use pattern for Speech2Text models

403d530e

08 Dec, 2020 1 commit

New squad example (#8992) · 447808c8

Sylvain Gugger authored Dec 08, 2020



* Add new SQUAD example

* Same with a task-specific Trainer

* Address review comment.

* Small fixes

* Initial work for XLNet

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Final clean up and working XLNet script

* Test and debug

* Final working version

* Add new SQUAD example

* Same with a task-specific Trainer

* Address review comment.

* Small fixes

* Initial work for XLNet

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Final clean up and working XLNet script

* Test and debug

* Final working version

* Add tick

* Update README

* Address review comments
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

447808c8

17 Nov, 2020 1 commit

Reorganize repo (#8580) · c89bdfbe

Sylvain Gugger authored Nov 16, 2020

* Put models in subfolders

* Styling

* Fix imports in tests

* More fixes in test imports

* Sneaky hidden imports

* Fix imports in doc files

* More sneaky imports

* Finish fixing tests

* Fix examples

* Fix path for copies

* More fixes for examples

* Fix dummy files

* More fixes for example

* More model import fixes

* Is this why you're unhappy GitHub?

* Fix imports in conver command

c89bdfbe

19 Oct, 2020 1 commit
- [EncoderDecoder] Fix Typo (#7915) · c912ba5f
  Patrick von Platen authored Oct 19, 2020
```
* fix encoder decoder models

* add .gitignore
```
  c912ba5f
18 Oct, 2020 1 commit

[Dependencies|tokenizers] Make both SentencePiece and Tokenizers optional dependencies (#7659) · ba8c4d0a

Thomas Wolf authored Oct 18, 2020

* splitting fast and slow tokenizers [WIP]

* [WIP] splitting sentencepiece and tokenizers dependencies

* update dummy objects

* add name_or_path to models and tokenizers

* prefix added to file names

* prefix

* styling + quality

* spliting all the tokenizer files - sorting sentencepiece based ones

* update tokenizer version up to 0.9.0

* remove hard dependency on sentencepiece 🎉

* and removed hard dependency on tokenizers 🎉

* update conversion script

* update missing models

* fixing tests

* move test_tokenization_fast to main tokenization tests - fix bugs

* bump up tokenizers

* fix bert_generation

* update ad fix several tokenizers

* keep sentencepiece in deps for now

* fix funnel and deberta tests

* fix fsmt

* fix marian tests

* fix layoutlm

* fix squeezebert and gpt2

* fix T5 tokenization

* fix xlnet tests

* style

* fix mbart...

ba8c4d0a

12 Oct, 2020 1 commit
- [marian] Automate Tatoeba-Challenge conversion (#7709) · 9c2b2db2
  Sam Shleifer authored Oct 12, 2020
  
  9c2b2db2
22 Sep, 2020 1 commit

RAG (#6813) · c754c41c

Ola Piktus authored Sep 22, 2020

* added rag WIP

* path fix

* Formatting / renaming prior to actual work

* added rag WIP

* path fix

* Formatting / renaming prior to actual work

* added rag WIP

* path fix

* Formatting / renaming prior to actual work

* added rag WIP

* Formatting / renaming prior to actual work

* First commit

* improve comments

* Retrieval evaluation scripts

* refactor to include modeling outputs + MPI retriever

* Fix rag-token model + refactor

* Various fixes + finetuning logic

* use_bos fix

* Retrieval refactor

* Finetuning refactoring and cleanup

* Add documentation and cleanup

* Remove set_up_rag_env.sh file

* Fix retrieval wit HF index

* Fix import errors

* Fix quality errors

* Refactor as per suggestions in https://github.com/huggingface/transformers/pull/6813#issuecomment-687208867

* fix quality

* Fix RAG Sequence generation

* minor cleanup plus initial tests

* fix test

* fix tests 2

* Comments fix

* post-merge...

c754c41c

05 Jun, 2020 1 commit
- Add .vs to gitignore (#4774) · ceaab8dd
  Sylvain Gugger authored Jun 05, 2020
  
  ceaab8dd
04 Jun, 2020 1 commit

Tensorflow improvements (#4530) · f9414f75

Julien Plu authored Jun 05, 2020



* Better None gradients handling

* Apply Style

* Apply Style

* Create a loss class per task to compute its respective loss

* Add loss classes to the ALBERT TF models

* Add loss classes to the BERT TF models

* Add question answering and multiple choice to TF Camembert

* Remove prints

* Add multiple choice model to TF DistilBERT + loss computation

* Add question answering model to TF Electra + loss computation

* Add token classification, question answering and multiple choice models to TF Flaubert

* Add multiple choice model to TF Roberta + loss computation

* Add multiple choice model to TF XLM + loss computation

* Add multiple choice and question answering models to TF XLM-Roberta

* Add multiple choice model to TF XLNet + loss computation

* Remove unused parameters

* Add task loss classes

* Reorder TF imports + add new model classes

* Add new model classes

* Bugfix in TF T5 model

* Bugfix for TF T5 tests

* Bugfix in TF T5 model

* Fix TF T5 model tests

* Fix T5 tests + some renaming

* Fix inheritance issue in the AutoX tests

* Add tests for TF Flaubert and TF XLM Roberta

* Add tests for TF Flaubert and TF XLM Roberta

* Remove unused piece of code in the TF trainer

* bugfix and remove unused code

* Bugfix for TF 2.2

* Apply Style

* Divide TFSequenceClassificationAndMultipleChoiceLoss into their two respective name

* Apply style

* Mirror the PT Trainer in the TF one: fp16, optimizers and tb_writer as class parameter and better dataset handling

* Fix TF optimizations tests and apply style

* Remove useless parameter

* Bugfix and apply style

* Fix TF Trainer prediction

* Now the TF models return the loss such as their PyTorch couterparts

* Apply Style

* Ignore some tests output

* Take into account the SQuAD cls_index, p_mask and is_impossible parameters for the QuestionAnswering task models.

* Fix names for SQuAD data

* Apply Style

* Fix conflicts with 2.11 release

* Fix conflicts with 2.11

* Fix wrongname

* Add better documentation on the new create_optimizer function

* Fix isort

* logging_dir: use same default as PyTorch
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

f9414f75

07 May, 2020 1 commit

BIG Reorganize examples (#4213) · 0ae96ff8

Julien Chaumond authored May 07, 2020

* Created using Colaboratory

* [examples] reorganize files

* remove run_tpu_glue.py as superseded by TPU support in Trainer

* Bugfix: int, not tuple

* move files around

0ae96ff8

05 May, 2020 1 commit

Trainer: add logging through Weights & Biases (#3916) · 818463ee

Boris Dayma authored May 04, 2020



* feat: add logging through Weights & Biases

* feat(wandb): make logging compatible with all scripts

* style(trainer.py): fix formatting

* [Trainer] Tweak wandb integration
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

818463ee

22 Apr, 2020 1 commit

Trainer (#3800) · dd9d483d

Julien Chaumond authored Apr 21, 2020

* doc

* [tests] Add sample files for a regression task

* [HUGE] Trainer

* Feedback from @sshleifer

* Feedback from @thomwolf + logging tweak

* [file_utils] when downloading concurrently, get_from_cache will use the cached file for subsequent processes

* [glue] Use default max_seq_length of 128 like before

* [glue] move DataTrainingArguments around

* [ner] Change interface of InputExample, and align run_{tf,pl}

* Re-align the pl scripts a little bit

* ner

* [ner] Add integration test

* Fix language_modeling with API tweak

* [ci] Tweak loss target

* Don't break console output

* amp.initialize: model must be on right device before

* [multiple-choice] update for Trainer

* Re-align to 827d6d6e

dd9d483d

23 Feb, 2020 1 commit
- add_ctags_to_git_ignore (#2984) · 38f5fe9e
  Patrick von Platen authored Feb 23, 2020
  
  38f5fe9e
17 Feb, 2020 1 commit
- update .gitignore to ignore .swp files created when using vim · fb4d8d08
  Patrick von Platen authored Feb 17, 2020
  
  fb4d8d08
06 Jan, 2020 2 commits
- GPU text generation: mMoved the encoded_prompt to correct device · 81d6841b
  alberduris authored Dec 31, 2019
  
  81d6841b
- Moved the encoded_prompts to correct device · dd4df80f
  alberduris authored Dec 31, 2019
  
  dd4df80f
12 Nov, 2019 1 commit
- whitespace · dd6b2e05
  Julien Chaumond authored Nov 11, 2019
  
  dd6b2e05
09 Oct, 2019 1 commit
- Pycharm folder added to gitignore · e17ea08e
  LysandreJik authored Oct 09, 2019
  
  e17ea08e
04 Oct, 2019 1 commit

Adding CTRL (squashed commit) · dbed1c5d

keskarnitish authored Sep 30, 2019

adding conversion script

adding first draft of modeling & tokenization

adding placeholder for test files

bunch of changes

registering the tokenizer/model/etc

tests

change link; something is very VERY wrong here

weird end-of-word thingy going on

i think the tokenization works now ; wrote the unit tests

overall structure works;load w next

the monster is alive!

works after some cleanup as well

adding emacs autosave to gitignore

currently only supporting the 48 layer one; seems to infer fine on my macbook

cleanup

fixing some documentation

fixing some documentation

tests passing?

now works on CUDA also

adding greedy?

adding greedy sampling

works well

dbed1c5d

24 Sep, 2019 1 commit
- updated data processor and metrics · b5ec526f
  thomwolf authored Sep 24, 2019
  
  b5ec526f
05 Sep, 2019 1 commit
- gitignore · 04b50cab
  VictorSanh authored Sep 05, 2019
  
  04b50cab
20 Aug, 2019 2 commits
- adding proxies options for the from_pretrained methods · 43489756
  thomwolf authored Aug 20, 2019
  
  43489756
- various fix and clean up on run_lm_finetuning · a690edab
  thomwolf authored Aug 20, 2019
  
  a690edab
09 Jul, 2019 1 commit
- adding tests to examples - updating summary module - coverage update · d5481cbe
  thomwolf authored Jul 09, 2019
  
  d5481cbe
24 Jun, 2019 1 commit
- updating run_xlnet_classifier · 24ed0b93
  thomwolf authored Jun 24, 2019
  
  24ed0b93
20 Jun, 2019 1 commit
- update gitignore · b407972e
  thomwolf authored Jun 20, 2019
  
  b407972e
05 Feb, 2019 2 commits
- gitignore · 0ad9b239
  thomwolf authored Feb 05, 2019
  
  0ad9b239
- more explicit notation: num_train_step => num_train_optimization_steps · 1579c536
  thomwolf authored Feb 05, 2019
  
  1579c536
15 Jan, 2019 1 commit
- conversion working · 7d03c537
  thomwolf authored Jan 15, 2019
  
  7d03c537
05 Nov, 2018 1 commit
- update gitignore · 3a301d44
  thomwolf authored Nov 05, 2018
  
  3a301d44
31 Oct, 2018 1 commit
- switch to full google code · 13ee61e4
  thomwolf authored Oct 31, 2018
  
  13ee61e4
30 Oct, 2018 1 commit
- getting ready · ccce66be
  thomwolf authored Oct 30, 2018
  
  ccce66be
29 Oct, 2018 1 commit
- Initial commit · 43badf21
  Thomas Wolf authored Oct 29, 2018
  
  43badf21