Commits · dcff504e1806467965e2ac1f1e3864cddabaf31f · chenpangpang / transformers

24 Aug, 2022 2 commits

fixed docstring typos (#18739) · dcff504e

Juyoung Kim authored Aug 24, 2022



* fixed docstring typos

* Added missing colon
Co-authored-by: 김주영 <juyoung@zezedu.com>


Add TF implementation of `XGLMModel` (#16543) · c72d7d91

Daniel Stancl authored Aug 24, 2022



* Add TFXGLM models 

* Add todo: self.supports_xla_generation = False
Co-authored-by: Daniel Stancl <stancld@Daniels-MacBook-Pro.local>
Co-authored-by: Daniel Stancl <stancld@daniels-mbp.home>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Daniel <daniel.stancl@rossum.ai>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>


01 Aug, 2022 1 commit
- Add a check regarding the number of occurrences of ``` (#18389) · bd6d1b43
  Yih-Dar authored Aug 01, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
26 Jul, 2022 1 commit
- Replace false parameter by a buffer (#18259) · c8ed1b8b
  Sylvain Gugger authored Jul 26, 2022
  
20 Jun, 2022 1 commit

Not use -1e4 as attn mask (#17306) · d3cb2888

Yih-Dar authored Jun 20, 2022



* Use torch.finfo(self.dtype).min

* for GPTNeoX

* for Albert

* For Splinter

* Update src/transformers/models/data2vec/modeling_data2vec_audio.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* fix -inf used in Bart-like models

* Fix a few remaining -inf

* more fix

* clean up

* For CLIP

* For FSMT

* clean up

* fix test

* Add dtype argument and use it for LayoutLMv3

* update FlaxLongT5Attention
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>


31 May, 2022 1 commit

Fx support for multiple model architectures (#17393) · 28d00482

Michael Benayoun authored May 31, 2022

* Support for Bart and LayoutLM, and partial support for XLNet

* Support for mbart

* A lot of new models supported

* Support for other models

* LayoutLM fix

* Use strings instead of classes


25 May, 2022 1 commit
- Make check_init script more robust and clean inits (#17408) · 56b35ce3
  Sylvain Gugger authored May 25, 2022
  
12 May, 2022 1 commit

Black preview (#17217) · afe5d42d

Sylvain Gugger authored May 12, 2022

* Black preview

* Fixup too!

* Fix check copies

* Use the same version as the CI

* Bump black


09 May, 2022 1 commit

[WIP] Fix Pyright static type checking by replacing if-else imports with try-except (#16578) · df735d13

Dom Miketa authored May 09, 2022



* rebase and isort

* modify cookiecutter init

* fix cookiecutter auto imports

* fix clean_frameworks_in_init

* fix add_model_to_main_init

* blackify

* replace unnecessary f-strings

* update yolos imports

* fix roberta import bug

* fix yolos missing dependency

* fix add_model_like and cookiecutter bug

* fix repository consistency error

* modify cookiecutter, fix add_new_model_like

* remove stale line
Co-authored-by: Dom Miketa <dmiketa@exscientia.co.uk>


25 Apr, 2022 1 commit
- Fix issue probably-meant-fstring found at https://codereview.doctor (#16913) · 65687520
  code-review-doctor authored Apr 25, 2022
  
19 Apr, 2022 1 commit

[Flax] improve large model init and loading (#16148) · d3bd9ac7

Suraj Patil authored Apr 19, 2022



* begin do_init

* add params_shape_tree

* raise error if params are accessed when do_init is False

* don't allow do_init=False when keys are missing

* make shape tree a property

* assign self._params at the end

* add test for do_init

* add do_init arg to all flax models

* fix param setting

* disable do_init for composite models

* update test

* add do_init in FlaxBigBirdForMultipleChoice

* better names and errors

* improve test

* style

* add a warning when do_init=False

* remove extra if

* set params after _required_params

* add test for from_pretrained

* do_init => _do_init

* change warning to info

* fix typo

* add params in init_weights

* add params to gpt neo init

* add params to init_weights

* update do_init test

* Trigger CI

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* update template

* trigger CI

* style

* style

* fix template
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>


12 Apr, 2022 1 commit

Replace assertion with exception (#16720) · cc034f72

Anmol Joshi authored Apr 12, 2022



* Updated assertions to exceptions

* updated assertions to exceptions

* bug fixes

* fix-copies

* Update modeling_ctrl.py

* Update src/transformers/models/ctrl/modeling_tf_ctrl.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/gpt_neo/modeling_gpt_neo.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/gptj/modeling_gptj.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/gptj/modeling_tf_gptj.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update modeling_led.py

* Update modeling_led.py

* Update modeling_led.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>


04 Apr, 2022 1 commit
- Fix flax import in __init__.py: modeling_xglm -> modeling_flax_xglm (#16556) · ec4da72f
  Daniel Stancl authored Apr 04, 2022
  
01 Apr, 2022 1 commit

Adding missing type hints for mBART model (PyTorch) (#16429) · 5fe06b9b

Rishav Chandra Varma authored Apr 01, 2022



* added type hints for mbart tensorflow tf implementation

* Adding missing type hints for mBART model 

Tensorflow Implementation model added with missing type hints

* Missing Type hints - correction

For TF model

* Code fixup using make quality tests

* Hint types - typo error

* make fix-copies and make fixup

* type hints

* updated files

* type hints update

* making dependent models coherent
Co-authored-by: matt <rocketknight1@gmail.com>


31 Mar, 2022 1 commit

added type hints to xglm pytorch (#16500) · b808d8a5

Mowaninuola Osifeso authored Mar 31, 2022



* added type hints to xglm pytorch

* Update src/transformers/models/xglm/modeling_xglm.py

* Update src/transformers/models/xglm/modeling_xglm.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>


25 Mar, 2022 1 commit
- Big file_utils cleanup (#16396) · 088c1880
  Sylvain Gugger authored Mar 25, 2022
```
* Big file_utils cleanup

* This one still needs to be treated separately
```
23 Mar, 2022 1 commit

Reorganize file utils (#16264) · 4975002d

Sylvain Gugger authored Mar 23, 2022

* Split file_utils in several submodules

* Fixes

* Add back more objects

* More fixes

* Who exactly decided to import that from there?

* Second suggestion to code with code review

* Revert wrong move

* Fix imports

* Adapt all imports

* Adapt all imports everywhere

* Revert this import, will fix in a separate commit


22 Mar, 2022 1 commit
- add xglm conversion script (#16305) · 7865f4d0
  Suraj Patil authored Mar 22, 2022
```
* add xglm conversion script

* style

* update script
```
21 Mar, 2022 2 commits
- fix last element in hidden_states for XGLM (#16301) · 4b277483
  Yih-Dar authored Mar 21, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
- Fix XGLM cross attention (#16290) · 641e5f3f
  Suraj Patil authored Mar 21, 2022
  
16 Mar, 2022 1 commit
- Replace all deprecated `jax.ops` operations with jnp's `at` (#16078) · ee27b3d7
  Sanchit Gandhi authored Mar 16, 2022
```
* Replace all deprecated `jax.ops` operations with jnp's `at`

* np to jnp scores

* suggested changes
```
23 Feb, 2022 1 commit
- [M2M100, XGLM] fix create_position_ids_from_inputs_embeds (#15751) · 24588c67
  Suraj Patil authored Feb 23, 2022
  
09 Feb, 2022 1 commit
- Upgrade black to version ~=22.0 (#15565) · 7732d0fe
  Lysandre Debut authored Feb 09, 2022
```
* Upgrade black to version ~=22.0

* Check copies

* Fix code
```
01 Feb, 2022 1 commit
- [M2M100, XGLM] fix positional emb resize (#15444) · 1c9648c4
  Suraj Patil authored Feb 01, 2022
  
31 Jan, 2022 2 commits
- correct positional emb size (#15441) · a5ecbf73
  Suraj Patil authored Jan 31, 2022
  
- import torch.utils.checkpoint (#15427) · 38dfb40a
  Suraj Patil authored Jan 31, 2022
  
30 Jan, 2022 1 commit
- [XGLMTokenizer] fix init and add in AutoTokenizer (#15406) · 0f69b924
  Suraj Patil authored Jan 30, 2022
  
28 Jan, 2022 1 commit

Add XGLM models (#14876) · d25e25ee

Suraj Patil authored Jan 28, 2022



* add xglm

* update vocab size

* fix model name

* style and tokenizer

* typo

* no mask token

* fix pos embed compute

* fix args

* fix tokenizer

* fix positions

* fix tokenization

* style and dic fixes

* fix imports

* add fast tokenizer

* update names

* add pt tests

* fix tokenizer

* fix typo

* fix tokenizer import

* fix fast tokenizer

* fix tokenizer

* fix converter

* add tokenizer test

* update checkpoint names

* fix tokenizer tests

* fix slow tests

* add copied from comments

* rst -> mdx

* flax model

* update flax tests

* quality

* style

* doc

* update index and readme

* fix copies

* fix doc

* update toctree

* fix indent

* minor fixes

* fix config doc

* don't save embed_pos weights

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* address Sylvain's comments, a few doc fixes

* fix check_repo

* align order of arguments

* fix copies

* fix labels

* remove unnecessary mapping

* fix saving tokenizer
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
