- 03 Apr, 2023 17 commits
-
-
Xuehai Pan authored
* [setup] migrate setup script to `pyproject.toml` * [setup] cleanup configurations * remove unused imports
-
Vladimir Blagojevic authored
-
Xuehai Pan authored
* [setup] drop deprecated `distutils` usage * drop deprecated `distutils.util.strtobool` usage * fix import order * reformat docstring by `doc-builder`
-
Ilya authored
-
Younes Belkada authored
* enable PP for T5 * make fixup * fix failing tests
-
Younes Belkada authored
[`Trainer`] Force `is_model_parallel` when model is loaded in multiple GPUs using `accelerate` (#22532) * add `is_model_parallel` arg on Trainer * add warning * adapt from suggestions * revert t5 changes * remove commas * adapt from suggestions
-
zhbh01 authored
-
Thibault Douzon authored
LayoutLMv3TokenizerFast produces an empty 'Ġ' token with `offset_mapping = (0, 0)`. The next token is then wrongly assumed to also be the beginning of a word and is not correctly assigned `pad_token_label`. Modify the test with text that produces a 'Ġ' token. Remove the copy check from LayoutLMv2TokenizerFast for `_batch_encode_plus`. Solves issue #19978.
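The labeling rule the fix restores can be sketched as follows. This is an illustrative simplification, not the transformers implementation: a token whose `offset_mapping` is `(0, 0)` is treated as empty/special and receives the padding label, and only the first sub-word of each word gets the word's label. The helper name and the `-100` ignore-index are assumptions.

```python
# Illustrative sketch: align word-level labels to sub-word tokens.
# A token with offset_mapping == (0, 0) is empty/special and must never
# be treated as the beginning of a word.

PAD_TOKEN_LABEL = -100  # common ignore-index convention; an assumption here

def align_labels(offset_mapping, word_ids, word_labels):
    """Give each token its word's label if it starts a word, else the pad label."""
    labels = []
    previous_word = None
    for (start, end), word_id in zip(offset_mapping, word_ids):
        if (start, end) == (0, 0) or word_id is None:
            # empty/special token: gets the padding label
            labels.append(PAD_TOKEN_LABEL)
        elif word_id != previous_word:
            labels.append(word_labels[word_id])  # first sub-word of a word
        else:
            labels.append(PAD_TOKEN_LABEL)       # continuation sub-word
        if word_id is not None:
            previous_word = word_id
    return labels
```

Without the `(0, 0)` check, the empty token would consume the "word beginning" slot and the real first sub-word would be mislabeled.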
-
Kirill authored
-
larekrow authored
`load_checkpoint()` silently does nothing because `".qkj_proj." in key` is always `False`; the mistake eventually causes an error at `model.load_state_dict(state_dict)`.
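A minimal sketch of this failure mode, with entirely illustrative key names and conversion logic (the real script differs): a typo in the substring test means the checkpoint key is never rewritten, so the converted state dict no longer matches what the model expects, and the mismatch only surfaces at load time.

```python
# Sketch of the bug: ".qkj_proj." (typo) never matches a real checkpoint key,
# so the intended rewrite silently never happens.

def convert(state_dict):
    out = {}
    for key, value in state_dict.items():
        if ".qkj_proj." in key:  # typo: should be ".qkv_proj."
            # intended: split the fused projection into q/k/v entries
            for name in ("q_proj", "k_proj", "v_proj"):
                out[key.replace("qkj_proj", name)] = value
        else:
            out[key] = value     # fused key passes through untouched
    return out

checkpoint = {"layers.0.qkv_proj.weight": [1.0, 2.0, 3.0]}
expected = {"layers.0.q_proj.weight", "layers.0.k_proj.weight", "layers.0.v_proj.weight"}

converted = convert(checkpoint)
unexpected = set(converted) - expected  # the fused key survives...
missing = expected - set(converted)     # ...and the split keys are absent
```

A strict `load_state_dict` would then report exactly these `missing`/`unexpected` keys.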
-
Joao Gante authored
* haha text go brrr (but in gradio)
-
Mohammed Jabir authored
* added biogpt token classifier * fix reviews * Updated modeling_biogpt.py --------- Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
-
Jungnerd authored
docs: ko: sagemaker.mdx
-
Arthur authored
* draft * update tokenization llama and conversion script * more updates * initial commit * style * default pad to None * draft tokenization tests * update test * update tokenization tests * nits * update * versioning test * major fix * fix more tests * finish fixing special masks * last nit * more nits * add encode decode tests * add more * fix token type ids * style
-
Eli Simhayev authored
added `> 0.5` to `past_observed_mask`
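A hedged sketch of what this comparison does, assuming `past_observed_mask` arrives as floats (1.0 = observed, 0.0 = missing): comparing against 0.5 yields a clean boolean mask before it is used to decide which values to keep. The helper names are illustrative, not the library's API.

```python
# Illustrative: turn a float observation mask into booleans with `> 0.5`,
# then use it to zero out unobserved values.

def booleanize_mask(past_observed_mask):
    return [value > 0.5 for value in past_observed_mask]

def masked_values(values, past_observed_mask, fill=0.0):
    keep = booleanize_mask(past_observed_mask)
    return [v if k else fill for v, k in zip(values, keep)]
```

The threshold makes the code robust to masks stored as floats rather than relying on truthiness of exact 0.0/1.0 values.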
-
amyeroberts authored
* Add out_indices to backbones, deprecate out_features * Update - can specify both out_features and out_indices but not both * Can specify both * Fix copies * Add out_indices to convnextv2 configuration
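The configuration rule described above can be sketched like this. The function name and validation details are assumptions, not the exact transformers API: a backbone exposes named stages, callers may pass `out_features`, `out_indices`, or both, and when both are given they must agree.

```python
# Hedged sketch: reconcile out_features (stage names) with out_indices
# (stage positions), defaulting to the last stage when neither is given.

def resolve_outputs(stage_names, out_features=None, out_indices=None):
    if out_features is None and out_indices is None:
        out_indices = [len(stage_names) - 1]  # default: last stage only
    if out_indices is None:
        out_indices = [stage_names.index(name) for name in out_features]
    if out_features is None:
        out_features = [stage_names[i] for i in out_indices]
    if [stage_names[i] for i in out_indices] != out_features:
        raise ValueError("out_features and out_indices do not match")
    return out_features, out_indices

# usage: pick stages 1 and 3 of a hypothetical 4-stage backbone
features, indices = resolve_outputs(
    ["stem", "stage1", "stage2", "stage3"], out_indices=[1, 3]
)
```

Keeping the two representations synchronized lets downstream code ask for stages by either name or position.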
-
kevinpro authored
-
- 31 Mar, 2023 6 commits
-
-
Sylvain Gugger authored
* Test fetcher v2 * Fix regexes * Remove sanity check * Fake modification to OPT * Fixes some .sep issues * Remove fake OPT change * Fake modif for BERT * Fake modif for init * Exclude SageMaker tests * Fix test and remove fake modif * Fake setup modif * Fake pipeline modif * Remove all fake modifs * Adds options to skip/force tests * [test-all-models] Fake modif for BERT * Try this way * Does the command actually work? * [test-all-models] Try again! * [skip circleci] Remove fake modif * Remove debug statements * Add the list of important models * Quality * Update utils/tests_fetcher.py * Address review comments * Address review comments * Fix and add test * Apply suggestions from code review * Address review comments --------- Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
-
Sabine authored
* update NeptuneCallback docstring * formatting * apply make style --------- Co-authored-by: Aleksander Wojnarowicz <alwojnarowicz@gmail.com>
-
dependabot[bot] authored
Bump redis in /examples/research_projects/decision_transformer Bumps [redis](https://github.com/redis/redis-py) from 4.5.3 to 4.5.4. - [Release notes](https://github.com/redis/redis-py/releases) - [Changelog](https://github.com/redis/redis-py/blob/master/CHANGES) - [Commits](https://github.com/redis/redis-py/compare/v4.5.3...v4.5.4) --- updated-dependencies: - dependency-name: redis dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
Nicolas Patry authored
* Making sure we can use safetensors to serialize all the time. * Expanding the tests for increased coverage. * Update the test. * Getting current state of affairs. * Tentative fix. * Fixing black version. * Fixing the worst offenders. * Try to modify less files. * Fixing blip_2 (Weird solution right now). * Fixing deta. * Fix blip ? * Missing extra newline. * No deta modification. * Adding some comments. * Apply suggestions from code review * Addressing comments. * Addressing comments. * creating warn_once. * Warning_once ! --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Yih-Dar authored
fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
lewtun authored
* Relax checks from errors to warnings * Fix style * Replace warnings with logger * Use warning vs warn
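The "replace warnings with logger" change follows a common pattern that can be sketched with the stdlib `logging` module. The check itself is a placeholder; only the error-to-warning relaxation is the point.

```python
# Sketch: report a violated check through a module logger instead of raising,
# using logging.Logger.warning (preferred over the deprecated .warn alias).
import logging

logger = logging.getLogger(__name__)

def check_config(value, maximum=10):
    if value > maximum:
        # was: raise ValueError(...); now a non-fatal warning
        logger.warning("value %s exceeds the recommended maximum %s", value, maximum)
        return False
    return True
```

Using a logger rather than `warnings.warn` lets users control verbosity through the library's logging configuration instead of Python's warning filters.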
-
- 30 Mar, 2023 10 commits
-
-
Yih-Dar authored
* Enable Nightly + Past CI * put schedule --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Manuel de Prada authored
Docs fix: Multinomial sampling decoding needs `num_beams=1`, since by default it is usually not 1 (here it is 5). (#22473)
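For context, multinomial sampling draws the next token at random in proportion to its probability, which is why it only applies with a single beam. A minimal stdlib sketch (not the transformers implementation) of one sampling step:

```python
# Minimal sketch of multinomial (ancestral) sampling over next-token logits,
# the decoding mode the doc fix refers to (do_sample=True with num_beams=1).
import math
import random

def softmax(logits):
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def sample_next_token(logits, rng=random):
    probs = softmax(logits)
    # random.choices draws one index according to the given weights
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]
```

With several beams, each beam instead keeps its top-scoring continuations deterministically, so the two strategies cannot be combined naively.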
-
Joao Gante authored
* Llama now supports max_position_embeddings * Save config; Cosmetic edits
-
Arthur authored
edit default model type and set testing path to hf-internal-testing
-
Roy Hvaara authored
Guard imports that use the tokenizers library
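The guard pattern referenced here can be sketched with the stdlib `importlib` machinery. The helper names are assumptions, not the transformers internals: check whether the optional dependency is importable before importing anything that needs it.

```python
# Sketch: detect an optional dependency without importing it, and import
# it lazily only when present.
import importlib
import importlib.util

def is_available(name):
    """True if the (optional) dependency can be imported."""
    return importlib.util.find_spec(name) is not None

def optional_import(name):
    """Import a module if available, else return None instead of raising."""
    if is_available(name):
        return importlib.import_module(name)
    return None
```

Modules that need the dependency can then be imported inside an `if is_available("tokenizers"):` block, so a missing optional package never breaks top-level imports.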
-
amyeroberts authored
Fix ordering of height,width for BLIP
-
Joao Gante authored
* haha tokens go brrrr
-
amyeroberts authored
Skip flaky test for now
-
amyeroberts authored
* Rescale image back if it was scaled during PIL conversion * do_rescale is defined if PIL image passed in
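The round-trip described above can be sketched as follows (illustrative, not the exact image-processor code): pixel values in [0, 1] are scaled to [0, 255] for PIL conversion, so `do_rescale` must scale them back afterwards or the values end up 255x too large.

```python
# Sketch: scale [0, 1] floats to the [0, 255] integer range PIL expects,
# and undo that scaling after conversion.

def to_pil_range(pixels):
    return [round(p * 255) for p in pixels]

def rescale_back(pixels, denom=255):
    return [p / denom for p in pixels]
```

Tracking whether the scaling step happened (the `do_rescale` flag in the commit) is what prevents double-scaling when the input was already a PIL image.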
-
amyeroberts authored
* Move common properties to BackboneMixin * Fix failing tests * Update ConvNextV2 backbone
-
- 29 Mar, 2023 7 commits
-
-
Stefan Heng authored
* Update: ignore padding support for TransfoXL training when `n_clusters == 0` * Update: Transformer-XL always pads * Update: drop doc
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sabine authored
-
jeffhataws authored
This reverts commit fd81746dbec5f17c8285a0fdc72ca4b4c025cc33.
-
Younes Belkada authored
fix slow test
-