Commits · 26ef0f1991341528ef2fef117b753196461ef812 · chenpangpang / transformers

14 Feb, 2023 13 commits
- fix: Race Condition when using Sagemaker Checkpointing and Model Repository (#21614) · 26ef0f19
  Douglas Trajano authored Feb 14, 2023
```
* Add _add_sm_patterns_to_gitignore

* Add _is_world_process_zero() call and move patterns arg to constant

* Update git status time.sleep

* Apply make style
```
  26ef0f19
- Fix typo in QA task guide (#21608) · 7bce8042
  Steven Liu authored Feb 14, 2023
```
fix typo
```
  7bce8042
- Error (also in original) model, scaling only q matrix not qk.T dot product... · bad83008
  Benoit authored Feb 14, 2023
```
Error (also in original) model, scaling only q matrix not qk.T dot product (qk.T/sqrt(dim_per_head)) (#21627)

* Error in model, scaling only q matrix not qK.T dot product (qk.T/sqrt(dim_per_head))

As per Vaswani et al, 2017 p.4
Is torch.matmul(q, k.transpose(2, 3)) / math.sqrt(dim_per_head) not q / math.sqrt(dim_per_head)
https://arxiv.org/pdf/1912.05372.pdf

Error was in original FlauBERT repo and effectively scales queries but not values
cf. https://github.com/getalp/Flaubert/pull/45/commits/6d176880ca3a1a8dfa2b76c97030bb51c5e917b8

* Update modeling_flaubert.py

Update to https://github.com/huggingface/transformers/pull/21627
make fixup
make repo_consistency

* Update modeling_xlm.py

* Update modeling_flaubert.py

* Update modeling_xlm.py
```
  bad83008
- Fix typo in documentation. (#21632) · aaf6795f
  Matthew McDermott authored Feb 14, 2023
  
  aaf6795f
- Removes duplicate computations in DETR post processing (#21592) · d3b1adf5
  Vitali Petsiuk authored Feb 14, 2023
```
* Remove redundant computations, comb variable names

* Fix scores to cur_scores
```
  d3b1adf5
- Fix generation config for empty state dict (#21630) · d4ba6e1a
  Sylvain Gugger authored Feb 14, 2023
  
  d4ba6e1a
- Fix the real failing test · 31728292
  Sylvain Gugger authored Feb 14, 2023
  
  31728292
- Remove Niels from templates (#21564) · 22888d30
  Sylvain Gugger authored Feb 14, 2023
  
  22888d30
- Final cleanup of TOKENIZER_FOR_DOC (#21565) · 68b21b37
  Sylvain Gugger authored Feb 14, 2023
```
FInal cleanup of TOKENIZER_FOR_DOC
```
  68b21b37
- Skip failing test · c6f163c7
  Sylvain Gugger authored Feb 14, 2023
  
  c6f163c7
- Generate: input expansion for any model input (#21624) · a81fe4e1
  Joao Gante authored Feb 14, 2023
  
  a81fe4e1
- Generate: filter encoder inputs when its signature does not accept wildcards (#21603) · 13e03e61
  Joao Gante authored Feb 14, 2023
  
  13e03e61
- Enable `requires_grad` on input embedding to train on top of frozen layers (#21598) · 41fa672d
  Younes Belkada authored Feb 14, 2023
```
* v1

* make fixup

* add more methods
```
  41fa672d
13 Feb, 2023 21 commits

Add in big model inference to issue template (#21611) · 8c502662
Zachary Mueller authored Feb 13, 2023
```
* Add in big model inference to issue template

* Trigger

* Untrigger

* empty test commit
```
8c502662
Fix TF CTC tests (#21606) · 56b03c96
Joao Gante authored Feb 13, 2023

56b03c96

Fix env. variable type issue in testing (#21609) · cbecf121

Yih-Dar authored Feb 13, 2023



* fix env issue

* fix env issue

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

cbecf121

Clarify available pipelines in quicktour (#21607) · 5987e0ab
Steven Liu authored Feb 13, 2023
```
clarify available pipelines
```
5987e0ab

[deepspeed] performance docs (#21573) · 101b9a7e

Stas Bekman authored Feb 13, 2023



* [deepspeed] performance docs

* fix

* re-org

* update

* update

* a new NCCL Collectives section

* inference

* Update docs/source/en/main_classes/deepspeed.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* suggestion

* Update docs/source/en/main_classes/deepspeed.mdx

* suggestion

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

101b9a7e

Update setup.py (#21584) · 68eff403
Stas Bekman authored Feb 13, 2023
```
* Update setup.py

* suggestions
```
68eff403
[i18n-fr] Translate quicktour page to French (#21589) · a27074ab
Nolwenn Bernard authored Feb 13, 2023
```
* Translate quicktour to French

* Traduction missing task
```
a27074ab
Generate: correct default model input creation for decoder-only models (#21580) · fa4bdb0a
Joao Gante authored Feb 13, 2023

fa4bdb0a

Fix Blip-2 CI (#21595) · edc1e734

Yih-Dar authored Feb 13, 2023



* use fp16

* use fp16

* use fp16

* use fp16

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

edc1e734

Add missing arguemtn to run_clip.py (#21588) · fd5320bb
Warren Green authored Feb 13, 2023

fd5320bb
Correct Markdown bullets indentation (#21583) · 1210c72e
Yi Wang authored Feb 13, 2023

1210c72e

Bump ipython from 8.1.1 to 8.10.0 in /examples/research_projects/decision_transformer (#21577) · 92487f5d

dependabot[bot] authored Feb 13, 2023

Bump ipython in /examples/research_projects/decision_transformer

Bumps [ipython](https://github.com/ipython/ipython) from 8.1.1 to 8.10.0.
- [Release notes](https://github.com/ipython/ipython/releases)
- [Commits](https://github.com/ipython/ipython/compare/8.1.1...8.10.0

)

---
updated-dependencies:
- dependency-name: ipython
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

92487f5d

annotated TFvisionEncoderDecoder input type hints (#21432) · dee4d72e

Billy Lee authored Feb 13, 2023



* annotated TFvisionEncoderDecoder input type hints
Co-authored-by: JuheonChu <chuj@dickinson.edu>
Co-authored-by: AdiaWu <wua@dickinson.edu>

* fixed failing tests

* make fix-copies

* failed test fix

* style fix

* revert

---------
Co-authored-by: JuheonChu <chuj@dickinson.edu>
Co-authored-by: AdiaWu <wua@dickinson.edu>
Co-authored-by: Matt <rocketknight1@gmail.com>

dee4d72e

[`bnb`] Let's make the daily CI green 🍏 (#21597) · 1666c42f
Younes Belkada authored Feb 13, 2023
```
* fix bnb slow test

* make fixup
```
1666c42f
Generate: Fix flaky indexing error in `test_constrained_beam_search_generate_dict_output` (#21561) · 24273268
Joao Gante authored Feb 13, 2023

24273268
Add `inputs_embeds` support when generating with GPT-J (#21575) · 93ed89bf
Dzmitry Pletnikau authored Feb 13, 2023

93ed89bf

[MINOR] Fix link in timeseries transformer docs (#21602) · dcb5e011

Christopher Akiki authored Feb 13, 2023

[MINOR] Fix link

I'm not sure this will also fix the currently broken link in the docs (Specifically here: https://huggingface.co/docs/transformers/model_doc/time_series_transformer) whereby clicking on `kashif` attempts to link to the following non-existent URL: https://huggingface.co/docs/transformers/model_doc/%3Chttps://huggingface.co/kashif

dcb5e011

Remove trailing 'extractive' word from en documentation (#21594) · dd7429d6
Thomas Paviot authored Feb 13, 2023
```
remove trailing word
```
dd7429d6
CI: skip failing TF hubert test (#21601) · 4be75e97
Joao Gante authored Feb 13, 2023
```
skip test
```
4be75e97

Add: document question answering task guide (#21518) · 3baa407f

Maria Khalusova authored Feb 13, 2023



* document question answering guide

* Added the list of supported models

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* switched to AutoProcessor

* feedback addressed

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* Update docs/source/en/tasks/document_question_answering.mdx
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* more feedback addressed

* addressed comments about evaluation loss

* added appropriate image link

* make style

* typo fix

* resolving toc conflict

* fixed the image link

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

3baa407f

Generate: TF supports multiple eos tokens (#21571) · eb6c59bc
Joao Gante authored Feb 13, 2023

eb6c59bc

12 Feb, 2023 1 commit
- Fix quality on main (ruff release) · c836f772
  Sylvain Gugger authored Feb 11, 2023
  
  c836f772
10 Feb, 2023 5 commits

[`Blip2`] Add int8 support for `blip2-flan-t5-xxl` (#21574) · 75a208ef
Younes Belkada authored Feb 10, 2023
```
add int8 support
```
75a208ef

Remove more unused attributes in config classes (#21543) · b47a1674

Yih-Dar authored Feb 10, 2023



* Remove unused decoder_layerdrop

* Update SPECIAL_CASES_TO_ALLOW for MT5Config

* Remove unused position_embedding_init_scale

* Remove unused decoder_max_relative_position

* Use unused decoder_max_relative_position

* Remove unused init_std

* Remove unused forgotten attributes

* Remove unused patch_norm

* Remove unused max_seq_len

* Update SPECIAL_CASES_TO_ALLOW for OneFormerConfig

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

b47a1674

Added timesformer configuration (#21446) · 862e8e4f

Han Wu authored Feb 10, 2023



* Added timesformer configuration
Co-authored-by: JuheonChu <chuj@dickinson.edu>

* Create documentation_tests.txt

* Update documentation_tests.txt
Co-authored-by: JuheonChu <chuj@dickinson.edu>

* Delete documentation_tests.txt

Updates, Deleting "src/transformers/utils/documentation_tests.txt" file.
Co-authored-by: JuheonChu <chuj@dickinson.edu>

* Create documentation_tests.txt
Co-authored-by: JuheonChu <chuj@dickinson.edu>

* Delete documentation_tests.txt
Co-authored-by: JuheonChu <chuj@dickinson.edu>

---------
Co-authored-by: JuheonChu <chuj@dickinson.edu>

862e8e4f

Replace input_values_processing with unpack_inputs (#21502) · cb565901
amyeroberts authored Feb 10, 2023
```
* Replace input_values_prrocessing with unpack_inputs

* Skip test failing with OOM

* Update tests
```
cb565901
improving contributing tests section (#21569) · 55712563
Shubhamai authored Feb 10, 2023
```
* improving tests section

* documenting other  env variables
```
55712563