- 18 Apr, 2023 8 commits
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* initial work
* Add other classes
* Refactor code
* Move warning and fix dynamic pipeline
* Issue warning when necessary
* Add test
* Do not skip auto tests
* Fix failing tests
* Refactor and address review comments
* Address review comments
-
Zachary Mueller authored
-
Joao Gante authored
* working mvp
* remove breakpoint
* fix commit
* standardize outputs
* tmp commit
* tests almost ready
* tmp commit
* skip a few models
* Add streaming; Docs and examples
* document limitations
* PR commits
* Amy PR comments
-
Yih-Dar authored
* fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Gabriel Yang authored
docs: ko: fix anchor links for docs (auto_tutorial, training)

Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Na Yeon Han <nayeon2.han@gmail.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
-
Matthijs Hollemans authored
* wrong argument name
* append eos_token_id
* all tokenizers need mask and ctc_blank tokens
* remove reduction factor from feature extractor
* add proper TTS loss
* did shifting the wrong way around
* mask out padded portions
* remove logits again (don't really need it)
* fix unit tests
* fixup
* pad also returns the decoder attention mask, since that's useful to have
* clean up feature extractor logic
* pad can handle TTS task too
* remove stop_labels from loss calculation
* simplify logic
* fixup
* do -100 masking properly
* small STFT optimization (calculate mel filterbanks only once)
* replace torchaudio fbanks with audio_utils
* remove torchaudio dependency
* simplify & speed up the STFT
* don't serialize window and mel filters
* output cross attentions when generating speech
* add guided attention loss
* fix failing test
* Update src/transformers/models/speecht5/feature_extraction_speecht5.py
* Update src/transformers/models/speecht5/modeling_speecht5.py
* change type annotation of attention_mask to LongTensor
* extract loss into class
* remove unused frame_signal_scale argument
* use config object in loss class
* fix type annotations in doc comments
* change optional to just bool
* implement missing tokenizer method
* add deprecation warning
* Update src/transformers/models/speecht5/feature_extraction_speecht5.py
* Update src/transformers/models/speecht5/feature_extraction_speecht5.py
* add deprecation warning for stop_labels

Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
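The "mask out padded portions" and "do -100 masking properly" bullets refer to the common PyTorch convention of setting padded label positions to -100 so the loss skips them. A minimal pure-Python sketch of the idea (illustrative only, not the actual SpeechT5 loss code):

```python
IGNORE_INDEX = -100  # conventional "ignore this position" label value

def masked_mean_loss(per_position_losses, labels):
    """Average the loss only over positions whose label is not IGNORE_INDEX."""
    kept = [loss for loss, label in zip(per_position_losses, labels)
            if label != IGNORE_INDEX]
    return sum(kept) / len(kept) if kept else 0.0

# Last two positions are padding, so only the first two contribute:
print(masked_mean_loss([0.5, 1.5, 2.0, 3.0], [7, 3, -100, -100]))  # 1.0
```

In the real models this is what `CrossEntropyLoss(ignore_index=-100)` does internally; padded targets simply never contribute gradient.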
-
- 17 Apr, 2023 14 commits
-
Sylvain Gugger authored
* Mark auto models as important
* Annoying file with bad line endings
-
Zachary Mueller authored
* Use accelerate for device management
* Add accelerate to setup

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sylvain Gugger authored
Revert "Use code on the Hub from another repo (#22698)"

This reverts commit ea7b0a53.
-
Sylvain Gugger authored
* Simplify update metadata job
* Match more branch names
* Install all what is necessary
* Install all what is necessary
* Forgot the dev
* Install less stuff
* This syntax?
-
Zachary Mueller authored
Remove accelerate from tf
-
Kunhao ZHENG authored
fix-squeeze-tuple
-
Yih-Dar authored
* fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
* initial work
* Add other classes
* Refactor code
* Move warning and fix dynamic pipeline
* Issue warning when necessary
* Add test
-
Wonhyeong Seo authored
docs: ko: tasks/translation.mdx
-
Matt authored
-
fpgaminer authored
-
Jungnerd authored
fix: docs: ko: sagemaker anchors and `_toctree.yml`

Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Na Yeon Han <nayeon2.han@gmail.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
-
Na Yeon Han authored
docs: ko: translated `custom_models.mdx`

Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
-
Yih-Dar authored
* fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 15 Apr, 2023 1 commit
-
bcol authored
-
- 14 Apr, 2023 12 commits
-
oscar-garzon authored
-
amyeroberts authored
* Indexing fix - CLIP checkpoint conversion
* Fix up
-
Joao Gante authored
-
Mayank Agarwal authored
* Fix word_ids hyperlink
* Add suggested fix
-
Matt authored
* If EOS is None, don't add it to sequences
* If EOS is None, don't add it to sequences
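The guard described here is presumably the straightforward one: only append the EOS id when the tokenizer actually defines it. A hypothetical sketch of the pattern (names are illustrative, not the actual code):

```python
def append_eos(token_ids, eos_token_id=None):
    """Append EOS only when the tokenizer defines one; otherwise leave the sequence as-is."""
    if eos_token_id is None:
        return list(token_ids)
    return list(token_ids) + [eos_token_id]

print(append_eos([5, 8, 13], eos_token_id=2))  # [5, 8, 13, 2]
print(append_eos([5, 8, 13]))                  # [5, 8, 13]
```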
-
Sohyun Sim authored
* add ko preprocessing
* translate preprocessing.mdx to korean
* translate preprocessing.mdx
* Update preprocessing.mdx: fixed line 273 as follows: "Also, we recommend passing the `sampling_rate` argument to the feature extractor to better debug any silent errors that may occur."
* translate Image part
* translated preprocess.mdx
* Update docs/source/ko/preprocessing.mdx (apply review suggestions)
* fixed translation

Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
-
Yih-Dar authored
* fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Alexander Ljungberg authored
Fixed string format; better tokenizer message.

Before: `Saving a {tokenizer_class} to {tokenizer_path}`
After: `Saving a LlamaTokenizerFast to outdir.`
-
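The before/after in that message is the classic symptom of a missing `f` prefix: without it, Python prints the braces literally instead of interpolating. A minimal reproduction (variable values hypothetical):

```python
tokenizer_class = "LlamaTokenizerFast"
tokenizer_path = "outdir"

broken = "Saving a {tokenizer_class} to {tokenizer_path}"   # no f prefix: braces kept literally
fixed = f"Saving a {tokenizer_class} to {tokenizer_path}"   # f-string: values interpolated

print(broken)  # Saving a {tokenizer_class} to {tokenizer_path}
print(fixed)   # Saving a LlamaTokenizerFast to outdir.
```
-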
Joao Gante authored
-
Joao Gante authored
-
Sayak Paul authored
* add: tokenizer training script for TF TPU LM training.
* add: script for preparing the TFRecord shards.
* add: sequence of execution to readme.
* remove limit from the tfrecord shard name.
* Add initial train_model.py
* Add basic training arguments and model init
* Get up to the point of writing the data collator
* Pushing progress so far!
* Complete first draft of model training code
* feat: grouping of texts efficiently.
* Add proper masking collator and get training loop working
* fix: things.
* Read sample counts from filenames
* Read sample counts from filenames
* Draft README
* Improve TPU warning
* Use distribute instead of distribute.experimental
* Apply suggestions from code review
* Modularize loading and add MLM probability as arg
* minor refactoring to better use the cli args.
* readme fillup.
* include tpu and inference sections in the readme.
* table of contents.
* parallelize maps.
* polish readme.
* change script name to run_mlm.py
* address PR feedback (round I).

Co-authored-by: Matt <rocketknight1@gmail.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
-
Hyeonseo Yun authored
* docs: ko: init: tasks/sequence_classification.mdx
* docs: ko: revised: change voca in tasks/sequence_classification.mdx
* docs: ko: revised: [RE] change voca in tasks/sequence_classification.mdx
* docs: ko: revised: spell check and sentence naturally in tasks/sequence_classification.mdx
* docs: ko: revised: spell check and consistent vocabulary in tasks/sequence_classification.mdx
* docs: ko: revised: Add full stop and change voca in tasks/sequence_classification.mdx
* docs: ko: revised: sync first section templates in tasks/sequence_classification.mdx
* fix: revert use of full-stops to colons (colons are used to emphasize the code block that follows)
* @0525hhgus @wonhyeongseo docs: ko: revised: sync second section templates in tasks/sequence_classification.mdx
* docs: ko: revised: change 'train', 'finetuning' in tasks/sequence_classification.mdx

Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
-
- 13 Apr, 2023 5 commits
-
Yih-Dar authored
* fix
* style
* fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
-
Ruiyang Sun authored
Bug in LlamaTokenizer (#22742)
-
Stas Bekman authored
* [trainer] update url
* style
-