- 17 Apr, 2023 5 commits
-
-
Matt authored
-
fpgaminer authored
-
Jungnerd authored
fix: docs: ko: sagemaker anchors and `_toctree.yml`
Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Na Yeon Han <nayeon2.han@gmail.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
-
Na Yeon Han authored
docs: ko: translated `custom_models.mdx`
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
-
Yih-Dar authored
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 15 Apr, 2023 1 commit
-
-
bcol authored
-
- 14 Apr, 2023 12 commits
-
-
oscar-garzon authored
-
amyeroberts authored
* Indexing fix - CLIP checkpoint conversion
* Fix up
-
Joao Gante authored
-
Mayank Agarwal authored
* Fix word_ids hyperlink
* Add suggested fix
-
Matt authored
* If EOS is None, don't add it to sequences
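The guard described in the commit above can be sketched as follows. This is a hedged illustration only: the helper name and signature are hypothetical and do not match the actual transformers internals.

```python
def append_eos(sequences, eos_token_id):
    """Append the EOS token id to each sequence.

    Hypothetical helper illustrating the fix: when no EOS token is
    defined (eos_token_id is None), the sequences are returned
    unchanged instead of having None appended to them.
    """
    if eos_token_id is None:  # no EOS defined: leave sequences as-is
        return sequences
    return [seq + [eos_token_id] for seq in sequences]
```

For example, `append_eos([[5, 6]], 2)` yields `[[5, 6, 2]]`, while `append_eos([[5, 6]], None)` returns `[[5, 6]]` untouched.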
-
Sohyun Sim authored
* add ko preprocessing
* translate preprocessing.mdx to korean
* translate preprocessing.mdx
* Update preprocessing.mdx: fixed line 273 as below: "Also, we recommend adding the `sampling_rate` argument to the feature extractor to better debug any silent errors that may occur."
* translate Image part
* translated preprocess.mdx
* Update docs/source/ko/preprocessing.mdx (repeated rounds of review suggestions)
* fixed translation
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
-
Yih-Dar authored
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Alexander Ljungberg authored
Fixed string format; better tokenizer message. Before: `Saving a {tokenizer_class} to {tokenizer_path}` After: `Saving a LlamaTokenizerFast to outdir.`
-
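The bug fixed in the commit above is a missing `f` prefix: without it, Python treats the braces as literal text instead of interpolating the variables. A minimal reproduction (the variable values are illustrative):

```python
tokenizer_class = "LlamaTokenizerFast"
tokenizer_path = "outdir"

# Before the fix: a plain string literal, so the braces are kept verbatim
before = "Saving a {tokenizer_class} to {tokenizer_path}"
# After the fix: the f-prefix makes Python substitute the variables
after = f"Saving a {tokenizer_class} to {tokenizer_path}"

print(before)  # Saving a {tokenizer_class} to {tokenizer_path}
print(after)   # Saving a LlamaTokenizerFast to outdir
```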
Joao Gante authored
-
Joao Gante authored
-
Sayak Paul authored
* add: tokenizer training script for TF TPU LM training.
* add: script for preparing the TFRecord shards.
* add: sequence of execution to readme.
* remove limit from the tfrecord shard name.
* Add initial train_model.py
* Add basic training arguments and model init
* Get up to the point of writing the data collator
* Pushing progress so far!
* Complete first draft of model training code
* feat: grouping of texts efficiently.
* Add proper masking collator and get training loop working
* fix: things.
* Read sample counts from filenames
* Draft README
* Improve TPU warning
* Use distribute instead of distribute.experimental
* Apply suggestions from code review
* Modularize loading and add MLM probability as arg
* minor refactoring to better use the cli args.
* readme fillup.
* include tpu and inference sections in the readme.
* table of contents.
* parallelize maps.
* polish readme.
* change script name to run_mlm.py
* address PR feedback (round I).
Co-authored-by: Matt <rocketknight1@gmail.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
-
Hyeonseo Yun authored
* docs: ko: init: tasks/sequence_classification.mdx
* docs: ko: revised: change voca in tasks/sequence_classification.mdx
* docs: ko: revised: [RE] change voca in tasks/sequence_classification.mdx
* docs: ko: revised: spell check and sentence naturally in tasks/sequence_classification.mdx
* docs: ko: revised: spell check and consistent vocabulary in tasks/sequence_classification.mdx
* docs: ko: revised: Add full stop and change voca in tasks/sequence_classification.mdx
* docs: ko: revised: sync first section templates in tasks/sequence_classification.mdx
* fix: revert use of full-stops to colons (colons are used to emphasize the code block that follows)
* docs: ko: revised: sync second section templates in tasks/sequence_classification.mdx
* docs: ko: revised: change 'train', 'finetuning' in tasks/sequence_classification.mdx
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
-
- 13 Apr, 2023 15 commits
-
-
Yih-Dar authored
* fix
* style
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
-
Ruiyang Sun authored
Bug in LlamaTokenizer when #22742
-
Stas Bekman authored
* [trainer] update url
* style
-
Yih-Dar authored
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Gabriel Yang authored
translate training doc to Korean
-
Yih-Dar authored
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
NielsRogge authored
* Add model to doc tests
* Remove generate and replace by prepare_inputs_for_generation
* More fixes
* Remove print statements
* Update integration tests
* Fix generate
* Remove model from auto mapping
* Use auto processor
* Fix integration tests
* Fix test
* Add inference code snippet
* Remove is_encoder_decoder
* Update docs
* Remove notebook link
-
Rinat authored
* Update modeling_vilt.py: ViLT compatible with model parallelism
* Update modeling_switch_transformers.py: switch_transformers compatible with model parallelism
-
Joel Lamy-Poirier authored
Fix indexing
-
Elabonga Atuo authored
* added configuration file for mvp model
* added configuration_mvp.py line to file
-
Elabonga Atuo authored
m2m-100-config for doctest
-
Sylvain Gugger authored
-
- 12 Apr, 2023 7 commits
-
-
Matt authored
* Fix docstrings for TFBLIP
* Fix missing line in TF port!
* Use values from torch tests now other bugs fixed
* Fix doctest string
-
NielsRogge authored
* Use different level
* Remove futurewarning
* Use warning_once
* Update copies
-
Arthur authored
* add fast support and option
* update based on review
* Apply suggestions from code review
* Update src/transformers/models/llama/convert_llama_weights_to_hf.py
* nit
* add print
* fixup
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Michael Benayoun authored
`torch.distributed` group initialization for `torch_neuron` disabled when `optimum-neuron` is installed (#22728)
* Make the process group initialization not happen if optimum_neuron is installed
* Add warning
* Remove list and added warning
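A minimal sketch of that kind of install-detection guard, using `importlib.util.find_spec`. The function name is hypothetical and the actual check in transformers may differ; this only illustrates the pattern of skipping initialization when another package is present.

```python
import importlib.util
import warnings


def should_init_process_group() -> bool:
    """Return False (with a warning) when optimum-neuron is installed.

    Sketch of the guard described in the commit: on torch_neuron setups
    where optimum-neuron is present, the library skips its own
    torch.distributed process group initialization and defers to
    optimum-neuron instead.
    """
    # Check the parent package first so find_spec does not raise
    # ModuleNotFoundError when the parent itself is missing.
    if (importlib.util.find_spec("optimum") is not None
            and importlib.util.find_spec("optimum.neuron") is not None):
        warnings.warn("optimum-neuron detected; skipping process group init.")
        return False
    return True
```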
-
Stas Bekman authored
-
ARKA1112 authored
`generator(model="openai/whisper-large")` always returns an error. As the error says, the generator expects an input, just like the `.flac` file above. The generator object also has no parameter called `model`: parameters such as `batch_size` can be passed when calling the instance, but the model has to be specified when instantiating the pipeline, not as an argument to the call. The correct form is: `generator = pipeline(model="openai/whisper-large", device=0)`
-
Younes Belkada authored
* make serialization of int8 models possible
* make fixup
* add docs
* add ability to push to hub and save pretrained
* fixes
* more addition
* more tests
* fix issues
* change variable
* clearer message
* adapt from suggestions
* few fixes
* remove unused function
* Update src/transformers/utils/quantization_config.py
* address last comments
* last warning
* clarify doc
* protect import
* Update src/transformers/modeling_utils.py
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-