Commits · 807483edba45ee9707e55a36c572f8e2c3cd347e · chenpangpang / transformers

10 Jun, 2024 1 commit
- docs: fix style (#31340) · 807483ed
  谭九鼎 authored Jun 10, 2024
  
  807483ed
07 Jun, 2024 1 commit

Remove ConversationalPipeline and Conversation object (#31165) · 065729a6

Matt authored Jun 07, 2024

* Remove ConversationalPipeline and Conversation object, as they have been deprecated for some time and are due for removal

* Update not-doctested.txt

* Fix JA and ZH docs

* Fix JA and ZH docs some more

* Fix JA and ZH docs some more

065729a6

06 Jun, 2024 3 commits

Enable HF pretrained backbones (#31145) · bdf36dcd

amyeroberts authored Jun 06, 2024

* Enable load HF or tim backbone checkpoints

* Fix up

* Fix test - pass in proper out_indices

* Update docs

* Fix tvp tests

* Fix doc examples

* Fix doc examples

* Try to resolve DPT backbone param init

* Don't conditionally set to None

* Add condition based on whether backbone is defined

* Address review comments

bdf36dcd

Update text-to-speech.md (#31269) · a3d351c0
Jack Yang authored Jun 07, 2024
```
SpeechBrain usage has changed
```
a3d351c0
Switch from `cached_download` to `hf_hub_download` in remaining occurrences (#31284) · 9ef93fcc
Lucain authored Jun 06, 2024
```
Switch from hf_hub_url to hf_hub_download in remaining occurences
```
9ef93fcc

05 Jun, 2024 1 commit

doc: add info about wav2vec2 bert in older wav2vec2 models. (#31120) · 4a602492

Vaibhav Srivastav authored Jun 05, 2024



* doc: add info about wav2vec2 bert in older wav2vec2 models.

* apply suggestions from review.

* forward contrib credits from review

---------
Co-authored-by: Sanchit Gandhi <sanchit-gandhi@users.noreply.github.com>

4a602492

04 Jun, 2024 1 commit
- Blip: Deprecate `BlipModel` (#31235) · 485d913d
  Younes Belkada authored Jun 04, 2024
```
* deprecate blip

* mention deprecation on docs
```
  485d913d
03 Jun, 2024 3 commits

[docs] Spanish translation of tokenizer_summary.md (#31154) · c73ee133

Aaron Jimenez authored Jun 03, 2024

* add tokenizer_summary to es/_toctree.yml

* add tokenizer_summary to es/

* fix link to Transformes XL in en/

* translate until Subword tokenization section

* fix GPT link in en/

* fix other GPT link in en/

* fix typo in en/

* translate the doc

* run make fixup

* Remove .md in Transformer XL link

* fix some link issues in es/

* fix typo

c73ee133

Wrong translation FR : Contents = Contenu (#31186) · 98dd8423
Jade Choghari authored Jun 03, 2024
```
Update index.md - Contents = Contenu

French typo -
Contents = Contenu
```
98dd8423

Add Qwen2 GGUF loading support (#31175) · e4628434

Isotr0py authored Jun 03, 2024

* add qwen2 gguf support

* Update docs

* fix qwen2 tokenizer

* add qwen2 gguf test

* fix typo in qwen2 gguf test

* format code

* Remove mistral, clarify the error message

* format code

* add typing and update docstring

e4628434

31 May, 2024 3 commits

Instance segmentation examples (#31084) · cdc81311

Pavel Iakubovskii authored May 31, 2024



* Initial setup

* Metrics

* Overfit on two batches

* Train 40 epochs

* Memory leak debugging

* Trainer fine-tuning

* Draft

* Fixup

* Trained end-to-end

* Add requirements

* Rewrite evaluator

* nits

* Add readme

* Add instance-segmentation to the table

* Support void masks

* Remove sh

* Update docs

* Add pytorch test

* Add accelerate test

* Update examples/pytorch/instance-segmentation/README.md

* Update examples/pytorch/instance-segmentation/run_instance_segmentation.py

* Update examples/pytorch/instance-segmentation/run_instance_segmentation_no_trainer.py

* Update examples/pytorch/instance-segmentation/run_instance_segmentation_no_trainer.py

* Update examples/pytorch/instance-segmentation/run_instance_segmentation.py

* Fix consistency oneformer

* Fix imports

* Fix imports sort

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update examples/pytorch/instance-segmentation/run_instance_segmentation.py
Co-authored-by: Sangbum Daniel Choi <34004152+SangbumChoi@users.noreply.github.com>

* Add resources to docs

* Update examples/pytorch/instance-segmentation/README.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update examples/pytorch/instance-segmentation/README.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Remove explicit model_type argument

* Fix tests

* Update readme

* Note about other models

---------
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sangbum Daniel Choi <34004152+SangbumChoi@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

cdc81311

Add streaming, various fixes (#30838) · 9837a254

Aymeric Roucher authored May 31, 2024

* Implement streaming run in ReAct agents
* Allow additional imports in code agents
* Python interpreter: support classes and exceptions, fixes

9837a254

Update sam.md (#31130) · bd9d1ddf

Asif Ajrof authored May 31, 2024

`mask` variable is not defined. probably a writing mistake. it should be `segmentation_map`. `segmentation_map` should be a `1` channel image rather than `RGB`.
[on a different note, the `mask_url` is the same as `raw_image`. could provide a better example.

bd9d1ddf

30 May, 2024 1 commit
- Docs / Quantization: Replace all occurences of `load_in_8bit` with bnb config (#31136) · f5590dea
  Younes Belkada authored May 30, 2024
```
Replace all occurences of `load_in_8bit` with bnb config
```
  f5590dea
29 May, 2024 2 commits

FIX / Docs: Fix GPTQ expected number of bits (#31111) · cb879c58
Younes Belkada authored May 29, 2024
```
Update overview.md
```
cb879c58

Use `HF_HUB_OFFLINE` + fix has_file in offline mode (#31016) · c3044ec2

Lucain authored May 29, 2024

* Fix has_file in offline mode

* harmonize env variable for offline mode

* Switch to HF_HUB_OFFLINE

* fix test

* revert test_offline to test TRANSFORMERS_OFFLINE

* Add new offline test

* merge conflicts

* docs

c3044ec2

28 May, 2024 5 commits

Deprecate low use models (#30781) · a564d10a

amyeroberts authored May 28, 2024

* Deprecate models
- graphormer
- time_series_transformer
- xlm_prophetnet
- qdqbert
- nat
- ernie_m
- tvlt
- nezha
- mega
- jukebox
- vit_hybrid
- x_clip
- deta
- speech_to_text_2
- efficientformer
- realm
- gptsan_japanese

* Fix up

* Fix speech2text2 imports

* Make sure message isn't indented

* Fix docstrings

* Correctly map for deprecated models from model_type

* Uncomment out

* Add back time series transformer and x-clip

* Import fix and fix-up

* Fix up with updated ruff

a564d10a

Docs / Quantization: Redirect deleted page (#31063) · 7f08817b
Younes Belkada authored May 28, 2024
```
Update _redirects.yml
```
7f08817b

Docs / PEFT: Add PEFT API documentation (#31078) · 4f98b144

Younes Belkada authored May 28, 2024

* add peft references

* add peft references

* Update docs/source/en/peft.md

* Update docs/source/en/peft.md

4f98b144

[SuperPoint, PaliGemma] Update docs (#31025) · 90da0b1c
NielsRogge authored May 28, 2024
```
* Update docs

* Add PaliGemma resources

* Address comment

* Update docs
```
90da0b1c

Update quicktour.md to fix broken link to Glossary (#31072) · dd4654ea

AP authored May 28, 2024

Update quicktour.md to fix broken link

Missing '/' in attention mask link in the transformers quicktour

dd4654ea

27 May, 2024 2 commits
- Follow up: Fix link in dbrx.md (#30514) · 0a064dc0
  Eitan Turok authored May 27, 2024
```
* Fix link in dbrx.md

* remove "though this may not be up to date"

---------
Co-authored-by: Lysandre Debut <hi@lysand.re>
```
  0a064dc0
- Redirect transformers_agents doc to agents (#31054) · 84c4b72e
  Aymeric Roucher authored May 27, 2024
  
  84c4b72e
23 May, 2024 4 commits

[Port] TensorFlow implementation of Mistral (#29708) · 965e98dc

Aritra Roy Gosthipaty authored May 23, 2024



* chore: initial commit

* chore: adding imports and inits

* chore: adding the causal and classification code

* chore: adding names to the layers

* chore: using single self attn layer

* chore: built the model and layers

* chore: start with testing

* chore: docstring change, transpose fix

* fix: rotary embedding

* chore: adding cache implementation

* remove unused torch

* chore: fixing the indexing issue

* make fix-copies

* Use modeling_tf_utils.keras

* make fixup

* chore: fixing tests

* chore: adding past key value logic

* chore: adding multi label classfication test

* fix: switching on the built parameters in the layers

* fixing repo consistency

* ruff formats

* style changes

* fix: tf and pt equivalence

* removing returns from docstrings

* fix docstrings

* fix docstrings

* removing todos

* fix copies

* fix docstring

* fix docstring

* chore: using easier rotate_half

* adding integration tests

* chore: addressing review related to rotary embedding layer

* review changes

* [run-slow] mistral

* skip: test save load after resize token embedding

* style

---------
Co-authored-by: Matt <rocketknight1@gmail.com>

965e98dc

FIX / Docs: Minor changes in quantization docs (#30985) · 5a74ae6d

Younes Belkada authored May 23, 2024



* Change in quantization docs

* Update overview.md

* Update docs/source/en/quantization/overview.md
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

---------
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

5a74ae6d

Docs / Quantization: refactor quantization documentation (#30942) · 87a35181

Younes Belkada authored May 23, 2024



* refactor quant docs

* delete file

* rename to overview

* fix

* fix table

* fix

* add content

* fix library versions

* fix table

* fix table

* fix table

* fix table

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* replace to quantization_config

* fix aqlm snippet

* add DLAI courses

* fix

* fix table

* fix bulet points

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

87a35181

Quantized KV Cache (#30483) · d583f131

Raushan Turganbay authored May 23, 2024



* clean-up

* Update src/transformers/cache_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/cache_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/cache_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fixup

* Update tests/quantization/quanto_integration/test_quanto.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Update src/transformers/generation/configuration_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* more suggestions

* mapping if torch available

* run tests & add 'support_quantized' flag

* fix jamba test

* revert, will be fixed by another PR

* codestyle

* HQQ and versatile cache classes

* final update

* typo

* make tests happy

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

d583f131

22 May, 2024 3 commits

Update object detection with latest resize and pad strategies (#30955) · 15585b81

Pavel Iakubovskii authored May 22, 2024

* Update with new resizing and pad strategy

* Return pixel mask param

* Update inference in guide

* Fix empty compose

* Update guide

15585b81

[doc] Add references to the fine-tuning blog and distil-whisper to Whisper. (#30938) · 24d2a5e1
Vaibhav Srivastav authored May 22, 2024
```
[doc] Add references to the fine-tuning blog and distil-whisper to Whisper doc.
```
24d2a5e1

Update video-llava docs (#30935) · 934e1b84

Raushan Turganbay authored May 22, 2024



* update video-llava

* Update docs/source/en/model_doc/video_llava.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

934e1b84

21 May, 2024 2 commits

🚨 [Idefics2] Update ignore index (#30898) · 60bb571e
NielsRogge authored May 21, 2024
```
* Update ignore index

* Update docs

* Update docs
```
60bb571e

FEAT / Trainer: LOMO optimizer support (#30178) · 8871b261

Younes Belkada authored May 21, 2024



* add V1 - adalomo not working yet

* add todo docs + refactor from comments

* adjust LR

* add docs

* add more elaborated test

* Apply suggestions from code review
Co-authored-by: Zach Mueller <muellerzr@gmail.com>

* fix

* push

* add accelerate check

* fix DDP case

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* fix

* init kwargs

* safely add attribute

* revert to enum logic

* Update src/transformers/trainer.py

---------
Co-authored-by: Zach Mueller <muellerzr@gmail.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

8871b261

20 May, 2024 4 commits

[docs] Spanish translation of model_memory_anatomy.md (#30885) · 0df888ff

Aaron Jimenez authored May 20, 2024

* add model_memory_anatomy to es/_toctree.yml

* copy model_memory_anatomy.md to es/

* translate first section

* translate doc

* chage forward activations

* fix sentence and and link to Trainer

* fix Trainer link

0df888ff

Add torch.compile for Mistral (#30642) · 616bb11d

Longjie Zheng authored May 20, 2024

* first version

* fix sliding window

* fix style

* add sliding window cache

* fix style

* address comments

* fix test

* fix style

* move sliding window check inside cache init

* revert changes on irrelevant files & add comment on SlidingWindowCache

* address comments & fix style

fix style

* update causal mask

* [run-slow] mistral

* [run-slow] mistral

* [run-slow] mistral

* [run-slow] mistral

* [run-slow] mistral

* [run-slow] llama

* [run-slow] mistral

* [run-slow] mistral

* [run-slow] mistral

* revert CI from a10 to t4

* wrap up

616bb11d

LLaVa-Next: Update docs with batched inference (#30857) · 5d0bf59b

Raushan Turganbay authored May 20, 2024



* update docs with batch ex

* Update docs/source/en/model_doc/llava_next.md
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* accept nested list of img

---------
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

5d0bf59b

Add TokenClassification for Mistral, Mixtral and Qwen2 (#29878) · 07bf2dff

Joseph Enguehard authored May 20, 2024



* Add MistralForTokenClassification

* Add tests and docs

* Add token classification for Mixtral and Qwen2

* Save llma for token classification draft

* Add token classification support for Llama, Gemma, Persimmon, StableLm and StarCoder2

* Formatting

* Add token classification support for Qwen2Moe model

* Add dropout layer to each ForTokenClassification model

* Add copied from in tests

* Update src/transformers/models/llama/modeling_llama.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Propagate suggested changes

* Style

---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

07bf2dff

17 May, 2024 1 commit
- Fix dependencies for image classification example (#30842) · 977ce58a
  Jacky Lee authored May 17, 2024
```
* fix: missing dependencies

* fix: image classification dependencies
```
  977ce58a
16 May, 2024 3 commits

Docs: update example with assisted generation + sample (#30853) · f4014e75
Joao Gante authored May 16, 2024

f4014e75
Video-LLaVa: Fix docs (#30855) · 95b3c381
Raushan Turganbay authored May 16, 2024
```
fix model id in docs
```
95b3c381

[Idefics2] Improve docs, add resources (#30717) · 17cc71e1

NielsRogge authored May 16, 2024



* Add resources

* Address comment

* Address comments

* Update docs/source/en/model_doc/idefics2.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update figure

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

17cc71e1