Commits · 1749841a0e9d803984985e08e4df177ac5a8b1a9 · chenpangpang / transformers

03 Jun, 2024 4 commits

[`GemmaModel`] fix small typo (#31202) · 1749841a
Arthur authored Jun 03, 2024
```
* fixes

* fix-copies
```
1749841a

Ahmed Moubtahij authored Jun 03, 2024



* token healing impl + trie with extensions

* make fixup

* prefix-robust space tokenization

* examples readme and requirements

* make fixup

* allow input prompt and model

* redundant defaults

* Specialized Trie

* make fixup

* updated tests with new inherited Tree

* input ids to auto device_map

* rm unused import

* Update src/transformers/generation/utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* naming convention

* Revert "naming convention"

This reverts commit dd39d9c5b7a969e2d8a8d2a8e54f121b82dc44f0.

* naming convention

* last -hopefully- changes

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

39b2ff69

Remove copied froms for deprecated models (#31153) · 5b5b48b1
amyeroberts authored Jun 03, 2024
```
* Remove copied froms for deprecated models

* Remove automatically in script
```
5b5b48b1

Fix typo: use_safetenstors to use_safetensors (#31184) · 97e5a707

CharlesCNorton authored Jun 03, 2024

Corrected a typo in security.md. Changed `use_safetenstors` to `use_safetensors` in the section discussing the usage of safe formats for loading models to prevent arbitrary code execution.

97e5a707

31 May, 2024 10 commits

Diff converter v2 (#30868) · 96eb0628

Arthur authored May 31, 2024

* current working example!

* commit regex and result file

* update

* nit

* push the conversion file

* oups

* roadmap and nits

* attempt diffs for 3 files

* persimmon

* nit

* add diff file that is the same as the modeling_llama.py

* fix rope nits

* updates

* updates with converted versions

* give some breathing space to the code

* delete

* update

* update

* push the actual result

* update regex patterns

* update regex patterns

* fix some issues

* fix some issues

* fix some issues

* updates

* updates

* updates

* updates

* updates

* revert changes done to llama

* updates

* update gemma

* updates

* oups

* current state

* current state

* update

* ouiiii

* nit

* clear diffs

* nit

* fixup

* update

* doc 🚀

* 🔥

* for now use gemma

* deal with comments

* style

* handle funtions

* deal with assigns

* todos

* process inheritage

*...

96eb0628

Added description of quantization_config (#31133) · 372baec2

Vallepu Vamsi Krishna authored May 31, 2024

* Description of quantization_config

Added missing description about quantization_config in replace_with_bnb_linear for better readability.

* Removed trailing spaces

372baec2

Instance segmentation examples (#31084) · cdc81311

Pavel Iakubovskii authored May 31, 2024



* Initial setup

* Metrics

* Overfit on two batches

* Train 40 epochs

* Memory leak debugging

* Trainer fine-tuning

* Draft

* Fixup

* Trained end-to-end

* Add requirements

* Rewrite evaluator

* nits

* Add readme

* Add instance-segmentation to the table

* Support void masks

* Remove sh

* Update docs

* Add pytorch test

* Add accelerate test

* Update examples/pytorch/instance-segmentation/README.md

* Update examples/pytorch/instance-segmentation/run_instance_segmentation.py

* Update examples/pytorch/instance-segmentation/run_instance_segmentation_no_trainer.py

* Update examples/pytorch/instance-segmentation/run_instance_segmentation_no_trainer.py

* Update examples/pytorch/instance-segmentation/run_instance_segmentation.py

* Fix consistency oneformer

* Fix imports

* Fix imports sort

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update examples/pytorch/instance-segmentation/run_instance_segmentation.py
Co-authored-by: Sangbum Daniel Choi <34004152+SangbumChoi@users.noreply.github.com>

* Add resources to docs

* Update examples/pytorch/instance-segmentation/README.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update examples/pytorch/instance-segmentation/README.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Remove explicit model_type argument

* Fix tests

* Update readme

* Note about other models

---------
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sangbum Daniel Choi <34004152+SangbumChoi@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

cdc81311

Add streaming, various fixes (#30838) · 9837a254

Aymeric Roucher authored May 31, 2024

* Implement streaming run in ReAct agents
* Allow additional imports in code agents
* Python interpreter: support classes and exceptions, fixes

9837a254

[trainer] add sanity evaluation option (#31146) · f8e6ba45

Marc Sun authored May 31, 2024



* add sanity evaluation

* fix

* Apply suggestions from code review
Co-authored-by: Zach Mueller <muellerzr@gmail.com>

* fix

---------
Co-authored-by: Zach Mueller <muellerzr@gmail.com>

f8e6ba45

Quantization: Enhance bnb error message (#31160) · fc5d3e11
Younes Belkada authored May 31, 2024
```
enhance error message
```
fc5d3e11

Update sam.md (#31130) · bd9d1ddf

Asif Ajrof authored May 31, 2024

`mask` variable is not defined. probably a writing mistake. it should be `segmentation_map`. `segmentation_map` should be a `1` channel image rather than `RGB`.
[on a different note, the `mask_url` is the same as `raw_image`. could provide a better example.

bd9d1ddf

Fix quantized cache output (#31143) · 48cada87
Marc Sun authored May 31, 2024

48cada87
pytest -rsfE (#31140) · d19566e8
Yih-Dar authored May 31, 2024
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
d19566e8

helper (#31152) · f3f640dc

Arthur authored May 31, 2024



* helper

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* updates

* more doc

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

f3f640dc

30 May, 2024 4 commits
- Workflow: Remove `IS_GITHUB_CI` (#31147) · 6bd511a4
  Younes Belkada authored May 30, 2024
```
remove `IS_GITHUB_CI`
```
  6bd511a4
- Docs / Quantization: Replace all occurences of `load_in_8bit` with bnb config (#31136) · f5590dea
  Younes Belkada authored May 30, 2024
```
Replace all occurences of `load_in_8bit` with bnb config
```
  f5590dea
- fix get_scheduler when name is warmup_stable_decay (#31128) · cda9c82a
  zspo authored May 30, 2024
```
fix get_scheduler args
```
  cda9c82a
- FIX / Quantization: Add extra validation for bnb config (#31135) · 5e5c4d62
  Younes Belkada authored May 30, 2024
```
add validation for bnb config
```
  5e5c4d62
29 May, 2024 12 commits

Cleanup docker build (#31119) · 2b9e252b

Yih-Dar authored May 29, 2024



* remove

* build

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

2b9e252b

Add on_optimizer_step to callback options (#31095) · 5c882535

Dhruv Pai authored May 29, 2024

* Modified test

* Added on_optimizer_step to callbacks

* Move callback after step is called

* Added on optimizer step callback

5c882535

Add VLM generation default contributor (#31115) · 4af705c6
Joao Gante authored May 29, 2024
```
* add Raushan

* add Raushan
```
4af705c6
FIX / Docs: Fix GPTQ expected number of bits (#31111) · cb879c58
Younes Belkada authored May 29, 2024
```
Update overview.md
```
cb879c58

Fix nightly circleci (#31114) · 1f841413

Yih-Dar authored May 29, 2024



* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

1f841413

Rm maintainer + migrate (#31089) · d16053c8
Zach Mueller authored May 29, 2024

d16053c8
Fix faulty rstrip in module loading (#31108) · 0bef4a27
Matt authored May 29, 2024

0bef4a27
Fix env.py in cases where torch is not present (#31113) · 97a58a5d
Matt authored May 29, 2024
```
* Fix env.py in cases where torch is not present

* Simplify the fix (and avoid some issues)
```
97a58a5d

Improve `transformers-cli env` reporting (#31003) · c8861376

Huazhong Ji authored May 29, 2024

* Improve `transformers-cli env` reporting

* move the line `"Using GPU in script?": "<fill in>"` to in if conditional
statement

* same option for npu

c8861376

Use `HF_HUB_OFFLINE` + fix has_file in offline mode (#31016) · c3044ec2

Lucain authored May 29, 2024

* Fix has_file in offline mode

* harmonize env variable for offline mode

* Switch to HF_HUB_OFFLINE

* fix test

* revert test_offline to test TRANSFORMERS_OFFLINE

* Add new offline test

* merge conflicts

* docs

c3044ec2

FEAT: Add mistral v3 conversion script (#30981) · bfe6f513

Younes Belkada authored May 29, 2024



* add mistral v3 conversion script

* Update src/transformers/models/mistral/convert_mistral_weights_to_hf.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fixup

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

bfe6f513

Quantized KV cache: update quanto (#31052) · d521ba57

Raushan Turganbay authored May 29, 2024



* quanto latest version was refactored

* add error msg

* incorrect compare sign

* Update src/transformers/cache_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

d521ba57

28 May, 2024 10 commits
- Deprecate low use models (#30781) · a564d10a
  amyeroberts authored May 28, 2024
```
* Deprecate models
- graphormer
- time_series_transformer
- xlm_prophetnet
- qdqbert
- nat
- ernie_m
- tvlt
- nezha
- mega
- jukebox
- vit_hybrid
- x_clip
- deta
- speech_to_text_2
- efficientformer
- realm
- gptsan_japanese

* Fix up

* Fix speech2text2 imports

* Make sure message isn't indented

* Fix docstrings

* Correctly map for deprecated models from model_type

* Uncomment out

* Add back time series transformer and x-clip

* Import fix and fix-up

* Fix up with updated ruff
```
  a564d10a
- Docs / Quantization: Redirect deleted page (#31063) · 7f08817b
  Younes Belkada authored May 28, 2024
```
Update _redirects.yml
```
  7f08817b
- TST: Fix instruct-blip tests (#31088) · 3264be41
  Younes Belkada authored May 28, 2024
```
* fix flan t5 tests

* better format
```
  3264be41
- Fix DeepSpeed compatibility with weight_norm (#30881) (#31018) · 476890e9
  Jonny Li authored May 28, 2024
  
  476890e9
- Fix PretrainedConfig docstring with deprecated resume_download (#31014) · aada568f
  Albert Villanova del Moral authored May 28, 2024
  
  aada568f
- skip `test_multi_gpu_data_parallel_forward` for `vit` and `deit` (#31086) · 3af7bf30
  Yih-Dar authored May 28, 2024
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  3af7bf30
- FIX / OPT: Fix OPT multi-GPU training for `OPTForQuestionAnswering` (#31092) · ab19f907
  Younes Belkada authored May 28, 2024
```
Update modeling_opt.py
```
  ab19f907
- FIX: Add `accelerate` as a hard requirement (#31090) · 94d416f0
  Younes Belkada authored May 28, 2024
```
add accelerate
```
  94d416f0
- Render chat template tojson filter as unicode (#31041) · 22dab246
  Sigbjørn Skjæret authored May 28, 2024
```
* Render chat template tojson filter as unicode

* ruff--
```
  22dab246
- Docs / PEFT: Add PEFT API documentation (#31078) · 4f98b144
  Younes Belkada authored May 28, 2024
```
* add peft references

* add peft references

* Update docs/source/en/peft.md

* Update docs/source/en/peft.md
```
  4f98b144