- 15 Jan, 2024 7 commits
-
-
Marc Sun authored
* fix test * reduce length * smaller model
-
thedamnedrhino authored
* added args to the pipeline
* added test
* more sensible tests
* fixup
* docs
* typo ;
* docs
* made changes to support named args
* fixed test
* docs update
* styles
* docs
* docs
-
yuanwu2017 authored
* Add the XPU check for pipeline mode. When setting an XPU device for a pipeline, use is_torch_xpu_available to load IPEX and determine whether the device is available. Signed-off-by: yuanwu <yuan.wu@intel.com>
* Don't move the model to the device when hf_device_map isn't None: 1. Don't move the model to the device when hf_device_map is not None 2. The device string may include the device index, so use 'in' instead of equality. Signed-off-by: yuanwu <yuan.wu@intel.com>
* Raise an error when XPU is not available. Signed-off-by: yuanwu <yuan.wu@intel.com>
* Update src/transformers/pipelines/base.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update src/transformers/pipelines/base.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Modify the error message. Signed-off-by: yuanwu <yuan.wu@intel.com>
* Change the message format. Signed-off-by: yuanwu <yuan.wu@intel.com>
--------- Signed-off-by: yuanwu <yuan.wu@intel.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
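A minimal sketch of the device check this change enforces, assuming an Intel GPU environment with IPEX installed; the task and checkpoint below are placeholders, not taken from the PR:

```python
from transformers import pipeline
from transformers.utils import is_torch_xpu_available

# Verify that torch XPU support (via IPEX) is actually available before
# asking the pipeline to place the model on an Intel GPU.
if is_torch_xpu_available():
    pipe = pipeline(
        "text-classification",
        model="distilbert-base-uncased-finetuned-sst-2-english",  # placeholder checkpoint
        device="xpu",
    )
    print(pipe("The XPU backend is available."))
else:
    raise RuntimeError("An XPU device was requested but torch XPU support is not available.")
```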
-
Younes Belkada authored
* v1 tags
* remove unneeded conversion
* v2
* rm unneeded warning
* add more utility methods
* Update src/transformers/utils/hub.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update src/transformers/utils/hub.py Co-authored-by: Lucain <lucainp@gmail.com>
* Update src/transformers/utils/hub.py Co-authored-by: Lucain <lucainp@gmail.com>
* more enhancements
* oops
* merge tags
* clean up
* revert unneeded change
* add extensive docs
* more docs
* more kwargs
* add test
* oops
* fix test
* Update src/transformers/modeling_utils.py Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
* Update src/transformers/utils/hub.py Co-authored-by: Lucain <lucainp@gmail.com>
* Update src/transformers/modeling_utils.py
* Update src/transformers/trainer.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update src/transformers/modeling_utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* add more conditions
* more logic
--------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by: Lucain <lucainp@gmail.com> Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
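The tagging utilities added here surface on models roughly as sketched below; the method name `add_model_tags`, the tag values, and the push behaviour are assumptions based on the public API in later releases, not a quote of this PR's code:

```python
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("gpt2")

# Attach custom tags to the model instance; they are written into the model
# card metadata when the model is pushed to the Hub (assumed behaviour).
model.add_model_tags(["text-generation", "my-custom-tag"])
# model.push_to_hub("my-user/my-tagged-model")  # would upload with the tags above
```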
-
Yih-Dar authored
* fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Tom Aarsen authored
Update warning, a word was missing
-
Francisco Kurucz authored
Fix URL to AI Sweden Models reference and model loading
-
- 13 Jan, 2024 2 commits
-
-
Joao Gante authored
* fix candidate device * this line shouldn't have been included
-
Apoorv Saxena authored
* MVP
* fix ci
* more ci
* remove redundant kwarg
* added and wired up PromptLookupCandidateGenerator
* rebased with main, working
* removed print
* style fixes
* fix test
* fixed tests
* added test for prompt lookup decoding
* fixed circleci
* fixed test issue
* Update src/transformers/generation/candidate_generator.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Update src/transformers/generation/candidate_generator.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Update src/transformers/generation/candidate_generator.py
* Update src/transformers/generation/candidate_generator.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
--------- Co-authored-by: Joao Gante <joao@huggingface.co> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
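A short, hedged example of how prompt lookup decoding is switched on from `generate`; the checkpoint and the value of `prompt_lookup_num_tokens` are illustrative only:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

prompt = "The quick brown fox jumps over the lazy dog. The quick brown"
inputs = tokenizer(prompt, return_tensors="pt")

# prompt_lookup_num_tokens activates the PromptLookupCandidateGenerator:
# candidate continuations are copied from n-grams already present in the
# prompt instead of being proposed by a separate assistant model.
outputs = model.generate(**inputs, prompt_lookup_num_tokens=10, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```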
-
- 12 Jan, 2024 12 commits
-
-
Siddartha Naidu authored
-
Matt authored
* Fix TF Regnet docstring * Fix TF Regnet docstring * Make a change to the PyTorch Regnet too to make sure the CI is checking it * Add skips for TFRegnet * Update error message for docstring checker
-
Joao Gante authored
-
Joao Gante authored
-
Joao Gante authored
-
Joao Gante authored
-
sungho-ham authored
Fix xlnet torch.ones usage Co-authored-by: sungho-ham <sungho.ham@linecorp.com>
-
dependabot[bot] authored
Bump jinja2 in /examples/research_projects/decision_transformer
Bumps [jinja2](https://github.com/pallets/jinja) from 2.11.3 to 3.1.3.
- [Release notes](https://github.com/pallets/jinja/releases)
- [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst)
- [Commits](https://github.com/pallets/jinja/compare/2.11.3...3.1.3)
---
updated-dependencies:
- dependency-name: jinja2
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
Younes Belkada authored
* add mixtral fused modules
* add changes from modeling utils
* add test
* fix test + rope theta issue
* Update src/transformers/modeling_utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* add tests
--------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
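A hedged sketch of how fused modules are requested through `AwqConfig`; it assumes an AWQ-quantized Mixtral checkpoint, a CUDA GPU, and `autoawq` installed, and the repo id is a placeholder:

```python
from transformers import AutoModelForCausalLM, AwqConfig

# do_fuse enables the fused attention/MLP modules; fuse_max_seq_len bounds the
# sequence length the fused kernels cache for.
quantization_config = AwqConfig(do_fuse=True, fuse_max_seq_len=512)

model = AutoModelForCausalLM.from_pretrained(
    "TheBloke/Mixtral-8x7B-Instruct-v0.1-AWQ",  # placeholder AWQ checkpoint
    quantization_config=quantization_config,
    device_map="auto",
)
```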
-
amyeroberts authored
* Update metadata loading for oneformer * Enable loading from a model repo * Update docstrings * Fix tests * Update tests * Clarify repo_path behaviour
-
amyeroberts authored
* Mark two logger tests as flaky * Add description to is_flaky
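Roughly how the decorator is used once it accepts a description; the test class and the reason string below are made up for illustration:

```python
import unittest

from transformers.testing_utils import is_flaky


class LoggingTests(unittest.TestCase):
    # Retries the test a few times before reporting a failure; the description
    # argument records why the test is considered flaky.
    @is_flaky(description="logger capture occasionally races with other handlers on CI")
    def test_logging_capture(self):
        self.assertTrue(True)
```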
-
Younes Belkada authored
* add llava + fused modules
* Update src/transformers/models/llava/modeling_llava.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
--------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
- 11 Jan, 2024 14 commits
-
-
Hankyeol Kyung authored
* [docs] Fix broken link Signed-off-by: Hankyeol Kyung <kghnkl0103@gmail.com>
* [docs] Use shorter domain Signed-off-by: Hankyeol Kyung <kghnkl0103@gmail.com>
--------- Signed-off-by: Hankyeol Kyung <kghnkl0103@gmail.com>
-
Matt authored
-
jiqing-feng authored
* update version for cpu training * update docs for cpu training * fix readme * fix readme
-
Yih-Dar authored
* fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
liangxuZhang authored
* Correct the implementation of the auxiliary loss of Mixtral * Correct the implementation of the auxiliary loss of Mixtral * Implement a simpler calculation method --------- Co-authored-by: zhangliangxu3 <zhangliangxu3@jd.com>
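For context, the auxiliary term in question is the Switch-Transformer-style load-balancing loss. A minimal sketch of that computation follows; shapes, the `top_k` default, and the exact normalisation are assumptions and need not match the final library code:

```python
import torch
import torch.nn.functional as F


def load_balancing_loss(router_logits: torch.Tensor, num_experts: int, top_k: int = 2) -> torch.Tensor:
    """router_logits: (num_tokens, num_experts). Encourages the router to spread
    tokens evenly across experts by penalising correlated load and probability."""
    routing_probs = F.softmax(router_logits, dim=-1)        # (tokens, experts)
    _, selected = torch.topk(routing_probs, top_k, dim=-1)  # (tokens, top_k)
    expert_mask = F.one_hot(selected, num_experts).float()  # (tokens, top_k, experts)

    tokens_per_expert = expert_mask.mean(dim=(0, 1))        # fraction of routing slots per expert
    router_prob_per_expert = routing_probs.mean(dim=0)      # mean router probability per expert

    return num_experts * torch.sum(tokens_per_expert * router_prob_per_expert)
```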
-
Matt authored
* Correctly resolve trust_remote_code=None for AutoTokenizer * Second attempt at a proper resolution
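A hedged usage example of the behaviour being fixed; the repo id is a placeholder for any repository that ships custom tokenizer code:

```python
from transformers import AutoTokenizer

# When a repo requires custom code, trust_remote_code has to be resolved
# explicitly; passing True opts in, while leaving it unset (None) should lead
# to a prompt or an error rather than silently executing remote code.
tokenizer = AutoTokenizer.from_pretrained("some-org/model-with-custom-code", trust_remote_code=True)
```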
-
Gustavo de Rosa authored
* chore(phi): Updates configuration_phi with missing keys.
* chore(phi): Adds first draft of combined modeling_phi.
* fix(phi): Fixes according to latest review.
* fix(phi): Removes pad_vocab_size_multiple to prevent inconsistencies.
* fix(phi): Fixes unit and integration tests.
* fix(phi): Ensures that everything works with microsoft/phi-1 for first integration.
* fix(phi): Fixes output of docstring generation.
* fix(phi): Fixes according to latest review.
* fix(phi): Fixes according to latest review.
* fix(tests): Re-enables Phi-1.5 test.
* fix(phi): Fixes attention overflow on PhiAttention (for Phi-2).
* fix(phi): Improves how queries and keys are upcast.
* fix(phi): Small updates on latest changes.
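The overflow fix mentioned for PhiAttention follows the common pattern of upcasting queries and keys before the softmax; a generic sketch of that pattern (not the actual modeling_phi code) is:

```python
import torch


def attention_probs_fp32(query: torch.Tensor, key: torch.Tensor, scale: float) -> torch.Tensor:
    # Upcast to float32 before the matmul and softmax so large logits do not
    # overflow in float16/bfloat16, then cast back to the input dtype.
    attn_weights = torch.matmul(query.to(torch.float32), key.to(torch.float32).transpose(-2, -1)) * scale
    attn_probs = torch.softmax(attn_weights, dim=-1)
    return attn_probs.to(query.dtype)
```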
-
Harisankar Babu authored
* optionally preprocess segmentation maps for mobilevit
* changed pretrained model name to that of segmentation model
* removed voc-deeplabv3 from model archive list
* added preprocess_image and preprocess_mask methods for processing images and segmentation masks respectively
* added tests for segmentation masks based on segformer feature extractor
* use crop_size instead of size
* reverting to initial model
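A hedged sketch of the new optional segmentation-map preprocessing; the checkpoint and dummy inputs are placeholders, and the exact returned keys may differ:

```python
import numpy as np
from PIL import Image
from transformers import MobileViTImageProcessor

image_processor = MobileViTImageProcessor.from_pretrained("apple/deeplabv3-mobilevit-small")

image = Image.new("RGB", (512, 512))                     # placeholder image
segmentation_map = np.zeros((512, 512), dtype=np.uint8)  # placeholder mask

# The segmentation map is now optionally preprocessed alongside the image.
inputs = image_processor(images=image, segmentation_maps=segmentation_map, return_tensors="pt")
print({name: tensor.shape for name, tensor in inputs.items()})
```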
-
Alex Hedges authored
While using `run_clm.py`,[^1] I noticed that some files were being added to my global cache, not the local cache. I set the `cache_dir` parameter for the one call to `evaluate.load()`, which partially solved the problem. I figured that while I was fixing the one script upstream, I might as well fix the problem in all other example scripts that I could. There are still some files being added to my global cache, but this appears to be a bug in `evaluate` itself. This commit at least moves some of the files into the local cache, which is better than before.

To create this PR, I made the following regex-based transformation: `evaluate\.load\((.*?)\)` -> `evaluate\.load\($1, cache_dir=model_args.cache_dir\)`. After using that, I manually fixed all modified files with `ruff` serving as useful guidance. During the process, I removed one existing usage of the `cache_dir` parameter in a script that did not have a corresponding `--cache-dir` argument declared.

[^1]: I specifically used `pytorch/language-modeling/run_clm.py` from v4.34.1 of the library. For the original code, see the following URL: https://github.com/huggingface/transformers/tree/acc394c4f5e1283c19783581790b3dc3105a3697/examples/pytorch/language-modeling/run_clm.py.
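The transformation described above boils down to threading a cache directory through each metric load; a minimal sketch, with a local path standing in for the scripts' `model_args.cache_dir`:

```python
import evaluate

cache_dir = "./hf_cache"  # stands in for model_args.cache_dir in the example scripts

# Before: evaluate.load("accuracy") placed metric artifacts in the global HF cache.
# After: passing cache_dir keeps them in the run-local cache directory instead.
metric = evaluate.load("accuracy", cache_dir=cache_dir)
print(metric.compute(predictions=[0, 1, 1], references=[0, 1, 0]))
```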
-
Yih-Dar authored
fix docker file Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
update Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
ikkvix authored
* change truncate_sequences
* Update tokenization_utils_base.py
* change format
* fix when ids_to_move=0
* fix
* Update src/transformers/tokenization_utils_base.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
--------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
amyeroberts authored
Enable multi-label image classification
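A hedged example of what this enables: configuring an image classifier for independent (multi-hot) labels; the backbone checkpoint and label count are placeholders:

```python
import torch
from transformers import AutoModelForImageClassification

# problem_type="multi_label_classification" switches the loss to
# BCEWithLogitsLoss so each label is predicted independently.
model = AutoModelForImageClassification.from_pretrained(
    "google/vit-base-patch16-224-in21k",  # placeholder backbone
    num_labels=3,
    problem_type="multi_label_classification",
)

pixel_values = torch.randn(1, 3, 224, 224)
labels = torch.tensor([[1.0, 0.0, 1.0]])  # multi-hot targets
outputs = model(pixel_values=pixel_values, labels=labels)
print(outputs.loss)
```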
-
jiqing-feng authored
* Assistant model may be on a different device * fix tensor device
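A hedged illustration of the scenario being fixed: assisted generation where the draft model lives on a different device than the main model; the checkpoints are placeholders:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

device = "cuda:0" if torch.cuda.is_available() else "cpu"

tokenizer = AutoTokenizer.from_pretrained("gpt2")
main_model = AutoModelForCausalLM.from_pretrained("gpt2-medium").to(device)
assistant = AutoModelForCausalLM.from_pretrained("gpt2")  # left on CPU, i.e. possibly a different device

inputs = tokenizer("Assisted generation works even when", return_tensors="pt").to(device)

# With the fix, candidate ids produced by the assistant are moved to the main
# model's device before verification.
outputs = main_model.generate(**inputs, assistant_model=assistant, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```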
-
- 10 Jan, 2024 5 commits
-
-
Patrick von Platen authored
* [Whisper] Fix slow test * update * update * update * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sparty authored
* Remove ErnieConfig, ErnieMConfig check_docstrings * Run fix_and_overwrite for ErnieConfig, ErnieMConfig * Replace <fill_type> and <fill_docstring> in configuration_ernie, configuration_ernie_m.py with type and docstring values --------- Co-authored-by: vignesh-raghunathan <vignesh_raghunathan@intuit.com>
-
Francisco Kurucz authored
-
Timothy Blattner authored
* Changed logic for renaming the staging directory when saving a checkpoint to only operate on the main process. Added fsync functionality to attempt to flush the write changes in case os.rename is not atomic.
* Updated styling using make fixup
* Updated check for main process to use built-in versions from trainer Co-authored-by: Zach Mueller <muellerzr@gmail.com>
* Fixed incorrect usage of trainer main process checks. Added "with open" usage to ensure better file closing as suggested in the PR. Added rotate_checkpoints into main process logic
* Removed "with open" since it does not work with a directory; os.open seems to work for directories.
--------- Co-authored-by: Zach Mueller <muellerzr@gmail.com>
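A POSIX-only sketch of the rename-plus-fsync idea described above; the names and structure are illustrative, not the Trainer's actual code:

```python
import os


def finalize_checkpoint(staging_dir: str, final_dir: str, is_main_process: bool) -> None:
    """Rename the staging directory to its final name on the main process only,
    then fsync the parent directory since os.rename alone may not be durable."""
    if not is_main_process:
        return
    os.rename(staging_dir, final_dir)
    parent = os.path.dirname(os.path.abspath(final_dir)) or "."
    # os.open works on directories (unlike open()); fsync flushes the rename.
    parent_fd = os.open(parent, os.O_RDONLY)
    try:
        os.fsync(parent_fd)
    finally:
        os.close(parent_fd)
```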
-
Susnato Dhar authored
* update docs * added Tip
-