Commits · ff86bc364d20a4e6e3c0d954485eab959c803394 · chenpangpang / transformers

15 Jan, 2024 12 commits

improve dev setup comments and hints (#28495) · ff86bc36
Timothy Cronin authored Jan 15, 2024
```
* improve dev setup comments and hints

* fix tests for new dev setup hints
```
ff86bc36
fix: sampling in flax keeps EOS (#28378) · 735968b6
Boris Dayma authored Jan 15, 2024

735968b6
Generate: consolidate output classes (#28494) · 7e0ddf89
Joao Gante authored Jan 15, 2024

7e0ddf89
Add a use_safetensors arg to TFPreTrainedModel.from_pretrained() (#28511) · 72db39c0
Matt authored Jan 15, 2024
```
* Add a use_safetensors arg to TFPreTrainedModel.from_pretrained()

* One more catch!

* One more one more catch
```
72db39c0
Fixed minor typos (#28489) · 78d767e3
Rishit Ratna authored Jan 15, 2024

78d767e3
[GPTQ] Fix test (#28018) · 7c8dd88d
Marc Sun authored Jan 15, 2024
```
* fix test

* reduce length

* smaller model
```
7c8dd88d

Tokenizer kwargs in textgeneration pipe (#28362) · 366c0327

thedamnedrhino authored Jan 15, 2024

* added args to the pipeline

* added test

* more sensical tests

* fixup

* docs

* typo
;

* docs

* made changes to support named args

* fixed test

* docs update

* styles

* docs

* docs

366c0327

Add the XPU device check for pipeline mode (#28326) · a573ac74

yuanwu2017 authored Jan 15, 2024



* Add the XPU check for pipeline mode

When setting xpu device for pipeline, It needs to use is_torch_xpu_available to load ipex and determine whether the device is available.
Signed-off-by: yuanwu <yuan.wu@intel.com>

* Don't move model to device when hf_device_map isn't None

1. Don't move model to device when hf_device_map is not None
2. The device string maybe includes the device index, so use 'in'instead of equal
Signed-off-by: yuanwu <yuan.wu@intel.com>

* Raise the error when xpu is not available
Signed-off-by: yuanwu <yuan.wu@intel.com>

* Update src/transformers/pipelines/base.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/pipelines/base.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Modify the error message
Signed-off-by: yuanwu <yuan.wu@intel.com>

* Change message format.
Signed-off-by: yuanwu <yuan.wu@intel.com>

---------
Signed-off-by: yuanwu <yuan.wu@intel.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

a573ac74

[`core`/ FEAT] Add the possibility to push custom tags using `PreTrainedModel` itself (#28405) · 1b9a2e4c

Younes Belkada authored Jan 15, 2024



* v1 tags

* remove unneeded conversion

* v2

* rm unneeded warning

* add more utility methods

* Update src/transformers/utils/hub.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/utils/hub.py
Co-authored-by: Lucain <lucainp@gmail.com>

* Update src/transformers/utils/hub.py
Co-authored-by: Lucain <lucainp@gmail.com>

* more enhancements

* oops

* merge tags

* clean up

* revert unneeded change

* add extensive docs

* more docs

* more kwargs

* add test

* oops

* fix test

* Update src/transformers/modeling_utils.py
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

* Update src/transformers/utils/hub.py
Co-authored-by: Lucain <lucainp@gmail.com>

* Update src/transformers/modeling_utils.py

* Update src/transformers/trainer.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/modeling_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* add more conditions

* more logic

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Lucain <lucainp@gmail.com>
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

1b9a2e4c

Don't set `finetuned_from` if it is a local path (#28482) · 64bdbd88
Yih-Dar authored Jan 15, 2024
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
64bdbd88
[`chore`] Update warning text, a word was missing (#28017) · 881e966a
Tom Aarsen authored Jan 15, 2024
```
Update warning, a word was missing
```
881e966a
Fix paths to AI Sweden Models reference and model loading (#28423) · 121641ca
Francisco Kurucz authored Jan 15, 2024
```
Fix URL to Ai Sweden Models reference and model loading
```
121641ca

13 Jan, 2024 2 commits

Generate: fix candidate device placement (#28493) · bc72b4e2
Joao Gante authored Jan 13, 2024
```
* fix candidate device

* this line shouldn't have been in
```
bc72b4e2

Adding Prompt lookup decoding (#27775) · e304f976

Apoorv Saxena authored Jan 13, 2024



* MVP

* fix ci

* more ci

* remove redundant kwarg

* added and wired up PromptLookupCandidateGenerator

* rebased with main, working

* removed print

* style fixes

* fix test

* fixed tests

* added test for prompt lookup decoding

* fixed circleci

* fixed test issue

* Update src/transformers/generation/candidate_generator.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/generation/candidate_generator.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/generation/candidate_generator.py

* Update src/transformers/generation/candidate_generator.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------
Co-authored-by: Joao Gante <joao@huggingface.co>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

e304f976

12 Jan, 2024 12 commits

Change progress logging to once across all nodes (#28373) · 29a2b142
Siddartha Naidu authored Jan 12, 2024

29a2b142

Fix docstrings and update docstring checker error message (#28460) · 2382706a

Matt authored Jan 12, 2024

* Fix TF Regnet docstring

* Fix TF Regnet docstring

* Make a change to the PyTorch Regnet too to make sure the CI is checking it

* Add skips for TFRegnet

* Update error message for docstring checker

2382706a

TF: purge `TFTrainer` (#28483) · 4fb3d3a0
Joao Gante authored Jan 12, 2024

4fb3d3a0
Generate: refuse to save bad generation config files (#28477) · afc45b13
Joao Gante authored Jan 12, 2024

afc45b13
Docs: add model paths (#28475) · dc01cf9c
Joao Gante authored Jan 12, 2024

dc01cf9c
Generate: deprecate old public functions (#28478) · d0264988
Joao Gante authored Jan 12, 2024

d0264988
Fix torch.ones usage in xlnet (#28471) · edb314ae
sungho-ham authored Jan 12, 2024
```
Fix xlnet torch.ones usage
Co-authored-by: sungho-ham <sungho.ham@linecorp.com>
```
edb314ae

Bump jinja2 from 2.11.3 to 3.1.3 in /examples/research_projects/decision_transformer (#28457) · c45ef1c0

dependabot[bot] authored Jan 12, 2024

Bump jinja2 in /examples/research_projects/decision_transformer

Bumps [jinja2](https://github.com/pallets/jinja) from 2.11.3 to 3.1.3.
- [Release notes](https://github.com/pallets/jinja/releases)
- [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst)
- [Commits](https://github.com/pallets/jinja/compare/2.11.3...3.1.3

)

---
updated-dependencies:
- dependency-name: jinja2
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

c45ef1c0

[`Mixtral` / `Awq`] Add mixtral fused modules for Awq (#28240) · 266c67b0

Younes Belkada authored Jan 12, 2024



* add mixtral fused modules

* add changes from modeling utils

* add test

* fix test + rope theta issue

* Update src/transformers/modeling_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* add tests

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

266c67b0

Update metadata loading for oneformer (#28398) · 666a6f07

amyeroberts authored Jan 12, 2024

* Update meatdata loading for oneformer

* Enable loading from a model repo

* Update docstrings

* Fix tests

* Update tests

* Clarify repo_path behaviour

666a6f07

Mark two logger tests as flaky (#28458) · 4e36a6cd
amyeroberts authored Jan 12, 2024
```
* Mark two logger tests as flaky

* Add description to is_flaky
```
4e36a6cd

[`Awq`] Add llava fused modules support (#28239) · 07bdbebb

Younes Belkada authored Jan 12, 2024



* add llava + fused modules

* Update src/transformers/models/llava/modeling_llava.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

07bdbebb

11 Jan, 2024 14 commits

Fix broken link on page (#28451) · 995a7ce9

Hankyeol Kyung authored Jan 12, 2024



* [docs] Fix broken link
Signed-off-by: Hankyeol Kyung <kghnkl0103@gmail.com>

* [docs] Use shorter domain
Signed-off-by: Hankyeol Kyung <kghnkl0103@gmail.com>

---------
Signed-off-by: Hankyeol Kyung <kghnkl0103@gmail.com>

995a7ce9

Fix docstring checker issues with PIL enums (#28450) · 14345135
Matt authored Jan 11, 2024

14345135

Doc (#28431) · 19e83d17

jiqing-feng authored Jan 12, 2024

* update version for cpu training

* update docs for cpu training

* fix readme

* fix readme

19e83d17

Byebye torch 1.10 (#28207) · 59cd9de3

Yih-Dar authored Jan 11, 2024



* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

59cd9de3

Fix load balancing loss func for mixtral (#28256) · e768616a

liangxuZhang authored Jan 11, 2024



* Correct the implementation of auxiliary loss of mixtrtal

* correct the implementation of auxiliary loss of mixtrtal

* Implement a simpler calculation method

---------
Co-authored-by: zhangliangxu3 <zhangliangxu3@jd.com>

e768616a

Correctly resolve trust_remote_code=None for AutoTokenizer (#28419) · 5d4d62d0
Matt authored Jan 11, 2024
```
* Correctly resolve trust_remote_code=None for AutoTokenizer

* Second attempt at a proper resolution
```
5d4d62d0

[Phi] Extend implementation to use GQA/MQA. (#28163) · 55090585

Gustavo de Rosa authored Jan 11, 2024

* chore(phi): Updates configuration_phi with missing keys.

* chore(phi): Adds first draft of combined modeling_phi.

* fix(phi): Fixes according to latest review.

* fix(phi): Removes pad_vocab_size_multiple to prevent inconsistencies.

* fix(phi): Fixes unit and integration tests.

* fix(phi): Ensures that everything works with microsoft/phi-1 for first integration.

* fix(phi): Fixes output of docstring generation.

* fix(phi): Fixes according to latest review.

* fix(phi): Fixes according to latest review.

* fix(tests): Re-enables Phi-1.5 test.

* fix(phi): Fixes attention overflow on PhiAttention (for Phi-2).

* fix(phi): Improves how queries and keys are upcast.

* fix(phi): Small updates on latest changes.

55090585

Optionally preprocess segmentation maps for MobileViT (#28420) · d5606378

Harisankar Babu authored Jan 11, 2024

* optionally preprocess segmentation maps for mobilevit

* changed pretrained model name to that of segmentation model

* removed voc-deeplabv3 from model archive list

* added preprocess_image and preprocess_mask methods for processing images and segmentation masks respectively

* added tests for segmentation masks based on segformer feature extractor

* use crop_size instead of size

* reverting to initial model

d5606378

Set `cache_dir` for `evaluate.load()` in example scripts (#28422) · 95091e15

Alex Hedges authored Jan 11, 2024

While using `run_clm.py`,[^1] I noticed that some files were being added
to my global cache, not the local cache. I set the `cache_dir` parameter
for the one call to `evaluate.load()`, which partially solved the
problem. I figured that while I was fixing the one script upstream, I
might as well fix the problem in all other example scripts that I could.

There are still some files being added to my global cache, but this
appears to be a bug in `evaluate` itself. This commit at least moves
some of the files into the local cache, which is better than before.

To create this PR, I made the following regex-based transformation:
`evaluate\.load\((.*?)\)` -> `evaluate\.load\($1,
cache_dir=model_args.cache_dir\)`. After using that, I manually fixed
all modified files with `ruff` serving as useful guidance. During the
process, I removed one existing usage of the `cache_dir` parameter in a
script that did not have a corresponding `--cache-dir` argument
declared.

[^1]: I specifically used `pytorch/language-modeling/run_clm.py` from
v4.34.1 of the library. For the original code, see the following URL:
https://github.com/huggingface/transformers/tree/acc394c4f5e1283c19783581790b3dc3105a3697/examples/pytorch/language-modeling/run_clm.py.

95091e15

Fix docker file (#28452) · 5fd5ef76

Yih-Dar authored Jan 11, 2024



fix docker file
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

5fd5ef76

Use python 3.10 for docbuild (#28399) · d019acb8
Yih-Dar authored Jan 11, 2024
```
update
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
d019acb8

Optimize the speed of the truncate_sequences function. (#28263) · 2a85345a

ikkvix authored Jan 11, 2024



* change truncate_sequences

* Update tokenization_utils_base.py

* change format

* fix when ids_to_move=0

* fix

* Update src/transformers/tokenization_utils_base.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

2a85345a

Enable multi-label image classification in pipeline (#28433) · 66964c00
amyeroberts authored Jan 11, 2024
```
Enable multi-label image classification
```
66964c00
Assitant model may on a different device (#27995) · 8205b264
jiqing-feng authored Jan 11, 2024
```
* Assitant model may on a different device

* fix tensor device
```
8205b264