Commits · 695928e1e573e5a9953ad540c0f8345767348feb · chenpangpang / transformers

13 Jun, 2023 6 commits

Sylvain Gugger authored Jun 13, 2023

* First test

* Add info for all models

* style

* Repo consistency

* Fix last model and cleanup prints

* Repo consistency

* Use consistent function for detecting tied weights

695928e1

deprecate `use_mps_device` (#24239) · 3723329d
Sourab Mangrulkar authored Jun 13, 2023

fix overflow when training mDeberta in fp16 (#24116) · 3e142cb0

Sebastian authored Jun 13, 2023

* Porting changes from https://github.com/microsoft/DeBERTa/ that hopefully allows for fp16 training of mdeberta

* Updates to deberta modeling from microsoft repo

* Performing some cleanup

* Undoing changes that weren't necessary

* Undoing float calls

* Minimally change the p2c block

* Fix error

* Minimally changing the c2p block

* Switch to torch sqrt

* Remove math

* Adding back the to calls to scale

* Undoing attention_scores change

* Removing commented out code

* Updating modeling_sew_d.py to satisfy utils/check_copies.py

* Missed changed

* Further reduce changes needed to get fp16 working

* Reverting changes to modeling_sew_d.py

* Make same change in TF

Safely import pytest in testing_utils.py (#24241) · f91810da
amyeroberts authored Jun 13, 2023

Improving error message when using `use_safetensors=True`. (#24232) · fdd78d91
Nicolas Patry authored Jun 13, 2023

fix: TextIteratorStreamer cannot work with pipeline (#23641) · d7389cd2

yuanwu2017 authored Jun 13, 2023



* fix: TextIteratorStreamer cannot work with pipeline

Deepcopying the TextIteratorStreamer object causes the exception.
Signed-off-by: yuanwu <yuan.wu@intel.com>

* Update src/transformers/pipelines/text_generation.py

Got it. I will update the patch.
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/pipelines/text_generation.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update text_generation.py

---------
Signed-off-by: yuanwu <yuan.wu@intel.com>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

12 Jun, 2023 11 commits
- Finish dataloader integration (#24201) · 4da84008
  Zach Mueller authored Jun 12, 2023
- Update `WhisperForAudioClassification` doc example (#24188) · 0675600a
  Yih-Dar authored Jun 12, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
- Remove unnecessary aten::to overhead in llama (#24203) · e5dd7432
  fxmarty authored Jun 13, 2023
```
* fix dtype init

* fix copies

* fix fixcopies mess

* edit forward as well

* copy
```
- Skip RWKV test in past CI (#24204) · 4fe9716a
  Yih-Dar authored Jun 12, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
- Fix `_load_pretrained_model` (#24200) · 08ae37c8
  Marc Sun authored Jun 12, 2023
```
Fix test
```
- 🚨🚨🚨 Replace DataLoader logic for Accelerate in Trainer, remove unneeded tests 🚨🚨🚨 (#24028) · ebd94b0f
  Zach Mueller authored Jun 12, 2023
```
* Working integration

* Fix failing test

* Revert label host logic

* Bring it back!
```
- Generate: detect special architectures when loaded from PEFT (#24198) · 60b69f7d
  Joao Gante authored Jun 12, 2023
- Fix device issue in `OpenLlamaModelTest::test_model_parallelism` (#24195) · a9cdb059
  Yih-Dar authored Jun 12, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
- Generate: force caching on the main model, in assisted generation (#24177) · 9f81f4f6
  Joao Gante authored Jun 12, 2023
- Change ProgressCallback to use dynamic_ncols=True (#24101) · ba64ec07
  AinL authored Jun 12, 2023
```
* Change ProgressCallback to use dynamic_ncols=True

* style: make style

* Revert "style: make style"

This reverts commit dee484904cd30a072d80e3be0a3d74a03cff30c6.

* run make style only trainer_callback
```
- Fix push to hub (#24187) · 93f73a38
  NielsRogge authored Jun 12, 2023
```
Add fix
```
10 Jun, 2023 1 commit

Avoid OOM in doctest CI (#24139) · 8f093fb7

Yih-Dar authored Jun 10, 2023



* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

09 Jun, 2023 10 commits
- Tool types (#24032) · deff5979
  Lysandre Debut authored Jun 09, 2023
```
* Tool types

* Tests + fixes

* Isolate types

* Oops

* Review comments + docs

* Tests + docs

* soundfile -> vision
```
- Fix typo in streamers.py (#24144) · 061580c8
  Freddie Vargus authored Jun 09, 2023
- [BlenderBotSmall] Update doc example (#24092) · a7501f6f
  Arthur authored Jun 09, 2023
```
* small tokenizer uses `__start__` and `__end__`

* fix PR doctest
```
- [lamaTokenizerFast] Update documentation (#24132) · 5af3a1aa
  Arthur authored Jun 09, 2023
```
* Update documentation

* nits
```
- [`SAM`] Fix sam slow test (#24140) · 62fe7533
  Younes Belkada authored Jun 09, 2023
```
* fix sam test

* update pipeline typehint
```
- fix bugs with trainer (#24134) · f2b91835
  Sourab Mangrulkar authored Jun 09, 2023
```
* fix the deepspeed test failures

* apex fix

* FSDP save ckpt fix

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
- Generate: PT's `top_p` enforces `min_tokens_to_keep` when it is `1` (#24111) · be10092e
  Joao Gante authored Jun 09, 2023
- Correctly build models and import call_context for older TF versions (#24138) · 03585f37
  Matt authored Jun 09, 2023
- [`bnb`] Fix bnb config json serialization (#24137) · a6d05d55
  Younes Belkada authored Jun 09, 2023
```
* fix bnb config json serialization

* forward contrib credits from discussions

---------
Co-authored-by: Andrechang <Andrechang@users.noreply.github.com>
```
- [Lllama] Update tokenization code to ensure parsing of the special tokens [core] (#24042) · 535542d3
  Arthur authored Jun 09, 2023
```
* preventllama fast from returning token type ids

* remove type hints

* normalised False
```
08 Jun, 2023 8 commits
- Fix typo in Llama docstrings (#24020) · 9322c244
  Serge Panev authored Jun 08, 2023
```
* Fix typo in Llama docstrings
Signed-off-by: Serge Panev <spanev@nvidia.com>

* Update
Signed-off-by: Serge Panev <spanev@nvidia.com>

* make style
Signed-off-by: Serge Panev <spanev@nvidia.com>

---------
Signed-off-by: Serge Panev <spanev@nvidia.com>
```
- add trust_remote_code option to CLI download cmd (#24097) · a73883ae
  Radamés Ajna authored Jun 08, 2023
```
* add trust_remote_code option

* require_torch
```
- [`GPT2`] Add correct keys on `_keys_to_ignore_on_load_unexpected` on all child classes of `GPT2PreTrainedModel` (#24113) · 8b169142
  Younes Belkada authored Jun 08, 2023
```
* add correct keys on `_keys_to_ignore_on_load_unexpected`

* oops
```
- fix get_keys_to_not_convert function (#24095) · 71a114d3
  Marc Sun authored Jun 08, 2023
```
* fix get_keys_to_not_convert funct

* Fix style
```
- Update the pin on Accelerate (#24110) · 8c5f3067
  Sylvain Gugger authored Jun 08, 2023
- [`Trainer`] Correct behavior of `_load_best_model` for PEFT models (#24103) · 2200bf7a
  Younes Belkada authored Jun 08, 2023
```
* v1

* some refactor

- add ST format as well

* fix

* add `ADAPTER_WEIGHTS_NAME` & `ADAPTER_SAFE_WEIGHTS_NAME`
```
- reset accelerate env variables after each test (#24107) · 0f236050
  Sourab Mangrulkar authored Jun 08, 2023
- Fix a tiny typo in `WhisperForConditionalGeneration::generate` docstring (#24045) · 5fa0a1b2
  Sadra Barikbin authored Jun 08, 2023
07 Jun, 2023 4 commits

v4.31.0.dev0 · ba695c1e
Sylvain Gugger authored Jun 07, 2023

Add AzureOpenAiAgent (#24058) · c3572e6b

Sylvain Gugger authored Jun 07, 2023



* Add AzureOpenAiAgent

* quality

* Update src/transformers/tools/agents.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

---------
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

Up pinned accelerate version (#24089) · 5eb3d3c7

Zachary Mueller authored Jun 07, 2023

* Min accelerate

* Also min version

* Min accelerate

* Also min version

* To different minor version

* Empty

fix accelerator prepare during eval only mode (#24014) · d1c039e3

Sourab Mangrulkar authored Jun 08, 2023

* fix mixed precision prep during eval only mode

* update to address comments

* update to reflect the changes in accelerate
