- 09 Jun, 2023 8 commits
- Yih-Dar authored
  fix
  Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
- Yih-Dar authored
  fix
  Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
- Sourab Mangrulkar authored
  * fix the deepspeed test failures
  * apex fix
  * FSDP save ckpt fix
  * Update src/transformers/trainer.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
- Joao Gante authored
- Matt authored
- Younes Belkada authored
  * fix bnb config json serialization
  * forward contrib credits from discussions
  Co-authored-by: Andrechang <Andrechang@users.noreply.github.com>
- Elliott Wang authored
- Arthur authored
  * prevent llama fast from returning token type ids
  * remove type hints
  * normalised False
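The Llama fix above stops the fast tokenizer from emitting `token_type_ids`. A minimal sketch of the underlying idea, assuming a tokenizer that filters its output down to a declared list of model input names (the field values below are illustrative):

```python
# A tokenizer only returns the fields listed in its model input names;
# dropping "token_type_ids" from that list removes it from the output.
MODEL_INPUT_NAMES = ["input_ids", "attention_mask"]  # no "token_type_ids"

def filter_encoding(encoding: dict) -> dict:
    """Keep only the fields the model actually consumes."""
    return {k: v for k, v in encoding.items() if k in MODEL_INPUT_NAMES}

raw = {
    "input_ids": [1, 15043, 3186],
    "token_type_ids": [0, 0, 0],
    "attention_mask": [1, 1, 1],
}
filtered = filter_encoding(raw)
```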
- 08 Jun, 2023 9 commits
- Yih-Dar authored
  * fix
  * fix
  Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
- Serge Panev authored
  * Fix typo in Llama docstrings
  * Update
  * make style
  Signed-off-by: Serge Panev <spanev@nvidia.com>
- Radamés Ajna authored
  * add trust_remote_code option
  * require_torch
- Younes Belkada authored
  [`GPT2`] Add correct keys on `_keys_to_ignore_on_load_unexpected` on all child classes of `GPT2PreTrainedModel` (#24113)
  * add correct keys on `_keys_to_ignore_on_load_unexpected`
  * oops
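`_keys_to_ignore_on_load_unexpected` holds regex patterns; checkpoint keys matching any pattern are silently discarded instead of being reported as unexpected when a state dict is loaded. A minimal sketch of that filtering step, assuming illustrative key names in the style of GPT-2's recomputed attention-bias buffers:

```python
import re

# Patterns of checkpoint keys that are safe to discard on load,
# e.g. buffers the model recomputes at init time (illustrative).
_keys_to_ignore_on_load_unexpected = [
    r"h\.\d+\.attn\.bias",
    r"h\.\d+\.attn\.masked_bias",
]

def report_unexpected(checkpoint_keys, model_keys, ignore_patterns):
    """Return checkpoint keys the model has no slot for, minus ignored ones."""
    unexpected = [k for k in checkpoint_keys if k not in model_keys]
    return [
        k for k in unexpected
        if not any(re.search(p, k) for p in ignore_patterns)
    ]

ckpt = ["wte.weight", "h.0.attn.bias", "h.0.attn.c_attn.weight", "lm_head.weight"]
model = ["wte.weight", "h.0.attn.c_attn.weight"]
leftover = report_unexpected(ckpt, model, _keys_to_ignore_on_load_unexpected)
```

Only genuinely surprising keys survive the filter; the matched bias buffer is dropped without a warning.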
- Marc Sun authored
  * fix get_keys_to_not_convert function
  * Fix style
- Sylvain Gugger authored
- Younes Belkada authored
  * v1
  * some refactor - add ST format as well
  * fix
  * add `ADAPTER_WEIGHTS_NAME` & `ADAPTER_SAFE_WEIGHTS_NAME`
- Sourab Mangrulkar authored
- Sadra Barikbin authored
- 07 Jun, 2023 18 commits
- Sylvain Gugger authored
- Sylvain Gugger authored
  * Add AzureOpenAiAgent
  * quality
  * Update src/transformers/tools/agents.py
  Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
- Zachary Mueller authored
  * Min accelerate
  * Also min version
  * Min accelerate
  * Also min version
  * To different minor version
  * Empty
- Sourab Mangrulkar authored
  * fix mixed precision prep during eval only mode
  * update to address comments
  * update to reflect the changes in accelerate
- Sylvain Gugger authored
  * Do not prepare lr scheduler as it has the right number of steps
  * Trigger CI
  * Trigger CI
  * Trigger CI
  * Add fake comment
  * Remove fake comment
  * Trigger CI please!
- Sourab Mangrulkar authored
  * fix executable batch size issue
  * fix
  * undo
- Mishig authored
  fix base workflow name
- Sylvain Gugger authored
  * Fix expected value in tests of the test fetcher
  * Fix trigger for repo util tests
- Mishig authored
- Matt authored
  * Let's see if we can use the smallest possible dummies
  * Make GPT-2's dummies a little longer
  * Just use (1,2) as the default shape
  * Update other dummies in sync
  * Correct imports for Keras 2.13
  * Shrink the Wav2Vec2 dummies
- Yih-Dar authored
  * fix
  * fix
  Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
- Younes Belkada authored
  * fix skip modules test
  * oops
  * address comments
- Michael Benayoun authored
  Fix is_optimum_neuron_available
- Younes Belkada authored
  add `safe_serialization` in push_to_hub
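The `safe_serialization` flag above selects the safetensors format when pushing a checkpoint to the Hub. A minimal sketch of the file-name choice it implies, a deliberate simplification of the real save path:

```python
def weights_file_name(safe_serialization: bool) -> str:
    # safetensors checkpoints use a different canonical file name
    # than plain PyTorch pickle checkpoints.
    return "model.safetensors" if safe_serialization else "pytorch_model.bin"

safe_name = weights_file_name(True)
legacy_name = weights_file_name(False)
```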
- Younes Belkada authored
  * support PEFT models when saving the model using trainer
  * fixup
- YangLiu authored
  * Add support for non-rust implemented tokenization for `__getitem__` method.
  * Update for error message on adding new sub-branch for `__item__` method.
  Co-authored-by: liuyang17 <liuyang17@zhihu.com>
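The `__getitem__` change above concerns indexing the batch output of Python-only ("slow") tokenizers, which lack the rust-backed per-example encodings, and adds a clearer error for the integer-index branch. A hedged sketch of that dispatch, with a hypothetical class name and illustrative data:

```python
class BatchEncodingSketch:
    """Dict-like container for tokenizer output; illustrative only."""

    def __init__(self, data, encodings=None):
        self.data = data            # field name -> values
        self.encodings = encodings  # rust-backed per-example encodings, if any

    def __getitem__(self, item):
        if isinstance(item, str):
            return self.data[item]  # works for slow and fast tokenizers
        if isinstance(item, int):
            if self.encodings is None:
                raise KeyError(
                    "Integer indexing needs rust-backed encodings; "
                    "index by field name instead."
                )
            return self.encodings[item]
        raise KeyError("Invalid key. Use a string field name or an int index.")

enc = BatchEncodingSketch({"input_ids": [[1, 2, 3]]})
ids = enc["input_ids"]
```

String keys always work; the integer branch fails loudly rather than with an opaque attribute error.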
- Patrick von Platen authored
  * [Wav2Vec2] Fix torch script
  * fix more
- Joao Gante authored
  increase atol
- 06 Jun, 2023 5 commits
- Sylvain Gugger authored
  * Fix model load when it has both code on the Hub and locally
  * Add input check with timeout
  * Add tests
  * Apply suggestions from code review
  * Some non-saved stuff
  * Add feature extractors
  * Add image processor
  * Add model
  * Add processor and tokenizer
  * Reduce timeout
  Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
- Sylvain Gugger authored
  * Fix device placement for model-parallelism in generate for encoder/decoders
  * Remove debug statements
- Yih-Dar authored
  fix
  Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
- Edward Z. Yang authored
  * Use new parametrization based weight norm if available (see https://github.com/pytorch/pytorch/pull/103001)
  * handle copies
  * black
  Signed-off-by: Edward Z. Yang <ezyang@meta.com>
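The entry above prefers the new parametrization-based weight norm when the installed torch exposes it, falling back to the legacy API otherwise. A minimal sketch of that feature-detection pattern, using stand-in namespaces rather than torch itself (all names here are illustrative):

```python
import types

# Stand-in for a utils module where the "new" API may or may not exist.
nn_utils = types.SimpleNamespace(
    weight_norm=lambda m: "legacy weight_norm",
    parametrizations=types.SimpleNamespace(
        weight_norm=lambda m: "parametrized weight_norm",
    ),
)

def pick_weight_norm(utils):
    """Prefer the parametrization-based API when available, else fall back."""
    parametrizations = getattr(utils, "parametrizations", None)
    if parametrizations is not None and hasattr(parametrizations, "weight_norm"):
        return parametrizations.weight_norm
    return utils.weight_norm

apply_norm = pick_weight_norm(nn_utils)
result = apply_norm(None)
```

Feature detection via `getattr`/`hasattr` keeps the code working across library versions without pinning a minimum release.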
- Matt authored
  * A fun new PR where I break the entire codebase again
  * A fun new PR where I break the entire codebase again
  * Handle cross-attention
  * Move calls to model(model.dummy_inputs) to the new build() method
  * Seeing what fails with the build context thing
  * make fix-copies
  * Let's see what fails with new build methods
  * Fix the pytorch crossload build calls
  * Fix the overridden build methods in vision_text_dual_encoder
  * Make sure all our build methods set self.built or call super().build(), which also sets it
  * make fix-copies
  * Remove finished TODO
  * Tentatively remove unneeded (?) line
  * Transpose b in deberta correctly and remove unused threading local
  * Get rid of build_with_dummies and all it stands for
  * Rollback some changes to TF-PT crossloading
  * Correctly call super().build()