Commits · e1c3ac25515839146c93427e55941de9cee3401e · chenpangpang / transformers

10 Nov, 2023 7 commits

Add Phi-1 and Phi-1_5 (#26170) · e1c3ac25

Susnato Dhar authored Nov 10, 2023

* only dir not even init

* init

* tokenizer removed and reference of codegen added

* modeling file updated a lot remaining app_rotary_emb

* conversion script done

* conversion script fixed, a lot of factoring done and most tests pass

* added token_clf and extractive_QA_head

* integration tests pass

* flash attn tests pass!

* config done

* more docs in modeling file

* some style fix

* style and others

* doc test error fix

* more doc fix

* some attention fixes

* most fixes

* style and other fixes

* docs fix and config

* doc fix

* some comments

* conversion script updated

* conversion script updated

* Revert "conversion script updated"

This reverts commit e92378c54084ec0747041b113083d1746ecb6c7f.

* final comments

* add Phi to language_modeling.md

* edit phi.md file

* rebase and fix

* removed phi-1.5 example

* changed model_type from 'phi'->'mixformer-sequential'

* small change

* small change

* revert \small change

* changed mixformer-sequential->phi

* small change

* added phi-1.5 example instead of phi-1

* doc test might pass now

* rebase and small change

* added the dropout layer

* more fixes

* modified .md file

* very very small doc change

e1c3ac25

At most 2 GPUs for CI (#27435) · 00dc8562

Yih-Dar authored Nov 10, 2023



At most 2 GPUs
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

00dc8562

[`AttentionMaskConverter`] ]Fix-mask-inf (#27114) · 68afca3e

Arthur authored Nov 10, 2023

* fix?

* actual fix

* fixups

* add dataclass to the attention mask converter

* refine testing suite

* make sure there are no overflows

* update the test

68afca3e

Add CLVP (#24745) · 7e9f10ac

Susnato Dhar authored Nov 10, 2023

* init commit

* attention arch done except rotary emb

* rotary emb done

* text encoder working

* outputs matching

* arch first pass done

* make commands done, tests and docs remaining

* all tests passed, only docs remaining

* docs done

* doc-builder fix

* convert script removed(not relevant)

* minor comments done

* added ckpt conversion script

* tokenizer done

* very minor fix of index.md 2

* mostly make fixup related

* all done except fe and rotary emb

* very small change

* removed unidecode dependency

* style changes

* tokenizer removed require_backends

* added require_inflect to tokenizer tests

* removed VOCAB_FILES in tokenizer test

* inflect dependency removed

* added rotary pos emb cache and simplified the apply method

* style

* little doc change

* more comments

* feature extractor added

* added processor

* auto-regressive config added

* added CLVPConditioningEncoder

* comments done except the test one

* weights added successfull(NOT tested)

* tokenizer fix with numbers

* generate outputs matching

* almost tests passing Integ tests not written

* Integ tests added

* major CUDA error fixed

* docs done

* rebase and multiple fixes

* fixed rebase overwrites

* generate code simplified and tests for AutoRegressive model added

* minor changes

* refectored gpt2 code in clvp file

* weights done and all code refactored

* mostly done except the fast_tokenizer

* doc test fix

* config file's doc fixes

* more config fix

* more comments

* tokenizer comments mostly done

* modeling file mostly refactored and can load modules

* ClvpEncoder tested

* ClvpDecoder, ClvpModel and ClvpForCausalLM tested

* integration and all tests passed

* more fixes

* docs almost done

* ckpt conversion refectored

* style and some failing tests fix

* comments

* temporary output fix but test_assisted_decoding_matches_greedy_search test fails

* majority changes done

* use_cache outputs same now! Along with the asisted_greedy_decoding test fix

* more comments

* more comments

* prepare_inputs_for_generation fixed and _prepare_model_inputs added

* style fix

* clvp.md change

* moved clvpconditionalencoder norms

* add model to new index

* added tokenizer input_ids_with_special_tokens

* small fix

* config mostly done

* added config-tester and changed conversion script

* more comments

* comments

* style fix

* some comments

* tokenizer changed back to prev state

* small commnets

* added output hidden states for the main model

* style fix

* comments

* small change

* revert small change

* .

* Update clvp.md

* Update test_modeling_clvp.py

* :)

* some minor change

* new fixes

* remove to_dict from FE

7e9f10ac

update Bark FA2 docs (#27400) · 9dd58c53

Yoach Lacombe authored Nov 10, 2023



* update Bark FA2 docs

* update benchmark section

* Update bark.md

* Apply suggestions from code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* rephrase

---------
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

9dd58c53

[`Quantization`] Add str to enum conversion for AWQ (#27320) · fd685cfd

Younes Belkada authored Nov 10, 2023



* add str to enum conversion

* fixup

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

fd685cfd

add attention_mask and position_ids in assisted model (#26892) · 184f60dc

jiqing-feng authored Nov 10, 2023

* add attention_mask and position_ids in assisted model

* fix bug

* fix attention mask

* fix attention_mask

* check assist inputs

* check assist input ids length

* fix assist model type

* set assist attention mask device

184f60dc

09 Nov, 2023 15 commits
- Run all tests if `circleci/create_circleci_config.py` is modified (#27413) · cf32c941
  Yih-Dar authored Nov 09, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  cf32c941
- Fix `Owlv2` checkpoint name and a default value in `Owlv2VisionConfig` (#27402) · 740cd935
  Yih-Dar authored Nov 09, 2023
```
* fix

* fix

* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  740cd935
- remove failing tests and clean FE files (#27414) · 51a98c40
  Yoach Lacombe authored Nov 09, 2023
```
* remove failing tests and clean FE files

* remove same similar text from tvlt
```
  51a98c40
- Fix RequestCounter to make it more future-proof (#27406) · e38348ae
  Lucain authored Nov 09, 2023
```
* Fix RequestCounter to make it more future-proof

* code quality
```
  e38348ae
- Final fix of the accelerate installation issue (#27408) · c8b6052f
  Yih-Dar authored Nov 09, 2023
```
* fix

* [test-all] commit

* fix

* [test-all] commit

* [test-all] commit

* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  c8b6052f
- Use editable install for git deps (#27404) · c5037b45
  Zach Mueller authored Nov 09, 2023
```
* Use editable install

* Full command
```
  c5037b45
- Fix fuyu checkpoint repo in `FuyuConfig` (#27399) · cf2a3f37
  Yih-Dar authored Nov 09, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  cf2a3f37
- use `pytest.mark` directly (#27390) · 3258ff93
  Yih-Dar authored Nov 09, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  3258ff93
- Adds dvclive callback (#27352) · 791ec370
  Dave Berenbaum authored Nov 09, 2023
```
* dvclive trainer callback

* style fixes

* dvclive link fixes
```
  791ec370
- device-agnostic deepspeed testing (#27342) · c5d7754b
  Hz, Ji authored Nov 09, 2023
  
  c5d7754b
- Skip failing cache call tests (#27393) · 9999b739
  amyeroberts authored Nov 09, 2023
```
* Skip failing cache call tests

* Fixup
```
  9999b739
- Put doctest options back to `pyproject.toml` (#27366) · bc086a25
  Yih-Dar authored Nov 09, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  bc086a25
- Change thresh in test (#27378) · e9adb0c9
  Zach Mueller authored Nov 09, 2023
```
Change thresh
```
  e9adb0c9
- [`CodeLlamaTokenizer`] Nit, update __init__ to make sure the AddedTokens are... · 085ea7e5
  Arthur authored Nov 09, 2023
```
[`CodeLlamaTokenizer`] Nit, update __init__ to make sure the AddedTokens are not normalized because they are special (#27359)

* make sure tokens are properly initialized for codellama slow

* add m ore pretrained models

* style

* test more tokenizers checkpoints
```
  085ea7e5
- Smangrul/fix failing ds ci tests (#27358) · 7ecd229b
  Sourab Mangrulkar authored Nov 09, 2023
```
* fix failing DeepSpeed CI tests due to `safetensors` being default

* debug

* remove debug statements

* resolve comments

* Update test_deepspeed.py
```
  7ecd229b
08 Nov, 2023 13 commits

translate debugging.md to chinese (#27374) · ced9fd86
jiaqiw09 authored Nov 08, 2023
```
* update

* update
```
ced9fd86
Update deprecated `torch.range` in `test_modeling_ibert.py` (#27355) · 0e402e14
Sergii Dymchenko authored Nov 08, 2023
```
* Update deprecated torch.range

* Remove comment
```
0e402e14

Add Flash Attention 2 support to Bark (#27364) · a5bee89c

Yoach Lacombe authored Nov 08, 2023



* change handmade attention mask to _prepare_4d_attention_mask

* add flashattention2 support in Bark

* add flashattention2 tests on BarkSemanticModel

* make style

* fix flashattention and tests + make style

* fix memory leak and allow Bark to pass flash attention to sub-models

* make style

* Apply suggestions from code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* remove unecessary code from tests + justify overriding

* Update tests/models/bark/test_modeling_bark.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* make style

---------
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

a5bee89c

translate big_models.md and performance.md to chinese (#27334) · ef716736

jiaqiw09 authored Nov 08, 2023

* translate performance.md

* tranlsate performance.md and big_models.md

* update translation

* update review

ef716736

Fix tiny model script: not using `from_pt=True` (#27372) · bd8f45b1
Yih-Dar authored Nov 08, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
bd8f45b1
[Flax Whisper] large-v3 compatibility (#27360) · 7b175cfa
Sanchit Gandhi authored Nov 08, 2023

7b175cfa
Remove unused param from example script tests (#27354) · 845aa832
Zach Mueller authored Nov 08, 2023
```
Unused param
```
845aa832

Translate index.md to Turkish (#27093) · eb30a49b

Mert Yanık authored Nov 08, 2023



* Add index.md for tukish language

* Fix index.md (huggingface/transformers#27088)

* Add 'tr' to additional files

* Update docs/source/tr/_toctree.yml
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update index.md

---------
Co-authored-by: Mert Yanık <mert.yanik@lcwaikiki.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

eb30a49b

MusicGen Update (#27084) · f16ff0f0

Sanchit Gandhi authored Nov 08, 2023

* [MusicGen] Add stereo model

* safe serialization

* Update src/transformers/models/musicgen/modeling_musicgen.py

* split over 2 lines

* fix slow tests on cuda

f16ff0f0

Fix `Kosmos-2` device issue (#27346) · 5ef650b0

Yih-Dar authored Nov 08, 2023



* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

5ef650b0

Fix example tests from failing (#27353) · efa57cb2
Zach Mueller authored Nov 08, 2023
```
* Fix example tests from failing

* CHange thresh
```
efa57cb2
moving example of benchmarking to legacy dir (#27337) · b6dbfee0
Hz, Ji authored Nov 08, 2023
```
move example of benchmarking to legacy
```
b6dbfee0

Add numpy alternative to FE using torchaudio (#26339) · be74b2ea

Yoach Lacombe authored Nov 08, 2023

* add audio_utils usage in the FE of SpeechToText

* clean unecessary parameters of AudioSpectrogramTransformer FE

* add audio_utils usage in AST

* add serialization tests and function to FEs

* make style

* remove use_torchaudio and move to_dict to FE

* test audio_utils usage

* make style and fix import (remove torchaudio dependency import)

* fix torch dependency for jax and tensor tests

* fix typo

* clean tests with suggestions

* add lines to test if is_speech_availble is False

be74b2ea

07 Nov, 2023 5 commits

translate model_sharing.md and llm_tutorial.md to chinese (#27283) · e2647450

jiaqiw09 authored Nov 07, 2023

* translate model_sharing.md

* translate llm_tutorial.md to chiense

* update wrong translation

* update _torctree.yml

* update typos

* update

e2647450

translate the en tokenizer_summary.md to Chinese (#27291) · f213d5dd
九是否随意的称呼 authored Nov 08, 2023
```
* translate the en tokenizer_summary.md to Chinese

* revise WordPiece

* add to source/zh/_toctree.yml
```
f213d5dd

Allow scheduler parameters (#26480) · 7e1eff76

Plemeur authored Nov 08, 2023



* Allow for scheduler kwargs

* Formatting

* Arguments checks, passing the tests

* Black failed somehow

---------
Co-authored-by: Pierre <pierre@avatarin.com>

7e1eff76

FIx Bark batching feature (#27271) · ac5d4cf6
Yoach Lacombe authored Nov 07, 2023
```
* fix bark batching

* make style

* add tests and make style
```
ac5d4cf6
[`Whisper`] Nit converting the tokenizer (#27349) · 8f840edd
Arthur authored Nov 07, 2023
```
* `nospeech` instead of `nocaption` for the no speech token

* oups
```
8f840edd