"tests/models/vilt/test_image_processing_vilt.py" did not exist on "4975002df50c472cbb6f8ac3580e475f570606ab"
  1. 09 May, 2023 1 commit
    • Matthijs Hollemans's avatar
      audio_utils improvements (#21998) · 7f919509
      Matthijs Hollemans authored
      * silly change to allow making a PR
      
      * clean up doc comments
      
      * simplify hertz_to_mel and mel_to_hertz
      
      * fixup
      
      * clean up power_to_db
      
      * also add amplitude_to_db
      
      * move functions
      
      * clean up mel_filter_bank
      
      * fixup
      
      * credit librosa & torchaudio authors
      
      * add unit tests
      
      * tests for power_to_db and amplitude_to_db
      
      * add mel_filter_bank tests
      
      * rewrite STFT
      
      * add convenience spectrogram function
      
      * missing transpose
      
      * fewer transposes
      
      * add integration test to M-CTC-T
      
      * frame length can be either window or FFT length
      
      * rewrite stft API
      
      * add preemphasis coefficient
      
      * move argument
      
      * add log option to spectrogram
      
      * replace M-CTC-T feature extractor
      
      * fix api thing
      
      * replace whisper STFT
      
      * replace whisper mel filters
      
      * replace tvlt's stft
      
      * allow alternate window names
      
      * replace speecht5 stft
      
      * fixup
      
      * fix integration tests
      
      * fix doc comments
      
      * remove manual FFT length calculation
      
      * fix docs
      
      * go away, deprecation warnings
      
      * combine everything into spectrogram function
      
      * add deprecated functions back
      
      * fixup
      7f919509
  2. 07 May, 2023 1 commit
  3. 05 May, 2023 2 commits
  4. 04 May, 2023 3 commits
  5. 03 May, 2023 3 commits
  6. 02 May, 2023 1 commit
  7. 01 May, 2023 1 commit
  8. 28 Apr, 2023 2 commits
  9. 27 Apr, 2023 4 commits
  10. 26 Apr, 2023 2 commits
  11. 24 Apr, 2023 2 commits
  12. 23 Apr, 2023 1 commit
  13. 21 Apr, 2023 6 commits
  14. 20 Apr, 2023 3 commits
  15. 19 Apr, 2023 2 commits
    • Arthur's avatar
      Add Segment Anything Model (SAM) (#22654) · 474bf508
      Arthur authored
      
      
      * initial commit
      
      * keys match
      
      * update, fix conversion
      
      * fixes, inference working
      
      * fix
      
      * more fixes
      
      * more fixes
      
      * clean up
      
      * more clean up
      
      * fix copies and add convext copied layer norm
      
      * stash
      
      * pretty big upfate
      
      * cleaning
      
      * more cleaning
      
      * fixup stuffs
      
      * fix copies
      
      * fix iinit
      
      * update test removing tokenizer
      
      * nits
      
      * add pretrained
      
      * more nits
      
      * remove tracking of pipeline
      
      * few fixes
      
      * update san and conversion script
      
      * fix mask decoder and prompt encoder conversion
      
      * fixes
      
      * small update
      
      * fix order
      
      * fix
      
      * fix image embeddings
      
      * nites
      
      * few fixes
      
      * fix logits
      
      * clean up
      
      * fixes boxes inference
      
      * v1 AMG
      
      * clean up
      
      * some clean up
      
      * multi points support
      
      * amg working
      
      * fixup
      
      * clean up
      
      * readme
      
      * update toctree
      
      * fix type hint
      
      * multiple fixes
      
      * fixup
      
      * fixes
      
      * updates
      
      * updates
      
      * more tests
      
      * few fixes
      
      * change to `SamForMaskGeneration`
      
      * doc
      
      * fixup
      
      * fix more tests
      
      * multiple fixes
      
      * fix CI tests
      
      * refactor processor
      
      * renamings
      
      * draft the pipeline
      
      * refactor
      
      * fix tests
      
      * fix test
      
      * few cleanings
      
      * fix test
      
      * edit pipelien support chunking
      
      * udate
      
      * add slow tests
      
      * fix nit
      
      * fixup
      
      * fix nit
      
      * current chunk pipleine
      
      * cast boxes in fp32
      
      * nit
      
      * current updates
      
      * piepleine works
      
      * fixup
      
      * clean up config
      
      * fix slow tests
      
      * fix slow tests
      
      * clean up
      
      * update doc and pipeline
      
      * adds more slow tests
      
      * fix slow tests
      
      * cleaning
      
      * tests pass
      
      * add docstring
      
      * fix copies
      
      * clean up
      
      * support batch of images
      
      * style
      
      * dummy is needed, add tests
      
      * fix slow tests
      
      * fix CI
      
      * update
      
      * adds more tests
      
      * fixes
      
      * fixes
      
      * fixup
      
      * fixes
      
      * few fixes
      
      * filter
      
      * few fixes
      
      * some refactor
      
      * touches finales
      
      * fix
      
      * style
      
      * remove pipeline files
      
      * fixes nits
      
      * revert pipeline changes
      
      * fix test
      
      * fixup
      
      * remove automodel for automatic mask generation
      
      * fix failing torch tests
      
      * update mdx
      
      * revert removal of `MODEL_FOR_AUTOMATIC_MASK_GENERATION_MAPPING`
      
      * update sam config based on review
      Co-authored-by: default avataramyeroberts <aeroberts4444@gmail.com>
      Co-authored-by: default avatarsgugger <sylvain.gugger@gmail.com>
      
      * update low_resolution_masks -> pred_masks
      inti ln with layer_norm_eps
      add_decomposed_rel_pos doc
      forward doc of SamForMaskGeneration
      
      * update processor docstring
      
      * remove image processor import empty
      
      * update for testing
      
      * output vision hidden states + clean recomm
      also test all iou values
      
      * fixup
      
      * fixup
      
      * remove unused
      
      * Update src/transformers/models/sam/modeling_sam.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/sam/image_processing_sam.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * nits
      
      * fix
      
      * fix CI tests and slow tests
      
      * replace with Amy's processor
      
      * clearer docstring
      
      * add `SamVisionNeck`
      
      * refactor - all CI tests should pass
      
      * fix broken import on Gcolab
      
      * few fixes here and there
      
      * fix another bug
      
      * fix more bugs
      
      * update and merge
      
      * correct ckpt
      
      * address comments
      
      * add tips
      
      * revert
      
      * fix docstring
      
      * replace with `SamModel`
      
      * make fixup
      
      * add support for bathed images and batch ed points
      
      * make fixup this time, really
      
      * make fixup again and again
      
      * few fixes here and there, this should be the touche finale
      
      * Update docs/source/en/model_doc/sam.mdx
      
      * fixup
      
      * correct checkpoints
      
      * correct name
      
      * rm unneeded file
      
      * add notebook
      
      ---------
      Co-authored-by: default avataryounesbelkada <younesbelkada@gmail.com>
      Co-authored-by: default avataramyeroberts <aeroberts4444@gmail.com>
      Co-authored-by: default avatarsgugger <sylvain.gugger@gmail.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: default avatarYounes Belkada <49240599+younesbelkada@users.noreply.github.com>
      474bf508
    • Yih-Dar's avatar
      Remove some pipeline skip cases (#22865) · 06bab003
      Yih-Dar authored
      
      
      fix
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      06bab003
  16. 18 Apr, 2023 3 commits
    • Sylvain Gugger's avatar
      Use code on the Hub from another repo (#22814) · 5f9b825c
      Sylvain Gugger authored
      * initial work
      
      * Add other classes
      
      * Refactor code
      
      * Move warning and fix dynamic pipeline
      
      * Issue warning when necessary
      
      * Add test
      
      * Do not skip auto tests
      
      * Fix failing tests
      
      * Refactor and address review comments
      
      * Address review comments
      5f9b825c
    • Joao Gante's avatar
      Generate: Add assisted generation (#22211) · 78cda46f
      Joao Gante authored
      * working mvp
      
      * remove breakpoint
      
      * fix commit
      
      * standardize outputs
      
      * tmp commit
      
      * tests almost ready
      
      * tmp commit
      
      * skip a few models
      
      * Add streaming; Docs and examples
      
      * document limitations
      
      * PR commits
      
      * Amy PR comments
      78cda46f
    • Matthijs Hollemans's avatar
      TTS fine-tuning for SpeechT5 (#21824) · ac2bc50a
      Matthijs Hollemans authored
      
      
      * wrong argument name
      
      * append eos_token_id
      
      * all tokenizers need mask and ctc_blank tokens
      
      * remove reduction factor from feature extractor
      
      * add proper TTS loss
      
      * did shifting the wrong way around
      
      * mask out padded portions
      
      * remove logits again (don't really need it)
      
      * fix unit tests
      
      * fixup
      
      * pad also returns the decoder attention mask, since that's useful to have
      
      * clean up feature extractor logic
      
      * pad can handle TTS task too
      
      * remove stop_labels from loss calculation
      
      * simplify logic
      
      * fixup
      
      * do -100 masking properly
      
      * small STFT optimization (calculate mel filterbanks only once)
      
      * replace torchaudio fbanks with audio_utils
      
      * remove torchaudio dependency
      
      * simplify & speed up the STFT
      
      * don't serialize window and mel filters
      
      * output cross attentions when generating speech
      
      * add guided attention loss
      
      * fix failing test
      
      * Update src/transformers/models/speecht5/feature_extraction_speecht5.py
      Co-authored-by: default avatarSanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      
      * Update src/transformers/models/speecht5/modeling_speecht5.py
      Co-authored-by: default avatarSanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      
      * change type annotation of attention_mask to LongTensor
      
      * extract loss into class
      
      * remove unused frame_signal_scale argument
      
      * use config object in loss class
      
      * fix type annotations in doc comments
      
      * change optional to just bool
      
      * implement missing tokenizer method
      
      * add deprecation warning
      
      * Update src/transformers/models/speecht5/feature_extraction_speecht5.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/speecht5/feature_extraction_speecht5.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * add deprecation warning for stop_labels
      
      ---------
      Co-authored-by: default avatarSanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      ac2bc50a
  17. 17 Apr, 2023 3 commits