1. 07 Dec, 2022 5 commits
  2. 06 Dec, 2022 16 commits
  3. 05 Dec, 2022 19 commits
    • Fix test for file not found (#20604) · 5764efe5
      Sylvain Gugger authored
    • Split autoclasses on modality (#20559) · 720e9599
      Steven Liu authored
      * split autoclasses on modality
      
      * apply review
      
      * auto classes
    • Fix code sample in preprocess (#20561) · 7d1c1c5b
      Steven Liu authored
      * change to image_processor
      
      * apply review
    • README in Hindi 🇮🇳 (#20097) · 73ec12ea
      Sourab Mangrulkar authored
      * Created README_hd.md
      
      A Hindi Translation for README
      
      * updated check_copies.py
      
      Added the Proper info for Hindi Translation of README File !
      
      * updated README_hd.md
      
      Fixed some translation issues !
      
      * Update README_hd.md
      
      * Update README_hd.md
      
      * Update README_hd.md
      
      * fixing 🐛 for `make fix-copies`
      
      * run `make fix-copies`
      
      * `make fix-copies` 😅

      Co-authored-by: Akshit Gulyan <103456810+AkshitGulyan@users.noreply.github.com>
    • Add-whisper-conversion (#20600) · aef9aac3
      Arthur authored
      * add whisper conversion script
      
      * update conversion script
      
      * update arg names
      
      * fix missing encoder_ffn_dim
      
      * fixup
      
      * ast nits
    • [Whisper] Fix decoder ids methods (#20599) · 74fb524e
      Sanchit Gandhi authored
      * [Whisper] Fix decoder ids methods
      
      * enum property
    • [Vision] `.to` function for ImageProcessors (#20536) · ef0f85cd
      Younes Belkada authored
      * add v1 with tests
      
      * add checker
      
      * simplified version
      
      * update docstring
      
      * better version
      
      * fix docstring + change order
      
      * make style
      
      * tests + change conditions
      
      * final tests
      
      * modify docstring
      
      * Update src/transformers/feature_extraction_utils.py
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * replace by `ValueError`
      
      * fix logic
      
      * apply suggestions
      
      * `dtype` is not needed
      
      * adapt suggestions
      
      * remove `_parse_args_to_device`
      Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
    • Replace `set-output` by `$GITHUB_OUTPUT` (#20547) · 67d32f46
      Yih-Dar authored
      * remove set-output
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
    • Fix whisper and speech to text doc (#20595) · 9763f829
      Arthur authored
      * Fix whisper and speech to text doc
      # What does this PR do?
      Previously the documentation was badly indented for both models and indicated that
      > If `decoder_input_ids` and `decoder_inputs_embeds` are both unset, `decoder_inputs_embeds` takes the value of `inputs_embeds`.
      This is only valid for the forward pass of the `ForConditionalGeneration` model, not for the model alone.
      
      * other fixes
    • Fix link to table transformer detection microsoft model (#20560) · eefae413
      Francisco Kurucz authored
      * Fix link to table transformer detection microsoft model
      
      * Fix doc styles
    • Francisco Kurucz's avatar
      ac3bccdc
    • Add RemBERT ONNX config (#20520) · 87282cb7
      Erin authored
      * rembert onnx config

      * formatting
      Co-authored-by: Ho <erincho@bcd0745f972b.ant.amazon.com>
    • ESM openfold_utils type hints (#20544) · afe2a466
      Matthew Hoffman authored
      * add type annotations for esm chunk_utils
      
      use isinstance builtin instead of 'type(x) is y'; add assertions to aid in type inferencing; use bools instead of ints in _get_minimal_slice_set for improved type clarity; refactor to avoid re-assigning to the same variable with a different type
      
      * add type annotations for esm data_transforms
      
      refactor to avoid re-assigning to the same variable with a different type
      
      * add type annotations for esm feats utils
      
      refactor to avoid re-assigning to the same variable with a different type
      
      * add type annotations for esm loss utils
      
      * add/fix type annotations for esm rigid_utils
      
      refactor to avoid re-assigning to the same variable with a different type; fix Callable, Tuple type hints; match conditional structure to other methods; fix return type on Rotation.cat and Rotation.unsqueeze
      
      * add type annotations for esm tensor_utils
      
      overload for tree_map; use isinstance builtin instead of 'type(x) is y'; export dict_multimap, flatten_final_dims, permute_final_dims in openfold_utils
      
      * add type annotations for esm protein utils
      
      add FIXME for attempted string mutation; add missing None check in get_pdb_headers; fix potentially unbound variable 'chain_tag' in to_pdb; modify get_pdb_headers return type
      
      * add type annotations for esm residue constants
      
      hints on collection constants; remove magic trailing comma to reduce number of lines; change list -> tuple for rigid_group_atom_positions for improved hinting
      
      * code style fixup
      Co-authored-by: Matt <rocketknight1@gmail.com>
    • Make convert_to_onnx runnable as script again (#20009) · 8ea6694d
      Mihai Cernusca authored
      * Make convert_to_onnx runnable as script again
      
      Fix `convert_graph_to_onnx.py` relative import so it can be run as a script again.
      
      * Trigger CI
    • cross platform from_pretrained (#20538) · 84c9bf74
      Arthur authored
      * add support for `from_pt`
      
      * add tf_flax utility file
      
      * Update src/transformers/modeling_tf_flax_utils.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * remove flax related modifications
      
      * add test
      
      * remove FLAX related commits
      
      * fixup
      
      * remove safetensor todos
      
      * revert deletion
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
    • Ci-whisper-asr (#20588) · 538e5248
      Arthur authored
      * Expected output for the test changed
      
      * fix failing asr test
    • Add BioGPT (#20420) · 13e73668
      Kamal Raj Kanakarajan authored
      * biogpt initial commit
      
      * updated init
      
      * fix faster decoding with use_cache
      
      * 1. fix input_ids and input_embeds with correct device
      2. added _keys_to_ignore_on_load_missing
      3. updated prepare_inputs_for_generation
      
      * add activation_dropout and scale_embedding
      
      * replace fsmt attention with bart attention
      
      * added test
      
      * run make fix-copies
      
      * doc init and fix build
      
      * updated README with proper information
      
      * 1. added tips to docs
      2. updated BioGptTokenizer func
      
      * 1. added tokenizer test
      2. refactor tokenizer
      
      * make fixup
      
      * add biogpt fairseq to hf converter
      
      * updated layer names more similar to original checkpoints
      
      * config update doc string and set defaults
      
      * added "#copied" from bart model and updated doc strings
      
      * enable model_input_names in tokenizer
      
      * 1. positional embedding depending on attention_mask
      2. added attention mask to prepare for generation
      
      * added test to verify past and generation
      
      * BioGptLMHeadModel -> BioGptForCausalLM
      
      * fix typo
      
      * tokenization and test
      Copyright and updated assertion
      
      * updated Copyright and
      one func at time in line
      
      * Copyright updates and
      minor doc fix
      
      * replace assertion with ValueError
      
      * rm extra space
      
      * added code syntax
      
      * revert cmnt position change
      
      * add tokenizer to auto
      
      * updated doc string
      
      * tokenizer doc string update
      
      * biogpt hub model update to microsoft/biogpt
      
      * make fixup
      
      * rm cmnt to fix flake8 5.0.4 vs 6 error