- 25 Feb, 2023 1 commit
-
-
Moshe Berchansky authored
* Fix resume_from_checkpoint for deepspeed, by ensuring that the deepspeed engine is the one to load the checkpoint. * Empty commit to trigger CI * Removed deepspeed skipping inside the _load_from_checkpoint function, as it is obsolete * another adjustment * Trigger CI * trigger circleci * style --------- Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com> Co-authored-by:
Stas Bekman <stas@stason.org>
-
- 24 Feb, 2023 12 commits
-
-
Sanchit Gandhi authored
-
Yi Heng Lim authored
* fix gradient checkpointing bug * fix gradient checkpointing bug * ran make fix-copies * fixed bug * fixed bug
-
Kashif Rasul authored
* updated expected * prediction_length fix * prediction_length default value * default prediction_length 24 * revert back prediction_length default * move prediction_length test
-
Joao Gante authored
Generate - update cookie cutters to not initialize cache with training and gradient checkpointing (#21759)
-
Arthur authored
* fix history * input_features instead of input ids for TFWhisper doctest * use translate instead of transcribe
-
bofeng huang authored
* Return and rescale attention_mask * Add SpecAugment to Whisper modeling * Fix test * Update docstring * Add SpecAug related parameters to model config * Add the _mask_input_features function to doc * Fix quality * Apply suggestions from code review Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * Remove dev comments * Add test * Resolve conflict * feat: mask {feature, time} prob fast tests * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by:
sanchit-gandhi <sanchit@huggingface.co> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sanchit Gandhi authored
-
Arthur authored
* Different behavior in DistilBERT when using "inputs_embeds" Fixes #21089 * fix failing test
-
Sanchit Gandhi authored
* [Examples] Generalise run audio classification for log-mel models * batch feature extractor * make style
-
Shubhamai authored
* [flax] adding support for batch norm layers * fixing bugs related to pt+flax integration * cleanup, batchnorm support in sharded pt to flax * support for batchnorm tests in pt+flax integration * simplifying checking batch norm layer
-
Connor Henderson authored
* fix: Change is_last chunk calc and add conditional break * format fix * account for 0 and full stride_rights, add comment * add new test * make style * update slow whisper asr test timestamps * use nested_simplify on output and round timestamp to hundredths place
-
Clémentine Fourrier authored
* Removed useless check for backend * fix style check for graphormer * Reverted change and corrected requires_backend for cython * code qual
-
- 23 Feb, 2023 10 commits
-
-
Stas Bekman authored
* [deepspeed tests] fix issues introduced by #21700 * fix * fix
-
Maria Khalusova authored
* troubleshooting guide: added an error description for missing auto-mapping * minor polishing * changed the example * Apply suggestions from code review Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/troubleshooting.mdx Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Batese2001 authored
* Ran Black formatting * Added imports and reformatted * Update src/transformers/models/encoder_decoder/modeling_tf_encoder_decoder.py --------- Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
-
ydshieh authored
-
Joao Gante authored
-
Younes Belkada authored
* fix bug * forward contrib credits from discussions * change logic --------- Co-authored-by: edbeeching <edbeeching@users.noreply.github.com>
-
Yih-Dar authored
* Update expected output values, as Hub repo files are updated * Update expected output values, as librosa went from 0.9.2 to 0.10.0 on CI docker * fix * update one more --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
update Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Naga Sai Abhinay authored
* Add subfolder support * Add kwarg docstring * formatting fix * Add test
-
Thomas Paviot authored
-
- 22 Feb, 2023 12 commits
-
-
Maria Khalusova authored
added Open in Colab to task guides
-
Matt authored
* Stop doing dict-things to non-dict inputs * Add a debug check * Add a debug check * Remove debug checks, looks good now! * make fixup
-
Sanchit Gandhi authored
* [SpeechT5HifiGan] Handle batched inputs * fix docstring * rebase and new ruff style
-
Yih-Dar authored
* fix * skip test_model_parallelism * skip test_model_parallelism --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* remove .parameters()).device * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
Change doc example for BigBirdForQuestionAnswering Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
remove from doctest list to avoid GPU OOM Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
* Respect documentation on passive log level * Fix test and set log level in examples * Add doc
-
Sylvain Gugger authored
-
Younes Belkada authored
fix typo
-
Aaron Gokaslan authored
-
Kashif Rasul authored
* added loc and scale outputs from scalers * fix typo * fix tests * fixed formatting * initial StdScaler * move scaling to optional str * calculate std feature for scalers * undid change as it does not help * added StdScaler with weights * added input projection layer and d_model hyperparam * use linear proj * add back layernorm_embedding * add sin-cos pos embeddings * updated scalers * formatting * fix type * fixed test * fix repeated_past_values cal. * fix when keepdim=false * fix default_scale * backward compatibility of scaling config * update integration test expected output * fix style * fix docs * use the actual num_static_real_features in feature_dim cal * clarified docs * Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * prediction_length is not optional * fix for reviewer * Update src/transformers/models/time_series_transformer/configuration_time_series_transformer.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * get rid of un-needed new lines * fix doc * remove unneeded new lines * fix style * static_categorical_features and static_real_features are optional * fix integration test * Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * fixing docs for multivariate setting * documentation for generate --------- Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 21 Feb, 2023 5 commits
-
-
mollerup23 authored
* Adding type hints to call() functions in this file * make fixup * Update src/transformers/models/marian/modeling_tf_marian.py * Update src/transformers/models/marian/modeling_tf_marian.py * Update src/transformers/models/marian/modeling_tf_marian.py * Update src/transformers/models/marian/modeling_tf_marian.py * Update src/transformers/models/marian/modeling_tf_marian.py * Update src/transformers/models/marian/modeling_tf_marian.py * Update src/transformers/models/marian/modeling_tf_marian.py * Update src/transformers/models/marian/modeling_tf_marian.py --------- Co-authored-by:
Matt <rocketknight1@gmail.com> Co-authored-by:
Matt <Rocketknight1@users.noreply.github.com>
-
Maria Khalusova authored
* added resources: links to task guides that support these models * minor polishing * conflict resolved * link fix * Update docs/source/en/model_doc/vision-encoder-decoder.mdx Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Yih-Dar authored
* fix tvlt ci * fix tvlt ci * fix tvlt ci --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
Fix get_class_in_module Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* Add test * Fix GITProcessor * Update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-