"docs/source/en/tasks/image_classification.mdx" did not exist on "867f3950fa908632ddb3564873293b620d73c2dc"
- 28 Feb, 2023 7 commits
-
-
Younes Belkada authored
* fix torchquant issue * add tests
-
anruijian authored
* fix tf random mask tokens probability * fix tf random mask tokens probability in collator for langauge modelling
-
Karim Foda authored
* Fix gradient checkpointing bug in gptneox * Fix gradient checkpointing bug in modeling_imagegpt.py * Revert gpt neox changes --------- Co-authored-by:Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Karim Foda authored
Co-authored-by:Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Andy Ehrenberg authored
-
saswatmeher authored
Co-authored-by:saswatmeher <saswatmeher@cse.iitb.ac.in>
-
Yih-Dar authored
Let's give TF a bit more love
❤ ️🙏 Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
- 27 Feb, 2023 13 commits
-
-
Stas Bekman authored
* logger.warning_once * style
-
Yih-Dar authored
fix quality with ruff 0.0.253 Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
-
Karim Foda authored
* Fix gradient checkpointing bug in gptneox * Remove use_cache block
-
fxmarty authored
fix nn.init.trunc_normal_ call on half data
-
fxmarty authored
* fix perceiver fp16 * hopefully fix tests
-
Younes Belkada authored
* add `accelerate` marker * add to docs * Update docs/source/en/testing.mdx
-
Arthur authored
* uint8 -> bool * fix copies * style * update test modeling commen when checking attention buffers * style * use logical not on random mask instead of subtraction with 1 * remove torch uint8 * quality * remove modified modeling utils * Update based on review Co-authored-by:
sgugger <sylvain.gugger@gmail.com> --------- Co-authored-by:
sgugger <sylvain.gugger@gmail.com>
-
Arthur authored
* add pipeline * update init * add zero shot to init * update inits and correct checkpoints * update base to support input features * add tests * Update src/transformers/pipelines/zero_shot_audio_classification.py Co-authored-by:
Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update src/transformers/pipelines/zero_shot_audio_classification.py Co-authored-by:
Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * update pieline code * use tiny checkpoint * nits and expected value with tiny model * style * last nit on tests values * fix styling * fix collate fn that was casting t float * update --------- Co-authored-by:
Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
-
Tianqi Zhang (张天启) authored
make concrete_args from outside available
-
Thomas Paviot authored
* fix wrong url * typos in english documentation
-
Julian Weber authored
Fix docstring gpt2 config
-
bofeng huang authored
* Override the decoding parameters of Seq2SeqTrainer * Fix quality * Fix max_length parameter * Fix quality * Remove redundant parameter max_length * Separate the preprocess of train and validation to use different max_target_length
-
- 25 Feb, 2023 1 commit
-
-
Moshe Berchansky authored
* Fix resume_from_checkpoint for deepspeed Fix resume_from_checkpoint for deepspeed, by ensuring that the deepspeed engine is the one to load the checkpoint. * Empty commit to trigger CI * Removed deepspeed skipping Removed deepspeed skipping inside the _load_from_checkpoint function, as it is obsolete * another adjustment * Trigger CI * trigger circleci * style --------- Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com> Co-authored-by:
Stas Bekman <stas@stason.org>
-
- 24 Feb, 2023 12 commits
-
-
Sanchit Gandhi authored
-
Yi Heng Lim authored
* fix gradient checkpointing bug * fix gradient checkpointing bug * ran make fix-copies * fixed bug * fixed bug
-
Kashif Rasul authored
* updated expected * prediction_length fix * prediction_length default value * default prediction_length 24 * revert back prediction_length default * move prediction_length test
-
Joao Gante authored
Generate - update cookie cutters to not initialize cache with training and gradient checkpointing (#21759)
-
Arthur authored
* fix history * input_features instead of input ids for TFWhisport doctest * use translate intead of transcribe
-
bofeng huang authored
* Return and rescale attention_mask * Add SpecAugment to Whisper modeling * Fix test * Update docstring * Add SpecAug related parameters to model config * Add the _mask_input_features function to doc * Fix quality * Apply suggestions from code review Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * Remove dev comments * Add test * Resolve conflict * feat: mask {feature, time} prob fast tests * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by:
sanchit-gandhi <sanchit@huggingface.co> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sanchit Gandhi authored
-
Arthur authored
* Different behavior in DistilBERT when using "inputs_embeds" Fixes #21089 * fix failing test
-
Sanchit Gandhi authored
* [Examples] Generalise run audio classification for log-mel models * batch feature extractor * make style
-
Shubhamai authored
* [flax] adding support for batch norm layers * fixing bugs related to pt+flax integration * cleanup, batchnorm support in sharded pt to flax * support for batchnorm tests in pt+flax integration * simplifying checking batch norm layer
-
Connor Henderson authored
* fix: Change is_last chunk calc and add conditional break * format fix * account for 0 and full stride_rights, add comment * add new test * make style * update slow whisper asr test timestamps * use nested_simplify on output and round timestamp to hundreths place
-
Clémentine Fourrier authored
* Removed useless check for backend * fix style check for graphormer * Reverted change and corrected requires_backend for cython * code qual
-
- 23 Feb, 2023 7 commits
-
-
Stas Bekman authored
* [deepspeed tests] fix issues introduced by #21700 * fix * fix
-
Maria Khalusova authored
* troubleshooting guide: added an error description for missing auto-mapping * minor polishing * changed the example * Apply suggestions from code review Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/troubleshooting.mdx Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Batese2001 authored
* Ran Black formatting * Added imports and reformatted * Update src/transformers/models/encoder_decoder/modeling_tf_encoder_decoder.py --------- Co-authored-by:Matt <Rocketknight1@users.noreply.github.com>
-
ydshieh authored
-
Joao Gante authored
-
Younes Belkada authored
* fix bug * forward contrib credits from discussions * change logic --------- Co-authored-by:edbeeching <edbeeching@users.noreply.github.com>
-
Yih-Dar authored
* Update expect output values - as Hub repo. files are updated * Update expect output values - as librosa is from 0.9.2 to 0.10.0 on CI docker * fix * update one more --------- Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-