"docs/source/en/model_doc/efficientformer.md" did not exist on "78a53d59cb6fa444a95d6be4d15fb3a25e6a8a2e"
  1. 13 Sep, 2023 1 commit
  2. 05 Sep, 2023 1 commit
  3. 25 Aug, 2023 1 commit
  4. 11 Jul, 2023 1 commit
  5. 14 Apr, 2023 1 commit
  6. 06 Apr, 2023 1 commit
  7. 27 Mar, 2023 3 commits
  8. 13 Mar, 2023 1 commit
  9. 06 Feb, 2023 1 commit
    • Sylvain Gugger's avatar
      Update quality tooling for formatting (#21480) · 6f79d264
      Sylvain Gugger authored
      * Result of black 23.1
      
      * Update target to Python 3.7
      
      * Switch flake8 to ruff
      
      * Configure isort
      
      * Configure isort
      
      * Apply isort with line limit
      
      * Put the right black version
      
      * adapt black in check copies
      
      * Fix copies
      6f79d264
  10. 31 Jan, 2023 1 commit
  11. 18 Oct, 2022 1 commit
  12. 01 Sep, 2022 1 commit
  13. 06 Jul, 2022 1 commit
  14. 22 Jun, 2022 1 commit
    • Eran Hirsch's avatar
      Add logits_processor parameter, used by `generate`, to `Seq2SeqTrainer`... · 13570381
      Eran Hirsch authored
      Add logits_processor parameter, used by `generate`, to `Seq2SeqTrainer` methods `evaluate` and `predict` (#17805)
      
      * Add logits_processor parameter, used by `generate`, to `Seq2SeqTrainer` methods `evaluate` and `predict`
      
      * Add all generate parameters to `Seq2SeqTrainer`, and also to `QuestionAnsweringSeq2SeqTrainer` which overrides it
      
      * Remove `self._num_beams` from trainer classes
      
      * - Run fixup
      - Fix "Constraint" not exposed
      - Fix synced_gpus to actually read from param
      
      * Use kwargs
      
      * Copy kwargs before making changes to it
      
      * Fix style issues unused imports
      13570381
  15. 25 May, 2022 1 commit
  16. 05 Apr, 2022 1 commit
    • John Giorgi's avatar
      Add global_attention_mask to gen_kwargs (#16485) · b33ab4eb
      John Giorgi authored
      If global_attention_mask is found in the models inputs (used by certain
      models, like LED) in the prediction_step method of Seq2SeqTrainer,
      it is added to the gen_kwargs, which are passed to model.decode().
      This allows us to properly set the global attention when decoding.
      b33ab4eb
  17. 10 Feb, 2022 1 commit
  18. 11 Jan, 2022 1 commit
  19. 27 Dec, 2021 2 commits
    • Stas Bekman's avatar
      [doc] :obj: hunt (#14954) · e13f72fb
      Stas Bekman authored
      * redo sans examples
      
      * style
      e13f72fb
    • Sylvain Gugger's avatar
      Doc styler v2 (#14950) · 87e6e4fe
      Sylvain Gugger authored
      * New doc styler
      
      * Fix issue with args at the start
      
      * Code sample fixes
      
      * Style code examples in MDX
      
      * Fix more patterns
      
      * Typo
      
      * Typo
      
      * More patterns
      
      * Do without black for now
      
      * Get more info in error
      
      * Docstring style
      
      * Re-enable check
      
      * Quality
      
      * Fix add_end_docstring decorator
      
      * Fix docstring
      87e6e4fe
  20. 23 Dec, 2021 1 commit
  21. 21 Dec, 2021 1 commit
    • Sylvain Gugger's avatar
      Mass conversion of documentation from rst to Markdown (#14866) · 27b3031d
      Sylvain Gugger authored
      * Convert docstrings of all configurations and tokenizers
      
      * Processors and fixes
      
      * Last modeling files and fixes to models
      
      * Pipeline modules
      
      * Utils files
      
      * Data submodule
      
      * All the other files
      
      * Style
      
      * Missing examples
      
      * Style again
      
      * Fix copies
      
      * Say bye bye to rst docstrings forever
      27b3031d
  22. 20 Dec, 2021 1 commit
  23. 07 Dec, 2021 2 commits
  24. 29 Oct, 2021 1 commit
  25. 28 Oct, 2021 1 commit
    • NielsRogge's avatar
      Fix EncoderDecoderModel classes to be more like BART and T5 (#14139) · ac12a5ae
      NielsRogge authored
      * First draft
      
      * Make tuple output more readable
      
      * Replace assertions by value errors
      
      * Make it possible to predict_with_generate for vision and speech models
      
      * Adapt Seq2SeqTrainer to work with VisionEncoderDecoder/SpeechEncoderDecoder
      
      * Add deprecation warning
      
      * Add copied from statements to vision and speech encoder decoders
      
      * Fix failing test
      
      * Apply @patrickvonplaten's suggestion
      
      * Use reshape instead of view for consistency
      ac12a5ae
  26. 31 Aug, 2021 1 commit
  27. 19 Aug, 2021 1 commit
  28. 27 Jul, 2021 1 commit
  29. 02 Jun, 2021 1 commit
  30. 26 Apr, 2021 1 commit
  31. 08 Apr, 2021 1 commit
    • Stas Bekman's avatar
      [DeepSpeed] ZeRO Stage 3 (#10753) · c6d66484
      Stas Bekman authored
      
      
      * synced gpus
      
      * fix
      
      * fix
      
      * need to use t5-small for quality tests
      
      * notes
      
      * complete merge
      
      * fix a disappearing std stream problem
      
      * start zero3 tests
      
      * wip
      
      * tune params
      
      * sorting out the pre-trained model loading
      
      * reworking generate loop wip
      
      * wip
      
      * style
      
      * fix tests
      
      * split the tests
      
      * refactor tests
      
      * wip
      
      * parameterized
      
      * fix
      
      * workout the resume from non-ds checkpoint pass + test
      
      * cleanup
      
      * remove no longer needed code
      
      * split getter/setter functions
      
      * complete the docs
      
      * suggestions
      
      * gpus and their compute capabilities link
      
      * Apply suggestions from code review
      Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
      
      * style
      
      * remove invalid paramgd
      
      * automatically configure zero3 params that rely on hidden size
      
      * make _get_resized_embeddings zero3-aware
      
      * add test exercising resize_token_embeddings()
      
      * add docstring
      Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
      c6d66484
  32. 01 Feb, 2021 1 commit
  33. 27 Dec, 2020 1 commit
  34. 22 Dec, 2020 1 commit