1. 15 Jul, 2022 1 commit
  2. 12 May, 2022 1 commit
  3. 20 Apr, 2022 1 commit
  4. 23 Mar, 2022 1 commit
    • Reorganize file utils (#16264) · 4975002d
      Sylvain Gugger authored
      * Split file_utils in several submodules
      
      * Fixes
      
      * Add back more objects
      
      * More fixes
      
      * Who exactly decided to import that from there?
      
      * Second suggestion to code with code review
      
      * Revert wrong move
      
      * Fix imports
      
      * Adapt all imports
      
      * Adapt all imports everywhere
      
      * Revert this import, will fix in a separate commit
  5. 28 Jan, 2022 1 commit
    • Fixing support `batch_size` and `num_return_sequences` in `text-generation` pipeline (#15318) · 06107541
      Nicolas Patry authored
      * Fixing support `batch_size` and `num_return_sequences` in
      `text-generation` pipeline
      
      And `text2text-generation` too.
      
      The bug was caused by the batch_size containing both the incoming batch
      **and** the generated `num_sequences`.
      
      The fix simply consists of splitting these two back into
      separate dimensions.
      
      * TF support.
      
      * Odd backward compatibility script in the way.
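
The dimension split described in this commit can be illustrated with a minimal sketch. It is not the pipeline's actual code: `split_generated` is a hypothetical helper, and it assumes the generated rows are ordered so that all candidates for the first prompt come first, then all candidates for the second, and so on.

```python
def split_generated(sequences, batch_size):
    """Regroup a flat list of generated sequences by input prompt.

    Assumption: `sequences` holds batch_size * num_return_sequences rows,
    with the candidates for each prompt stored contiguously.
    """
    if batch_size <= 0 or len(sequences) % batch_size != 0:
        raise ValueError("len(sequences) must be a positive multiple of batch_size")
    num_return_sequences = len(sequences) // batch_size
    # Slice the flat "batch * candidates" dimension into one group per prompt.
    return [
        sequences[i * num_return_sequences : (i + 1) * num_return_sequences]
        for i in range(batch_size)
    ]


# Two prompts, three candidates each, flattened into six rows.
flat = [[1, 2], [1, 3], [1, 4], [5, 6], [5, 7], [5, 8]]
grouped = split_generated(flat, batch_size=2)
```

Each element of `grouped` then holds the candidates for one prompt, which is the shape a caller expects when combining `batch_size` with `num_return_sequences`.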
  6. 27 Dec, 2021 1 commit
    • Doc styler v2 (#14950) · 87e6e4fe
      Sylvain Gugger authored
      * New doc styler
      
      * Fix issue with args at the start
      
      * Code sample fixes
      
      * Style code examples in MDX
      
      * Fix more patterns
      
      * Typo
      
      * Typo
      
      * More patterns
      
      * Do without black for now
      
      * Get more info in error
      
      * Docstring style
      
      * Re-enable check
      
      * Quality
      
      * Fix add_end_docstring decorator
      
      * Fix docstring
  7. 21 Dec, 2021 1 commit
    • Mass conversion of documentation from rst to Markdown (#14866) · 27b3031d
      Sylvain Gugger authored
      * Convert docstrings of all configurations and tokenizers
      
      * Processors and fixes
      
      * Last modeling files and fixes to models
      
      * Pipeline modules
      
      * Utils files
      
      * Data submodule
      
      * All the other files
      
      * Style
      
      * Missing examples
      
      * Style again
      
      * Fix copies
      
      * Say bye bye to rst docstrings forever
  8. 06 Dec, 2021 1 commit
  9. 29 Oct, 2021 1 commit
    • Adding `handle_long_generation` parameters for `text-generation` pipeline. (#14118) · dc540dd3
      Nicolas Patry authored
      * Adding `handle_long_generation` parameters for `text-generation` pipeline.
      
      * More error handling
      
      * Fixing tests by dropping TF support for this functionality; it needs
      `max_new_tokens` to make it possible to understand the user's intent.
      Otherwise, `max_length` == `tokenizer.model_max_length` <
      `input_ids.shape[0]`.
      
      * Fixing doc ?
      
      * Doc ?
      
      * Remove link from doc.
      
      * Caught an issue on RoBERTa.
      
      * Damn doc.
      
      * Non BC proposal ?
      
      * Cleaning the fix ?
      
      * Finally using only a test override.
      
      * Don't need to modify this.
      
      * Bad print.
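
The problem this commit addresses is a prompt that is already longer than the model window, which leaves no room to generate. One plausible way to handle it is sketched below: drop the oldest prompt tokens so that the prompt plus `max_new_tokens` fits. The commit message does not spell out the strategy, so the left truncation and the helper name `make_room_for_generation` are assumptions for illustration.

```python
def make_room_for_generation(input_ids, model_max_length, max_new_tokens):
    """Keep only the most recent prompt tokens so generation still fits.

    Hypothetical helper: assumes the caller states intent via max_new_tokens.
    """
    if max_new_tokens is None:
        raise ValueError("max_new_tokens is required to know how much room to leave")
    keep = model_max_length - max_new_tokens
    if keep <= 0:
        raise ValueError("max_new_tokens leaves no room for the prompt")
    # Negative slicing keeps the last `keep` tokens; shorter prompts are untouched.
    return input_ids[-keep:]


prompt_ids = list(range(10))  # stand-in for a tokenized prompt
trimmed = make_room_for_generation(prompt_ids, model_max_length=8, max_new_tokens=3)
```

With `max_length` alone the intent is ambiguous once the prompt exceeds the window, which is why the sketch insists on `max_new_tokens`.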
  10. 05 Oct, 2021 1 commit
  11. 10 Sep, 2021 1 commit
    • [Large PR] Entire rework of pipelines. (#13308) · c63fcabf
      Nicolas Patry authored
      
      
      * Enabling dataset iteration on pipelines.
      
      Unifying parameters under `set_parameters` function.
      
      Small fix.
      
      Last fixes after rebase
      
      Remove print.
      
      Fixing text2text `generate_kwargs`
      
      No more `self.max_length`.
      
      Fixing tf only conversational.
      
      Consistency in start/stop index over TF/PT.
      
      Speeding up drastically on TF (nasty bug where max_length would increase
      a ton.)
      
      Adding test for support for non fast tokenizers.
      
      Fixing GPU usage on zero-shot.
      
      Fix working on Tf.
      
      Update src/transformers/pipelines/base.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      Update src/transformers/pipelines/base.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      Small cleanup.
      
      Remove all asserts + simple format.
      
      * Fixing audio-classification for large PR.
      
      * Overly explicit null checking.
      
      * Encapsulating GPU/CPU pytorch manipulation directly within `base.py`.
      
      * Removed internal state for parameters of the pipeline.
      
      Instead of implicitly overriding internal state, we moved
      to real named arguments on every `preprocess`, `_forward`, and
      `postprocess` function.
      
      `_sanitize_parameters` is now used to split all kwargs
      of both `__init__` and `__call__` into these 3 kinds of named parameters.
      
      * Move import warnings.
      
      * Small fixes.
      
      * Quality.
      
      * Another small fix, using the CI to debug faster.
      
      * Last fixes.
      
      * Last fix.
      
      * Small cleanup of tensor moving.
      
      * is not None.
      
      * Adding a bunch of docs + an iteration test.
      
      * Fixing doc style.
      
      * KeyDataset = None guard.
      
      * Removing the CUDA test for pipelines (was testing).
      
      * Even more simple iteration test.
      
      * Correct import.
      
      * Long day.
      
      * Fixes in docs.
      
      * [WIP] migrating object detection.
      
      * Fixed the target_size bug.
      
      * Fixup.
      
      * Bad variable name.
      
      * Fixing `ensure_on_device` respects original ModelOutput.
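
The `_sanitize_parameters` split described in this commit can be pictured with a toy class. This is a sketch under assumptions, not the library's base class: `SketchPipeline` and the parameters `prefix`, `repeat`, and `uppercase` are hypothetical, and the "model" is plain string repetition.

```python
class SketchPipeline:
    """Toy pipeline showing kwargs routed to three named-argument stages."""

    def __init__(self, **kwargs):
        # kwargs given at construction time become per-stage defaults.
        self._pre, self._fwd, self._post = self._sanitize_parameters(**kwargs)

    def _sanitize_parameters(self, prefix=None, repeat=None, uppercase=None):
        # Sort every recognized kwarg into the stage that consumes it.
        preprocess, forward, postprocess = {}, {}, {}
        if prefix is not None:
            preprocess["prefix"] = prefix
        if repeat is not None:
            forward["repeat"] = repeat
        if uppercase is not None:
            postprocess["uppercase"] = uppercase
        return preprocess, forward, postprocess

    def preprocess(self, text, prefix=""):
        return prefix + text

    def _forward(self, model_input, repeat=1):
        # Stand-in for a model call.
        return model_input * repeat

    def postprocess(self, model_output, uppercase=False):
        return model_output.upper() if uppercase else model_output

    def __call__(self, text, **kwargs):
        pre, fwd, post = self._sanitize_parameters(**kwargs)
        # Call-time kwargs override the defaults without mutating them.
        pre = {**self._pre, **pre}
        fwd = {**self._fwd, **fwd}
        post = {**self._post, **post}
        return self.postprocess(self._forward(self.preprocess(text, **pre), **fwd), **post)


pipe = SketchPipeline(prefix=">> ")
default_result = pipe("ab")
override_result = pipe("ab", repeat=2, uppercase=True)
```

Because call-time parameters are merged into fresh dictionaries, a call with overrides does not change the behavior of later calls, which is the point of removing implicit internal state.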
  12. 31 Aug, 2021 1 commit
  13. 26 Aug, 2021 1 commit
  14. 30 Mar, 2021 1 commit
  15. 29 Jan, 2021 1 commit
    • Adding a new `return_full_text` parameter to TextGenerationPipeline. (#9852) · c2d0ffec
      Nicolas Patry authored
      * Adding a new `return_full_text` parameter to TextGenerationPipeline.
      
      In text-generation, the input is sometimes used as prompting text.
      In that context, prefixing `generated_text` with the actual input
      forces the caller to take an extra step to remove it.
      
      The proposed change adds a new parameter (for backward compatibility),
      `return_full_text`, that enables the caller to prevent adding the prefix.
      
      * Doc quality.
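
The effect of `return_full_text` can be sketched at the string level. This is an illustration only, not the pipeline's implementation: `format_generation` is a hypothetical helper, and defaulting to `True` is an assumption based on the backward-compatibility remark in the commit message.

```python
def format_generation(prompt, full_text, return_full_text=True):
    """Return the generated text with or without the prompt prefix."""
    if return_full_text:
        # Assumed default: prompt followed by the continuation.
        return full_text
    if full_text.startswith(prompt):
        # Strip the prompt so the caller only sees the newly generated part.
        return full_text[len(prompt):]
    return full_text


with_prompt = format_generation("Hello", "Hello world")
without_prompt = format_generation("Hello", "Hello world", return_full_text=False)
```

With the flag set to `False`, the caller no longer needs the extra step of removing the prompt by hand.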
  16. 11 Jan, 2021 1 commit
    • Enable TruncationStrategy override for pipelines (#9432) · d20e9c72
      Nicolas Patry authored
      * Enable TruncationStrategy override for pipelines
      
      * Update isort.
      
      * Fixing test
      
      * Fixing text_generation pipeline.
      
      * Using same DummyTok as other PR for easier merge later.
      
      * Some more import guards.
      
      * Remove bogus file.
      
      * Do not pass `generate_kwargs` to `_parse_and_tokenize`.
      @patrickvonplaten
      
      * Removed DummyTok.
      
      * Doc quality.
  17. 06 Jan, 2021 1 commit
    • [Refactor] Splitting pipelines.py into its own module. (#9279) · 090d28e3
      Nicolas Patry authored
      * Splitting pipelines into its own module.
      
      * Moving everything into base.py
      
      * Moving FeatureExtractionPipeline into its own file.
      
      * TextGenerationPipeline.
      
      * TextClassificationPipeline
      
      * ZeroShot + get_framework import.
      
      * FillMaskPipeline
      
      * NerPipeline + TokenClassificationPipeline
      
      * QuestionAnsweringPipeline
      
      * TableQuestionAnsweringPipeline
      
      * ConversationalPipeline
      
      * Text2TextGenerationPipeline, TranslationPipeline, SummarizationPipeline
      
      * Typo import fix.
      
      * Relative imports.