1. 09 Sep, 2022 2 commits
  2. 08 Sep, 2022 1 commit
    • NielsRogge's avatar
      Add X-CLIP (#18852) · bb6f6d53
      NielsRogge authored
      * First draft
      
      * Improve conversion script
      
      * Make vision encoder work
      
      * More improvements
      
      * Improve conversion script
      
      * Fix quality
      
      * Add MultiframeIntegrationTransformer
      
      * More improvements
      
      * Make MiT output work
      
      * Fix quality
      
      * Add prompts generator
      
      * Add tests
      
      * Fix some tests
      
      * Fix some more tests
      
      * Fix more tests
      
      * Improve conversion script
      
      * Fix model outputs
      
      * Fix more tests
      
      * Add XClipProcessor
      
      * Use processor in conversion script
      
      * Fix integration test
      
      * Update README, fix docs
      
      * Fix all tests
      
      * Add MIT output to XClipOutput
      
      * Create better variable names
      
      * Rename XClip to XCLIP
      
      * Extend conversion script
      
      * Add support for large models
      
      * Add support for 16 frame models
      
      * Add another model'
      
      * Fix module issue
      
      * Apply suggestions from code review
      
      * Add figure to docs
      
      * Fix CLIPProcessor issue
      
      * Apply suggestions from code review
      
      * Delete file
      
      * Convert more checkpoints
      
      * Convert last checkpoint
      
      * Update nielsr to microsoft
      bb6f6d53
  3. 07 Sep, 2022 2 commits
    • Ankur Goyal's avatar
      Add DocumentQuestionAnswering pipeline (#18414) · 2ef77421
      Ankur Goyal authored
      
      
      * [WIP] Skeleton of VisualQuestionAnweringPipeline extended to support LayoutLM-like models
      
      * Fixup
      
      * Use the full encoding
      
      * Basic refactoring to DocumentQuestionAnsweringPipeline
      
      * Cleanup
      
      * Improve args, docs, and implement preprocessing
      
      * Integrate OCR
      
      * Refactor question_answering pipeline
      
      * Use refactored QA code in the document qa pipeline
      
      * Fix tests
      
      * Some small cleanups
      
      * Use a string type annotation for Image.Image
      
      * Update encoding with image features
      
      * Wire through the basic docs
      
      * Handle invalid response
      
      * Handle empty word_boxes properly
      
      * Docstring fix
      
      * Integrate Donut model
      
      * Fixup
      
      * Incorporate comments
      
      * Address comments
      
      * Initial incorporation of tests
      
      * Address Comments
      
      * Change assert to ValueError
      
      * Comments
      
      * Wrap `score` in float to make it JSON serializable
      
      * Incorporate AutoModeLForDocumentQuestionAnswering changes
      
      * Fixup
      
      * Rename postprocess function
      
      * Fix auto import
      
      * Applying comments
      
      * Improve docs
      
      * Remove extra assets and add copyright
      
      * Address comments
      Co-authored-by: default avatarAnkur Goyal <ankur@impira.com>
      2ef77421
    • Yih-Dar's avatar
      remvoe `_create_and_check_torch_fx_tracing` in specific test files (#18667) · 10c774cf
      Yih-Dar authored
      
      
      * remvoe _create_and_check_torch_fx_tracing defined in specific model test files
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      10c774cf
  4. 06 Sep, 2022 2 commits
  5. 05 Sep, 2022 2 commits
  6. 02 Sep, 2022 4 commits
  7. 01 Sep, 2022 3 commits
  8. 31 Aug, 2022 4 commits
  9. 30 Aug, 2022 5 commits
  10. 29 Aug, 2022 3 commits
  11. 26 Aug, 2022 2 commits
  12. 25 Aug, 2022 2 commits
    • Craig Chan's avatar
      Determine framework automatically before ONNX export (#18615) · fbf382c8
      Craig Chan authored
      
      
      * Automatic detection for framework to use when exporting to ONNX
      
      * Log message change
      
      * Incorporating PR comments, adding unit test
      
      * Adding tf for pip install for run_tests_onnxruntime CI
      
      * Restoring past changes to circleci yaml and test_onnx_v2.py, tests moved to tests/onnx/test_features.py
      
      * Fixup
      
      * Adding test to fetcher
      
      * Updating circleci config to log more
      
      * Changing test class name
      
      * Comment typo fix in tests/onnx/test_features.py
      Co-authored-by: default avatarlewtun <lewis.c.tunstall@gmail.com>
      
      * Moving torch_str/tf_str to self.framework_pt/tf
      
      * Remove -rA flag in circleci config
      Co-authored-by: default avatarlewtun <lewis.c.tunstall@gmail.com>
      fbf382c8
    • Patrick Deutschmann's avatar
      Add ONNX support for Longformer (#17176) · 3223d493
      Patrick Deutschmann authored
      * Implement ONNX support for Longformer
      
      Fix repo consistency check complaints
      
      Fix value mismatches
      
      Add pooler output for default model
      
      Increase validation atol to accommodate multiple-choice error
      
      Fix copies
      
      Fix chunking for longer sequence lengths
      
      Add future comment
      
      * Fix issue in mask_invalid_locations
      
      * Remove torch imports in configuration_longformer
      
      * Change config access to fix LED
      
      * Push opset version to support tril
      
      * Work in review comments (mostly style)
      
      * Add Longformer to ONNX tests
      3223d493
  13. 24 Aug, 2022 2 commits
  14. 22 Aug, 2022 1 commit
  15. 19 Aug, 2022 1 commit
  16. 18 Aug, 2022 2 commits
  17. 17 Aug, 2022 2 commits
    • amyeroberts's avatar
      Update feature extractor methods to enable type cast before normalize (#18499) · 49e44b21
      amyeroberts authored
      * Update methods to optionally rescale
      This is necessary to allow for casting our images / videos to numpy arrays within the feature extractors' call. We want to do this to make sure the behaviour is as expected when flags like  are False. If some transformations aren't applied, then the output type can't be unexpected e.g. a list of PIL images instead of numpy arrays.
      
      * Cast images to numpy arrays in call to enable consistent behaviour with different configs
      
      * Remove accidental clip changes
      
      * Update tests to reflect the scaling logic
      We write a generic  function to handle rescaling of our arrays. In order for the API to be intuitive, we take some factor c and rescale the image values by that. This means, the rescaling done in normalize and to_numpy_array are now done with array * (1/255) instead of array / 255. This leads to small differences in the resulting image. When testing, this was in the order of 1e-8, and so deemed OK
      49e44b21
    • Yih-Dar's avatar
      Fix Yolos ONNX export test (#18606) · c99e9846
      Yih-Dar authored
      
      Co-authored-by: default avatarlewtun <lewis.c.tunstall@gmail.com>
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      c99e9846