1. 25 Aug, 2020 2 commits
    • Philip Meier's avatar
      Places365 dataset (#2610) · fc69c225
      Philip Meier authored
      * initial draft
      
      * [dirty] progress
      
      * remove inheritance from ImageFolder
      
      * add tests
      
      * lint
      
      * fix type hints
      
      * align getitem with other datasets
      
      * remove unused import
      
      * add docstring
      
      * guard existing image folders from overwrite
      
      * add missing entry in docstring
      
      * make fixpath more legible
      
      * add Places365 to docs
      fc69c225
    • Philip Meier's avatar
      fix FashionMNIST docstring (#2614) · 01fb0df0
      Philip Meier authored
      01fb0df0
  2. 20 Aug, 2020 1 commit
    • Harsh Rangwani's avatar
      Only pull keys from db in lsun for faster cache. (#2544) · ea6b879e
      Harsh Rangwani authored
      * Only pull keys from db in lsun for faster cache.
      
      This pull request inhances the speed of the cache creation for lsun dataset. For the "kitchen_train" the speed was getting slow with cache creation taking more then two hours. This speeds up to cache creation in within minutes. The issue was pulling the large image values each time and dropping them.
      
      For more details on this please refer this issue https://github.com/jnwatson/py-lmdb/issues/195.
      
      * Fixed bug in lsun.py when loading multiple categories
      
      * Make linter happy
      ea6b879e
  3. 03 Aug, 2020 11 commits
  4. 31 Jul, 2020 9 commits
  5. 30 Jul, 2020 1 commit
  6. 03 Jul, 2020 1 commit
  7. 22 Jun, 2020 1 commit
  8. 18 May, 2020 1 commit
  9. 07 May, 2020 1 commit
    • Guillem Orellana Trullols's avatar
      Update ucf101.py (#2186) · 14af9de6
      Guillem Orellana Trullols authored
      Now the dataset is not working properly because of this line of code `indices = [i for i in range(len(video_list)) if video_list[i][len(self.root) + 1:] in selected_files]`. 
      Performing the `len(self.root) + 1` only make sense if there is no training / to root
      
      ```
      >>> root = 'data/ucf-101/videos'
      >>> video_path = 'data/ucf-101/videos/activity/video.avi'
      >>> video_path [len(root ):]
      '/activity/video.avi'
      >>> video_path [len(root ) + 1:]
      'activity/video.avi'
      ```
      
      Appending the root path also to the selected files is a simple solution and make the dataset works with and without a trailing slash.
      14af9de6
  10. 04 May, 2020 1 commit
  11. 29 Apr, 2020 1 commit
  12. 27 Apr, 2020 1 commit
  13. 16 Apr, 2020 1 commit
  14. 01 Apr, 2020 1 commit
  15. 31 Mar, 2020 2 commits
    • Philip Meier's avatar
      Remove python2 compability code (#2033) · 24f16a33
      Philip Meier authored
      * remove sys.version_info == 2
      
      * remove sys.version_info < 3
      
      * remove from __future__ imports
      24f16a33
    • Philip Meier's avatar
      Remove six dependency (#2017) · 42b8d462
      Philip Meier authored
      * remove six from python code
      
      * remove six from setup.py
      
      * remove six from tests
      
      * remove six from references
      
      * remove six from packaging
      
      * revert str to torch._six._string_classes
      
      * revert str to torch._six._string_classes
      42b8d462
  16. 30 Mar, 2020 1 commit
  17. 17 Mar, 2020 2 commits
    • Tee Jung's avatar
      cc43e0a9
    • Francisco Massa's avatar
      Update video reader to use new decoder (#1978) · 32e16805
      Francisco Massa authored
      * Base decoder for video. (#1747)
      
      Summary:
      Pull Request resolved: https://github.com/pytorch/vision/pull/1747
      
      Pull Request resolved: https://github.com/pytorch/vision/pull/1746
      
      Added the implementation of ffmpeg based decoder with functionality that can be used in VUE and TorchVision.
      
      Reviewed By: fmassa
      
      Differential Revision: D19358914
      
      fbshipit-source-id: abb672f89bfaca6351dda2354f0d35cf8e47fa0f
      
      * Integrated base decoder into VideoReader class and video_utils.py (#1766)
      
      Summary:
      Pull Request resolved: https://github.com/pytorch/vision/pull/1766
      
      Replaced FfmpegDecoder (incompativle with VUE) by base decoder (compatible with VUE).
      Modified python utilities video_utils.py for internal simplification. Public interface got preserved.
      
      Reviewed By: fmassa
      
      Differential Revision: D19415903
      
      fbshipit-source-id: 4d7a0158bd77bac0a18732fe4183fdd9a57f6402
      
      * Optimizating base decoder performance. (#1852)
      
      Summary:
      Pull Request resolved: https://github.com/pytorch/vision/pull/1852
      
      Changed base decoder internals for a faster clip processing.
      
      Reviewed By: stephenyan1231
      
      Differential Revision: D19748379
      
      fbshipit-source-id: 58a435f0a0b25545e7bd1a3edb0b1d558176a806
      
      * Minor fix and decoder class members access.
      
      Summary:
      Found and fix a bug in cropping algorithm (simple mistyping).
      Also derived classes need access to some decoder class members, like initialization parameters - make it protected.
      
      Reviewed By: stephenyan1231, fmassa
      
      Differential Revision: D19895076
      
      fbshipit-source-id: 691336c8e18526b085ae5792ac3546bc387a6db9
      
      * Added missing header for less dependencies. (#1898)
      
      Summary:
      Pull Request resolved: https://github.com/pytorch/vision/pull/1898
      
      Include streams/samplers shouldn't depend on decoder headers. Add dependencies directly to the place where they are required.
      
      Reviewed By: stephenyan1231
      
      Differential Revision: D19911404
      
      fbshipit-source-id: ef322a053708405c02cee4562b456b1602fb12fc
      
      * Implemented VUE Asynchronous Decoder
      
      Summary: For Mothership we have found that asynchronous decoder provides a better performance.
      
      Differential Revision: D20026194
      
      fbshipit-source-id: 627b91844b4e3f917002031dd32cb19c239f4ba8
      
      * fix a bug in API read_video_from_memory (#1942)
      
      Summary:
      Pull Request resolved: https://github.com/pytorch/vision/pull/1942
      
      In D18720474, it introduces a bug in `read_video_from_memory` API. Thank weiyaowang for reporting it.
      
      Reviewed By: weiyaowang
      
      Differential Revision: D20270179
      
      fbshipit-source-id: 66348c99a5ad1f9129b90e934524ddfaad59de03
      
      * extend decoder to support new video_max_dimension argument (#1924)
      
      Summary:
      Pull Request resolved: https://github.com/pytorch/vision/pull/1924
      
      Extend `video reader` decoder python API in Torchvision to support a new argument `video_max_dimension`. This enables the new video decoding use cases. When setting `video_width=0`, `video_height=0`, `video_min_dimension != 0`, and `video_max_dimension != 0`, we can rescale the video clips so that its spatial resolution (height, width) becomes
       - (video_min_dimension, video_max_dimension) if original height < original width
       - (video_max_dimension, video_min_dimension) if original height >= original width
      
      This is useful at video model testing stage, where we perform fully convolution evaluation and take entire video frames without cropping as input. Previously, for instance we can only set `video_width=0`, `video_height=0`, `video_min_dimension = 128`, which will preserve aspect ratio. In production dataset, there are a small number of videos where aspect ratio is either extremely large or small, and when the shorter edge is rescaled to 128, the longer edge is still large. This will easily cause GPU memory OOM when we sample multiple video clips, and put them in a single minibatch.
      
      Now, we can set (for instance) `video_width=0`, `video_height=0`, `video_min_dimension = 128` and `video_max_dimension = 171` so that the rescale resolution is either (128, 171) or (171, 128) depending on whether original height is larger than original width. Thus, we are less likely to have gpu OOM because the spatial size of video clips is determined.
      
      Reviewed By: putivsky
      
      Differential Revision: D20182529
      
      fbshipit-source-id: f9c40afb7590e7c45e6908946597141efa35f57c
      
      * Fixing samplers initialization (#1967)
      
      Summary:
      Pull Request resolved: https://github.com/pytorch/vision/pull/1967
      
      
      
      No-ops for torchvision diff, which fixes samplers.
      
      Differential Revision: D20397218
      
      fbshipit-source-id: 6dc4d04364f305fbda7ca4f67a25ceecd73d0f20
      
      * Exclude C++ test files
      Co-authored-by: default avatarYuri Putivsky <yuri@fb.com>
      Co-authored-by: default avatarZhicheng Yan <zyan3@fb.com>
      32e16805
  18. 16 Mar, 2020 1 commit
  19. 12 Mar, 2020 1 commit