Commits · a81d99b7977f2064926a23006e4f64ae458ca09d · OpenDAS / vision

11 May, 2020 1 commit

Add static type check with mypy (#2195) · a81d99b7

Philip Meier authored May 11, 2020

* add mypy config

* fix syntax error

* fix annotations in torchvision/utils.py

* add mypy type check to CircleCI

* add mypy cache to ignore files

* try fix CI

* ignore flake8 F821 since it interferes with mypy

* add mypy type check to config generator

* explicitly set config files

a81d99b7

09 Apr, 2020 1 commit

Make read_video_meta_data_from_memory and read_video_from_memory private (#2077) (#2084) · dec8628d

Francisco Massa authored Apr 09, 2020

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/2077

Pull Request resolved: https://github.com/facebookresearch/SlowFast/pull/164

This is a follow-up diff from D18720474

We will be releasing a new version of torchvision soon and the signature of those functions is not ready yet, following my comment in https://our.intern.facebook.com/intern/diff/D18720474/?transaction_id=561239541337402

Reviewed By: stephenyan1231

Differential Revision: D20914571

fbshipit-source-id: 1a7560b8f8e46ab42ef376c50b494a4f73923e94
Co-authored-by: Francisco Massa <fmassa@fb.com>

dec8628d

17 Mar, 2020 2 commits

replace imp with importlib (fixes #1607) (#1976) · 7a36388c
NVS Abhilash authored Mar 17, 2020
```
Co-authored-by: Francisco Massa <fvsmassa@gmail.com>
```
7a36388c

Update video reader to use new decoder (#1978) · 32e16805

Francisco Massa authored Mar 17, 2020

* Base decoder for video. (#1747)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1747

Pull Request resolved: https://github.com/pytorch/vision/pull/1746

Added the implementation of ffmpeg based decoder with functionality that can be used in VUE and TorchVision.

Reviewed By: fmassa

Differential Revision: D19358914

fbshipit-source-id: abb672f89bfaca6351dda2354f0d35cf8e47fa0f

* Integrated base decoder into VideoReader class and video_utils.py (#1766)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1766

Replaced FfmpegDecoder (incompativle with VUE) by base decoder (compatible with VUE).
Modified python utilities video_utils.py for internal simplification. Public interface got preserved.

Reviewed By: fmassa

Differential Revision: D19415903

fbshipit-source-id: 4d7a0158bd77bac0a18732fe4183fdd9a57f6402

* Optimizating base decoder performance. (#1852)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1852

Changed base decoder internals for a faster clip processing.

Reviewed By: stephenyan1231

Differential Revision: D19748379

fbshipit-source-id: 58a435f0a0b25545e7bd1a3edb0b1d558176a806

* Minor fix and decoder class members access.

Summary:
Found and fix a bug in cropping algorithm (simple mistyping).
Also derived classes need access to some decoder class members, like initialization parameters - make it protected.

Reviewed By: stephenyan1231, fmassa

Differential Revision: D19895076

fbshipit-source-id: 691336c8e18526b085ae5792ac3546bc387a6db9

* Added missing header for less dependencies. (#1898)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1898

Include streams/samplers shouldn't depend on decoder headers. Add dependencies directly to the place where they are required.

Reviewed By: stephenyan1231

Differential Revision: D19911404

fbshipit-source-id: ef322a053708405c02cee4562b456b1602fb12fc

* Implemented VUE Asynchronous Decoder

Summary: For Mothership we have found that asynchronous decoder provides a better performance.

Differential Revision: D20026194

fbshipit-source-id: 627b91844b4e3f917002031dd32cb19c239f4ba8

* fix a bug in API read_video_from_memory (#1942)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1942

In D18720474, it introduces a bug in `read_video_from_memory` API. Thank weiyaowang for reporting it.

Reviewed By: weiyaowang

Differential Revision: D20270179

fbshipit-source-id: 66348c99a5ad1f9129b90e934524ddfaad59de03

* extend decoder to support new video_max_dimension argument (#1924)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1924

Extend `video reader` decoder python API in Torchvision to support a new argument `video_max_dimension`. This enables the new video decoding use cases. When setting `video_width=0`, `video_height=0`, `video_min_dimension != 0`, and `video_max_dimension != 0`, we can rescale the video clips so that its spatial resolution (height, width) becomes
- (video_min_dimension, video_max_dimension) if original height < original width
- (video_max_dimension, video_min_dimension) if original height >= original width

This is useful at video model testing stage, where we perform fully convolution evaluation and take entire video frames without cropping as input. Previously, for instance we can only set `video_width=0`, `video_height=0`, `video_min_dimension = 128`, which will preserve aspect ratio. In production dataset, there are a small number of videos where aspect ratio is either extremely large or small, and when the shorter edge is rescaled to 128, the longer edge is still large. This will easily cause GPU memory OOM when we sample multiple video clips, and put them in a single minibatch.

Now, we can set (for instance) `video_width=0`, `video_height=0`, `video_min_dimension = 128` and `video_max_dimension = 171` so that the rescale resolution is either (128, 171) or (171, 128) depending on whether original height is larger than original width. Thus, we are less likely to have gpu OOM because the spatial size of video clips is determined.

Reviewed By: putivsky

Differential Revision: D20182529

fbshipit-source-id: f9c40afb7590e7c45e6908946597141efa35f57c

* Fixing samplers initialization (#1967)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1967

No-ops for torchvision diff, which fixes samplers.

Differential Revision: D20397218

fbshipit-source-id: 6dc4d04364f305fbda7ca4f67a25ceecd73d0f20

* Exclude C++ test files
Co-authored-by: Yuri Putivsky <yuri@fb.com>
Co-authored-by: Zhicheng Yan <zyan3@fb.com>

32e16805

28 Jan, 2020 1 commit

torchscriptable functions for video io (#1653) (#1794) · e130c6cc

Francisco Massa authored Jan 28, 2020

* torchscriptable functions for video io (#1653)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1653



created new torchscriptable video io functions as part of the api: read_video_meta_data_from_memory and read_video_from_memory.

Updated the implementation of some of the internal functions to be torchscriptable.

Reviewed By: stephenyan1231

Differential Revision: D18720474

fbshipit-source-id: 4ee646b66afecd2dc338a71fd8f249f25a3263bc

* BugFix
Co-authored-by: Jon Guerin <54725679+jguerin-fb@users.noreply.github.com>

e130c6cc

29 Oct, 2019 1 commit
- Unify video metadata in VideoClips (#1527) · 7d509c5d
  Francisco Massa authored Oct 29, 2019
```
* Unify video metadata in VideoClips

* Bugfix

* Make tests a bit more robust
```
  7d509c5d
23 Oct, 2019 1 commit

Unify video backend (#1514) · 97b53f96

Francisco Massa authored Oct 23, 2019

* Unify video backend interfaces

* Remove reference cycle

* Make functions private and enable tests on OSX

* Disable test if video_reader backend not available

* Lint

* Fix import after refactoring

* Fix lint

97b53f96

21 Oct, 2019 1 commit
- fix a bug when video decoding fails and empty frames are returned (#1506) · 2804c122
  Zhicheng Yan authored Oct 21, 2019
  
  2804c122
15 Oct, 2019 1 commit

Better handle corrupted videos (#1463) · da89dade

Francisco Massa authored Oct 15, 2019

* Handle corrupted video headers in io

* Catch exceptions while decoding partly-corrupted files

* Add more tests

da89dade

12 Oct, 2019 1 commit

extend video reader to support fast video probing (#1437) · ed5b2dc3

Zhicheng Yan authored Oct 12, 2019

* extend video reader to support fast video probing

* fix c++ lint

* small fix

* allow to accept input video of type torch.Tensor

ed5b2dc3

08 Oct, 2019 1 commit
- expose more io api (#1423) · e48b9584
  Zhicheng Yan authored Oct 08, 2019
  
  e48b9584
03 Oct, 2019 1 commit

add metadata to video dataset classes. bug fix. more robustness (#1376) · 49b01e3a

Zhicheng Yan authored Oct 03, 2019

* add metadata to video dataset classes. bug fix. more robustness

* query video backend within VideoClips class

* Fix tests

* Fix lint

49b01e3a

30 Sep, 2019 1 commit

modified code of io.read_video and io.read_video_timestamps to intepret pts... · 17e355f7

Chandresh Kanani authored Sep 30, 2019

modified code of io.read_video and io.read_video_timestamps to intepret pts values in seconds (#1331)

* modified code of io.read_video and io.read_video_timestamps to interpret pts values in seconds

* changed default value for pts_unit to pts, corrected formatting

* hanndliing both fractions and floats for start_pts and end_pts, added test cases for pts_unit sec

* moved unit conversion logic to _read_from_stream method

17e355f7

24 Sep, 2019 1 commit

add _backend argument to __init__() of class VideoClips (#1363) · 78743740

Zhicheng Yan authored Sep 24, 2019

* add _backend argument to __init__() of class VideoClips

* minor fix

* minor fix

* Make backend private in VideoClips

* Fix lint

78743740

20 Sep, 2019 1 commit

[video reader] inception commit (#1303) · 31fad34f

Zhicheng Yan authored Sep 20, 2019

* [video reader] inception commit

* add method save_metadata to class VideoClips in video_utils.py

* add load_metadata() method to VideoClips class

* add Exception to not catch unexpected events such as memory erros, interrupt

* fix bugs in video_plus.py

* [video reader]remove logging. update setup.py

* remove time measurement in test_video_reader.py

* Remove glog and try making ffmpeg finding more robust

* Add ffmpeg to conda build

* Add ffmpeg to conda build [again]

* Make library path finding more robust

* Missing import

* One more missing fix for import

* Py2 compatibility and change package to av to avoid version conflict with ffmpeg

* Fix for python2

* [video reader] support to decode one stream only (e.g. video/audio stream)

* remove argument _precomputed_metadata_filepath

* remove save_metadata method

* add get_metadata method

* expose _precomputed_metadata and frame_rate arguments in video dataset __init__ method

* remove ssize_t

* remove size_t to pass CI check on Windows

* add PyInit__video_reader function to pass CI check on Windows

* minor fix to define PyInit_video_reader symbol

* Make c++ video reader optional

* Temporarily revert changes to test_io

* Revert changes to python files

* Rename files to make it private

* Fix python lint

* Fix C++ lint

* add a functor object EnumClassHash to make Enum class instances usable as key type of std::unordered_map

* fix cpp format check

31fad34f

07 Aug, 2019 1 commit
- Rewrite torchvision packaging (#1209) · 64ccfe34
  Edward Z. Yang authored Aug 07, 2019
```
Following a similar line of inquiry to pytorch/audio#217
```
  64ccfe34
02 Aug, 2019 1 commit

Expose docs for io and ops package (#1189) · 4ec38d49

Francisco Massa authored Aug 02, 2019

* Expose docs for io and ops package

Had do modify the docstrings to use Napoleon NumPy style, because Napoleon Google Style doesn't support multiple return arguments

* Add video section

4ec38d49

31 Jul, 2019 1 commit

Video reference scripts (#1180) · 5c0b7f31

Francisco Massa authored Jul 31, 2019

* Copy classification scripts for video classification

* Initial version of video classification

* add version

* Training of r2plus1d_18 on kinetics work

Gives even slightly better results than expected, with 57.336 top1 clip accuracy. But we count some clips twice in this evaluation

* Cleanups on training script

* Lint

* Minor improvements

* Remove some hacks

* Lint

5c0b7f31

26 Jul, 2019 2 commits
- Optimize read_video_timestamps for some formats (#1168) · 2287c8f2
  Francisco Massa authored Jul 26, 2019
```
* Optimize read_video_timestamps for some formats

* Add some tests
```
  2287c8f2
- Miscellaneous fixes and improvements for video reading (#1161) · 81021581
  Francisco Massa authored Jul 26, 2019
```
* Miscellaneous fixes and improvements

* Guard against videos without video stream

* Fix lint

* Add test for packed b-frames videos

* Fix missing import
```
  81021581
24 Jul, 2019 2 commits

Test videos with B-Frames (#1157) · 010984d4

Francisco Massa authored Jul 24, 2019

Also extend video saving to support different codecs and options. Notably, we can now save with lossless compression

010984d4

Properly order videos with B-frames (#1155) · b25f81e0

Francisco Massa authored Jul 24, 2019

* Properly order videos with B-frames

* seek doesn't seek to pts, but dts

Find a way of overcoming this problem.

b25f81e0

23 Jul, 2019 1 commit
- Ignore utf-8 decode errors (#1153) · fe3b4c8f
  Francisco Massa authored Jul 23, 2019
```
Also make logging less verbose
```
  fe3b4c8f
19 Jul, 2019 1 commit

Add VideoClips and Kinetics dataset (#1077) · 5d1372c0

Francisco Massa authored Jul 19, 2019

* Add VideoClips and Kinetics dataset

* Lint + add back missing line

* Adds ClipSampler following Bruno comment

* Change name following Bruno's suggestion

* Enable specifying a target framerate

* Fix test_io for new interface

* Add comment mentioning drop_last behavior

* Make compute_clips more robust

* Flake8

* Fix for Python2

5d1372c0

02 Jul, 2019 1 commit

Adds video reading / saving functionalities (#1039) · d293c4c5

Francisco Massa authored Jul 02, 2019

* WIP

* WIP

* Add some documentation

* Improve tests and add GC collection

* [WIP] add timestamp getter

* Bugfixes

* Improvements and travis

* Add audio fine-grained alignment

* More doc

* Remove unecessary file

* Remove comment

* Lazy import av

* Remove hard-coded constants for the test

* Return info stats from read

* Fix for Python-2

d293c4c5