Commits · eb00e2ad9da0df409d62033a1d7078572df67fb5 · OpenDAS / vision

26 Jan, 2021 1 commit
- Fix sphinx warnings and turn warnings into errors (#3290) · eb00e2ad
  Nicolas Hug authored Jan 26, 2021
```
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>
```
  eb00e2ad
22 Jan, 2021 1 commit
- Unify parameters formatting in docstrings (#3268) · 89bc3079
  Nicolas Hug authored Jan 22, 2021
  
  89bc3079
01 Dec, 2020 1 commit

Add option to write audio to video file (#2304) · 1b00af38

Francisco Massa authored Dec 01, 2020



* Add option to write audio to video file

Summary:
I was trying to use torchvision's `write_video` function and realized there was no option to add in the audio.

Thus, this diff contains the changes necessary such that this is possible. This is my first time trying to contribute to this project, so be as harsh as you need!

Reviewed By: fmassa

Differential Revision: D21480083

fbshipit-source-id: 2e11f2c8728d42f86c94068f75b843793d5a94aa

* Fix typo

* Try fix Windows

* Disable test on Windows
Co-authored-by: Joanna Bitton <jbitton@fb.com>

1b00af38

14 Sep, 2020 1 commit

Add typehints for torchvision.io (#2543) · f8bf06d5

Philip Meier authored Sep 14, 2020



* enable typing check for torchvision.io

* fix existing errors

* Update torchvision/io/_video_opt.py
Co-authored-by: Francisco Massa <fvsmassa@gmail.com>

* add ignores for FileFinder

* use python 3 type hints

* lint

* video_opt

* video

* try quote av type hints

* revert from .dim() to .ndim

* revert changes to _video_opt.py and ignore errors

* fix type hints

* fix type hints for read_video_timestamps

* change offset int to float

* remove unused import
Co-authored-by: Francisco Massa <fvsmassa@gmail.com>

f8bf06d5

22 Jun, 2020 2 commits

Refactoring to use contexts managers, list comprehensions when more idiomatic,... · 42aa9b26

Quentin Duval authored Jun 22, 2020

Refactoring to use contexts managers, list comprehensions when more idiomatic, and minor renaming to help reader clarity (#2335)

* Refactoring to use contexts managers, list comprehensions when more idiomatic, and minor renaming to help reader clarity.

* Fix flake8 warning in video_utils.py

42aa9b26

Fix: PyAV does not support floating point numbers with decimals as FPS when... · 32f21dad

Quentin Duval authored Jun 22, 2020

Fix: PyAV does not support floating point numbers with decimals as FPS when writing and will throw in case this constraint is not satisfied. (#2334)

32f21dad

29 May, 2020 1 commit

video.py read_video_timestamps (follow-up PR #2202) (#2268) · a85f21d2

Marc authored May 29, 2020

* get pts directly instead of storing full frames to get pts later

* fix linting

* add initial pts value
sort pts

* catch decoding errors for read_video_timestamp

a85f21d2

19 May, 2020 1 commit
- video.py read_video_timestamps calculate pts without storing full frames (#2202) · e6b4078e
  Marc authored May 19, 2020
```
* get pts directly instead of storing full frames to get pts later

* fix linting

* add initial pts value
sort pts
```
  e6b4078e
14 May, 2020 1 commit
- Set should_buffer to True by default in _read_from_stream (#2201) · cc4c2af7
  Marc authored May 14, 2020
  
  cc4c2af7
09 Apr, 2020 1 commit

Make read_video_meta_data_from_memory and read_video_from_memory private (#2077) (#2084) · dec8628d

Francisco Massa authored Apr 09, 2020

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/2077

Pull Request resolved: https://github.com/facebookresearch/SlowFast/pull/164

This is a follow-up diff from D18720474

We will be releasing a new version of torchvision soon and the signature of those functions is not ready yet, following my comment in https://our.intern.facebook.com/intern/diff/D18720474/?transaction_id=561239541337402

Reviewed By: stephenyan1231

Differential Revision: D20914571

fbshipit-source-id: 1a7560b8f8e46ab42ef376c50b494a4f73923e94
Co-authored-by: Francisco Massa <fmassa@fb.com>

dec8628d

17 Mar, 2020 1 commit

Update video reader to use new decoder (#1978) · 32e16805

Francisco Massa authored Mar 17, 2020

* Base decoder for video. (#1747)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1747

Pull Request resolved: https://github.com/pytorch/vision/pull/1746

Added the implementation of ffmpeg based decoder with functionality that can be used in VUE and TorchVision.

Reviewed By: fmassa

Differential Revision: D19358914

fbshipit-source-id: abb672f89bfaca6351dda2354f0d35cf8e47fa0f

* Integrated base decoder into VideoReader class and video_utils.py (#1766)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1766

Replaced FfmpegDecoder (incompativle with VUE) by base decoder (compatible with VUE).
Modified python utilities video_utils.py for internal simplification. Public interface got preserved.

Reviewed By: fmassa

Differential Revision: D19415903

fbshipit-source-id: 4d7a0158bd77bac0a18732fe4183fdd9a57f6402

* Optimizating base decoder performance. (#1852)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1852

Changed base decoder internals for a faster clip processing.

Reviewed By: stephenyan1231

Differential Revision: D19748379

fbshipit-source-id: 58a435f0a0b25545e7bd1a3edb0b1d558176a806

* Minor fix and decoder class members access.

Summary:
Found and fix a bug in cropping algorithm (simple mistyping).
Also derived classes need access to some decoder class members, like initialization parameters - make it protected.

Reviewed By: stephenyan1231, fmassa

Differential Revision: D19895076

fbshipit-source-id: 691336c8e18526b085ae5792ac3546bc387a6db9

* Added missing header for less dependencies. (#1898)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1898

Include streams/samplers shouldn't depend on decoder headers. Add dependencies directly to the place where they are required.

Reviewed By: stephenyan1231

Differential Revision: D19911404

fbshipit-source-id: ef322a053708405c02cee4562b456b1602fb12fc

* Implemented VUE Asynchronous Decoder

Summary: For Mothership we have found that asynchronous decoder provides a better performance.

Differential Revision: D20026194

fbshipit-source-id: 627b91844b4e3f917002031dd32cb19c239f4ba8

* fix a bug in API read_video_from_memory (#1942)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1942

In D18720474, it introduces a bug in `read_video_from_memory` API. Thank weiyaowang for reporting it.

Reviewed By: weiyaowang

Differential Revision: D20270179

fbshipit-source-id: 66348c99a5ad1f9129b90e934524ddfaad59de03

* extend decoder to support new video_max_dimension argument (#1924)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1924

Extend `video reader` decoder python API in Torchvision to support a new argument `video_max_dimension`. This enables the new video decoding use cases. When setting `video_width=0`, `video_height=0`, `video_min_dimension != 0`, and `video_max_dimension != 0`, we can rescale the video clips so that its spatial resolution (height, width) becomes
- (video_min_dimension, video_max_dimension) if original height < original width
- (video_max_dimension, video_min_dimension) if original height >= original width

This is useful at video model testing stage, where we perform fully convolution evaluation and take entire video frames without cropping as input. Previously, for instance we can only set `video_width=0`, `video_height=0`, `video_min_dimension = 128`, which will preserve aspect ratio. In production dataset, there are a small number of videos where aspect ratio is either extremely large or small, and when the shorter edge is rescaled to 128, the longer edge is still large. This will easily cause GPU memory OOM when we sample multiple video clips, and put them in a single minibatch.

Now, we can set (for instance) `video_width=0`, `video_height=0`, `video_min_dimension = 128` and `video_max_dimension = 171` so that the rescale resolution is either (128, 171) or (171, 128) depending on whether original height is larger than original width. Thus, we are less likely to have gpu OOM because the spatial size of video clips is determined.

Reviewed By: putivsky

Differential Revision: D20182529

fbshipit-source-id: f9c40afb7590e7c45e6908946597141efa35f57c

* Fixing samplers initialization (#1967)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1967

No-ops for torchvision diff, which fixes samplers.

Differential Revision: D20397218

fbshipit-source-id: 6dc4d04364f305fbda7ca4f67a25ceecd73d0f20

* Exclude C++ test files
Co-authored-by: Yuri Putivsky <yuri@fb.com>
Co-authored-by: Zhicheng Yan <zyan3@fb.com>

32e16805

28 Jan, 2020 1 commit

torchscriptable functions for video io (#1653) (#1794) · e130c6cc

Francisco Massa authored Jan 28, 2020

* torchscriptable functions for video io (#1653)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1653



created new torchscriptable video io functions as part of the api: read_video_meta_data_from_memory and read_video_from_memory.

Updated the implementation of some of the internal functions to be torchscriptable.

Reviewed By: stephenyan1231

Differential Revision: D18720474

fbshipit-source-id: 4ee646b66afecd2dc338a71fd8f249f25a3263bc

* BugFix
Co-authored-by: Jon Guerin <54725679+jguerin-fb@users.noreply.github.com>

e130c6cc

23 Oct, 2019 1 commit

Unify video backend (#1514) · 97b53f96

Francisco Massa authored Oct 23, 2019

* Unify video backend interfaces

* Remove reference cycle

* Make functions private and enable tests on OSX

* Disable test if video_reader backend not available

* Lint

* Fix import after refactoring

* Fix lint

97b53f96

21 Oct, 2019 1 commit
- fix a bug when video decoding fails and empty frames are returned (#1506) · 2804c122
  Zhicheng Yan authored Oct 21, 2019
  
  2804c122
15 Oct, 2019 1 commit

Better handle corrupted videos (#1463) · da89dade

Francisco Massa authored Oct 15, 2019

* Handle corrupted video headers in io

* Catch exceptions while decoding partly-corrupted files

* Add more tests

da89dade

30 Sep, 2019 1 commit

modified code of io.read_video and io.read_video_timestamps to intepret pts... · 17e355f7

Chandresh Kanani authored Sep 30, 2019

modified code of io.read_video and io.read_video_timestamps to intepret pts values in seconds (#1331)

* modified code of io.read_video and io.read_video_timestamps to interpret pts values in seconds

* changed default value for pts_unit to pts, corrected formatting

* hanndliing both fractions and floats for start_pts and end_pts, added test cases for pts_unit sec

* moved unit conversion logic to _read_from_stream method

17e355f7

07 Aug, 2019 1 commit
- Rewrite torchvision packaging (#1209) · 64ccfe34
  Edward Z. Yang authored Aug 07, 2019
```
Following a similar line of inquiry to pytorch/audio#217
```
  64ccfe34
02 Aug, 2019 1 commit

Expose docs for io and ops package (#1189) · 4ec38d49

Francisco Massa authored Aug 02, 2019

* Expose docs for io and ops package

Had do modify the docstrings to use Napoleon NumPy style, because Napoleon Google Style doesn't support multiple return arguments

* Add video section

4ec38d49

31 Jul, 2019 1 commit

Video reference scripts (#1180) · 5c0b7f31

Francisco Massa authored Jul 31, 2019

* Copy classification scripts for video classification

* Initial version of video classification

* add version

* Training of r2plus1d_18 on kinetics work

Gives even slightly better results than expected, with 57.336 top1 clip accuracy. But we count some clips twice in this evaluation

* Cleanups on training script

* Lint

* Minor improvements

* Remove some hacks

* Lint

5c0b7f31

26 Jul, 2019 2 commits
- Optimize read_video_timestamps for some formats (#1168) · 2287c8f2
  Francisco Massa authored Jul 26, 2019
```
* Optimize read_video_timestamps for some formats

* Add some tests
```
  2287c8f2
- Miscellaneous fixes and improvements for video reading (#1161) · 81021581
  Francisco Massa authored Jul 26, 2019
```
* Miscellaneous fixes and improvements

* Guard against videos without video stream

* Fix lint

* Add test for packed b-frames videos

* Fix missing import
```
  81021581
24 Jul, 2019 2 commits

Test videos with B-Frames (#1157) · 010984d4

Francisco Massa authored Jul 24, 2019

Also extend video saving to support different codecs and options. Notably, we can now save with lossless compression

010984d4

Properly order videos with B-frames (#1155) · b25f81e0

Francisco Massa authored Jul 24, 2019

* Properly order videos with B-frames

* seek doesn't seek to pts, but dts

Find a way of overcoming this problem.

b25f81e0

23 Jul, 2019 1 commit
- Ignore utf-8 decode errors (#1153) · fe3b4c8f
  Francisco Massa authored Jul 23, 2019
```
Also make logging less verbose
```
  fe3b4c8f
19 Jul, 2019 1 commit

Add VideoClips and Kinetics dataset (#1077) · 5d1372c0

Francisco Massa authored Jul 19, 2019

* Add VideoClips and Kinetics dataset

* Lint + add back missing line

* Adds ClipSampler following Bruno comment

* Change name following Bruno's suggestion

* Enable specifying a target framerate

* Fix test_io for new interface

* Add comment mentioning drop_last behavior

* Make compute_clips more robust

* Flake8

* Fix for Python2

5d1372c0

02 Jul, 2019 1 commit

Adds video reading / saving functionalities (#1039) · d293c4c5

Francisco Massa authored Jul 02, 2019

* WIP

* WIP

* Add some documentation

* Improve tests and add GC collection

* [WIP] add timestamp getter

* Bugfixes

* Improvements and travis

* Add audio fine-grained alignment

* More doc

* Remove unecessary file

* Remove comment

* Lazy import av

* Remove hard-coded constants for the test

* Return info stats from read

* Fix for Python-2

d293c4c5