1. 22 Dec, 2020 1 commit
  2. 07 Dec, 2020 1 commit
  3. 01 Dec, 2020 1 commit
    •
      concatenate small tensors into big ones to reduce the use of shared file descriptor (#1795) · 9fc6522d
      Francisco Massa authored
      * concatenate small tensors into big ones to reduce the use of shared file descriptor (#1694)
      
      Summary:
      Pull Request resolved: https://github.com/pytorch/vision/pull/1694
      
      
      
      - The PT dataloader forks worker processes to speed up the fetching of dataset examples. The recommended multiprocessing context is `forkserver` rather than `fork`.
      
      - The main process and the worker processes share the dataset class instance, which avoids duplicating the dataset and saves memory. During this, `ForkPickler(...).dumps(...)` is called to serialize the objects, recursively including the objects inside the dataset instance. A `VideoClips` instance internally uses O(N) `torch.Tensor`s to store per-video information, such as pts and possible clips, where N is the number of videos.
      
      - During dumping, each `torch.Tensor` uses one file descriptor (FD). The OS default FD limit, which can be queried with `ulimit -n`, is 65K. The number of tensors in `VideoClips` often exceeds this limit.
      
      - To resolve this issue, we concatenate the small tensors into a few big ones in the `__getstate__()` method, which is called during pickling. This requires only O(1) tensors; see the sketch after this list.
      
      - Once this diff lands, we can abandon D19173248.
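
      A minimal sketch of the concatenation idea, using a simplified stand-in class (the `pts` attribute and class name are illustrative, not the actual `VideoClips` layout):

      ```python
      import torch

      class VideoClipsLike:
          """Illustrative stand-in: stores one small tensor per video."""

          def __init__(self, pts_per_video):
              # O(N) small tensors; each would cost one file descriptor
              # when serialized with ForkPickler.
              self.pts = [torch.as_tensor(p) for p in pts_per_video]

          def __getstate__(self):
              state = self.__dict__.copy()
              # Pack the small tensors into one big tensor plus per-video
              # lengths, so pickling transfers only O(1) tensors.
              lengths = [t.numel() for t in self.pts]
              state["pts"] = (torch.cat(self.pts), lengths)
              return state

          def __setstate__(self, state):
              flat, lengths = state["pts"]
              # Restore the original per-video tensors on unpickling.
              state["pts"] = list(torch.split(flat, lengths))
              self.__dict__ = state
      ```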
      
      In D19173397, in ClassyVision, we change the mp context from `fork` to `forkserver`, and can finally run the PT dataloader without hanging issues.
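
      A minimal sketch of selecting that context on the PT dataloader (the `TensorDataset` here is just a placeholder for any map-style dataset):

      ```python
      import torch
      from torch.utils.data import DataLoader, TensorDataset

      # Placeholder dataset; any map-style dataset works the same way.
      dataset = TensorDataset(torch.arange(100, dtype=torch.float32))

      # Spawn workers via forkserver instead of fork, as recommended above.
      loader = DataLoader(dataset, batch_size=8, num_workers=4,
                          multiprocessing_context="forkserver")
      ```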
      
      Reviewed By: fmassa
      
      Differential Revision: D19179991
      
      fbshipit-source-id: c8716775c7c154aa33d93b25d112d2a59ea688a9
      
      * Try to fix Windows
      
      * Try fix Windows v2
      
      * Disable tests on Windows
      
      * Add back necessary part
      
      * Try fix OSX (and maybe Windows)
      
      * Fix
      
      * Try enabling Windows

      Co-authored-by: Zhicheng Yan <zyan3@fb.com>
  4. 26 Nov, 2020 1 commit
  5. 20 Nov, 2020 1 commit
  6. 06 Nov, 2020 1 commit
  7. 23 Oct, 2020 1 commit
  8. 12 Oct, 2020 1 commit
  9. 14 Sep, 2020 1 commit
  10. 09 Sep, 2020 1 commit
  11. 27 Aug, 2020 1 commit
    •
      Fix Places365 dataset (#2625) · 6f028212
      Philip Meier authored
      * fix images extraction
      
      * remove test split
      
      * fix tests
      
      * be less clever in test data generation
      
      * remove micro optimization
      
      * lint
  12. 25 Aug, 2020 2 commits
    •
      Places365 dataset (#2610) · fc69c225
      Philip Meier authored
      * initial draft
      
      * [dirty] progress
      
      * remove inheritance from ImageFolder
      
      * add tests
      
      * lint
      
      * fix type hints
      
      * align getitem with other datasets
      
      * remove unused import
      
      * add docstring
      
      * guard existing image folders from overwrite
      
      * add missing entry in docstring
      
      * make fixpath more legible
      
      * add Places365 to docs
    •
      fix FashionMNIST docstring (#2614) · 01fb0df0
      Philip Meier authored
  13. 20 Aug, 2020 1 commit
    •
      Only pull keys from db in lsun for faster cache. (#2544) · ea6b879e
      Harsh Rangwani authored
      * Only pull keys from db in lsun for faster cache.
      
      This pull request speeds up cache creation for the LSUN dataset. For "kitchen_train", cache creation was taking more than two hours; it now completes within minutes. The issue was pulling the large image values from the database each time only to drop them.
      
      For more details, please refer to this issue: https://github.com/jnwatson/py-lmdb/issues/195. A sketch of the keys-only approach follows.
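
      A hedged sketch of the keys-only iteration (`db_path` is a hypothetical path to one LSUN category database; the actual integration in `lsun.py` differs):

      ```python
      import lmdb

      # Hypothetical path to one LSUN category database.
      db_path = "data/lsun/kitchen_train_lmdb"

      env = lmdb.open(db_path, readonly=True, lock=False)
      with env.begin(write=False) as txn:
          # values=False iterates keys only, avoiding reading each large
          # image blob just to discard it, which made caching so slow.
          keys = [key for key in txn.cursor().iternext(keys=True, values=False)]
      ```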
      
      * Fixed bug in lsun.py when loading multiple categories
      
      * Make linter happy
  14. 03 Aug, 2020 11 commits
  15. 31 Jul, 2020 9 commits
  16. 30 Jul, 2020 1 commit
  17. 03 Jul, 2020 1 commit
  18. 22 Jun, 2020 1 commit
  19. 18 May, 2020 1 commit
  20. 07 May, 2020 1 commit
    •
      Update ucf101.py (#2186) · 14af9de6
      Guillem Orellana Trullols authored
      Currently the dataset does not work properly because of this line of code: `indices = [i for i in range(len(video_list)) if video_list[i][len(self.root) + 1:] in selected_files]`.
      Slicing with `len(self.root) + 1` only makes sense if there is no trailing slash in the root:
      
      ```
      >>> root = 'data/ucf-101/videos'
      >>> video_path = 'data/ucf-101/videos/activity/video.avi'
      >>> video_path[len(root):]
      '/activity/video.avi'
      >>> video_path[len(root) + 1:]
      'activity/video.avi'
      ```
      
      Prefixing the selected files with the root path as well is a simple solution and makes the dataset work both with and without a trailing slash; a sketch of the fix follows.
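
      A minimal sketch of that fix (standalone, with illustrative values; inside the dataset class, `root` would be `self.root`):

      ```python
      import os

      root = 'data/ucf-101/videos'  # works with or without a trailing slash
      video_list = ['data/ucf-101/videos/activity/video.avi']
      selected_files = {'activity/video.avi'}  # relative paths from the annotation file

      # Prefix each selected file with the root, so the membership test no
      # longer depends on slicing off len(root) + 1 characters.
      selected_files = {os.path.join(root, f) for f in selected_files}
      indices = [i for i, path in enumerate(video_list) if path in selected_files]
      ```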
  21. 04 May, 2020 1 commit