1. 15 Jan, 2021 1 commit
  2. 11 Jan, 2021 1 commit
    • Add widerface dataset (#2883) · d0063f3d
      Josh Bradley authored
      
      
      * initial commit of widerface dataset
      
      * comment out old code
      
      * improve parsing of annotation files
      
      * code cleanup and fix docstring comments
      
      * speed up check for quota exceeded
      
      * cleanup print statements
      
      * reformat code and remove print statements
      
      * minor code cleanup and reformatting
      
      * add more comments
      
      * reuse variable
      
      * reverse formatting changes
      
      * fix flake8 errors
      
      * add type annotations
      
      * fix mypy errors
      
      * add a base_folder to root directory
      
      * some formatting fixes
      
      * GDrive threshold does not throw 403 error
      
      * testing new download logic
      
      * cleanup logic for download and integrity check
      
      * use a better variable name
      
      * format fix
      
      * reorder list in docstring
      
      * initial widerface unit test - fails on MD5 check
      
      * use list of dictionaries to store dataset
      
      * fix docstring formatting
      
      * remove unnecessary error checking
      
      * fix type checker error
      
      * revert typo fix
      
      * rename var constants, use file context manager, verify str args
      
      * fix flake8 error
      
      * fix checking target_type argument values
      
      * create uncompressed dataset folders
      
      * cleanup unit tests for widerface
      
      * use correct os function
      
      * add more info to docstring
      
      * disable unittests for windows
      
      * fix _check_integrity logic
      
      * update docstring
      
      * remove citation
      
      * remove target_type option
      
      * fix formatting issue
      Co-authored-by: Philip Meier <github.pmeier@posteo.de>
      
      * remove comment and add more info to docstring
      
      * update type annotations
      
      * restart CI jobs
      Co-authored-by: Joshua Bradley <jgbrad3@evoforge.org>
      Co-authored-by: Philip Meier <github.pmeier@posteo.de>
      Co-authored-by: vfdev <vfdev.5@gmail.com>
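      
      A minimal usage sketch of the dataset added here, assuming the torchvision.datasets.WIDERFace interface as merged in this PR (the argument names and target keys are inferred from the docstring work above; treat the details as assumptions):
      
          from torchvision import datasets
          
          # Hypothetical paths; a "widerface" base folder is created under root.
          dataset = datasets.WIDERFace(
              root="data",
              split="train",      # one of "train", "val", "test"
              download=True,      # Google Drive download; may hit quota limits
          )
          
          # train/val samples are (PIL image, annotation dict); the test split
          # carries no annotations, so the target is None there.
          img, target = dataset[0]
          print(img.size, None if target is None else target["bbox"].shape)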
  3. 07 Jan, 2021 1 commit
  4. 22 Dec, 2020 1 commit
  5. 07 Dec, 2020 1 commit
  6. 01 Dec, 2020 1 commit
    • concatenate small tensors into big ones to reduce the use of shared file descriptor (#1795) · 9fc6522d
      Francisco Massa authored
      * concatenate small tensors into big ones to reduce the use of shared file descriptor (#1694)
      
      Summary:
      Pull Request resolved: https://github.com/pytorch/vision/pull/1694
      
      
      
      - The PT dataloader forks worker processes to speed up the fetching of dataset examples. The recommended multiprocessing context is `forkserver` rather than `fork`.
      
      - The main process and worker processes share the dataset instance, which avoids duplicating the dataset and saves memory. During this, `ForkPickler(...).dumps(...)` is called to serialize the objects, including the objects inside the dataset instance, recursively. A `VideoClips` instance internally uses O(N) `torch.Tensor`s to store per-video information, such as pts and possible clips, where N is the number of videos.
      
      - During dumping, each `torch.Tensor` uses one file descriptor (FD). The OS default FD limit is 65K (query it with `ulimit -n`), and the number of tensors in `VideoClips` often exceeds it.
      
      - To resolve this issue, we concatenate the small tensors into a few big ones in the `__getstate__()` method, which is called during pickling. This requires only O(1) tensors (see the sketch after this summary).
      
      - Once this diff lands, we can abandon D19173248.
      
      In D19173397 in ClassyVision, we change the mp context from `fork` to `forkserver` and can finally run the PT dataloader without hanging issues.
      
      Reviewed By: fmassa
      
      Differential Revision: D19179991
      
      fbshipit-source-id: c8716775c7c154aa33d93b25d112d2a59ea688a9
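      
      A minimal sketch of the concatenation trick described above, not the actual VideoClips code; the class and field names are hypothetical:
      
          import torch
          
          class ClipMetadata:
              def __init__(self, pts_per_video):
                  # pts_per_video: list of N 1-D tensors, one per video
                  self.pts_per_video = pts_per_video
          
              def __getstate__(self):
                  # Called by ForkPickler during pickling: replace the O(N)
                  # small tensors with one big tensor plus a lengths tensor,
                  # so serialization needs only O(1) file descriptors.
                  state = self.__dict__.copy()
                  state["_lengths"] = torch.as_tensor(
                      [len(t) for t in self.pts_per_video])
                  state["pts_per_video"] = torch.cat(self.pts_per_video)
                  return state
          
              def __setstate__(self, state):
                  # Invert the concatenation on unpickling.
                  lengths = state.pop("_lengths").tolist()
                  state["pts_per_video"] = list(
                      torch.split(state["pts_per_video"], lengths))
                  self.__dict__ = state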
      
      * Try to fix Windows
      
      * Try fix Windows v2
      
      * Disable tests on Windows
      
      * Add back necessary part
      
      * Try fix OSX (and maybe Windows)
      
      * Fix
      
      * Try enabling Windows
      Co-authored-by: Zhicheng Yan <zyan3@fb.com>
  7. 26 Nov, 2020 1 commit
  8. 20 Nov, 2020 1 commit
  9. 06 Nov, 2020 1 commit
  10. 23 Oct, 2020 1 commit
  11. 12 Oct, 2020 1 commit
  12. 14 Sep, 2020 1 commit
  13. 09 Sep, 2020 1 commit
  14. 27 Aug, 2020 1 commit
    • Fix Places365 dataset (#2625) · 6f028212
      Philip Meier authored
      * fix images extraction
      
      * remove test split
      
      * fix tests
      
      * be less clever in test data generation
      
      * remove micro optimization
      
      * lint
  15. 25 Aug, 2020 2 commits
    • Places365 dataset (#2610) · fc69c225
      Philip Meier authored
      * initial draft
      
      * [dirty] progress
      
      * remove inheritance from ImageFolder
      
      * add tests
      
      * lint
      
      * fix type hints
      
      * align getitem with other datasets
      
      * remove unused import
      
      * add docstring
      
      * guard existing image folders from overwrite
      
      * add missing entry in docstring
      
      * make fixpath more legible
      
      * add Places365 to docs
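      
      A short usage sketch, assuming the torchvision.datasets.Places365 interface added in this PR (split names and the small flag follow the docstring; treat the exact defaults as assumptions):
      
          from torchvision import datasets
          
          dataset = datasets.Places365(
              root="data/places365",
              split="val",    # e.g. "train-standard", "train-challenge", "val"
              small=True,     # 256x256 variant instead of the high-resolution one
              download=True,
          )
          img, label = dataset[0]   # (PIL image, class index), as in other datasets
          print(dataset.classes[label])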
    • fix FashionMNIST docstring (#2614) · 01fb0df0
      Philip Meier authored
  16. 20 Aug, 2020 1 commit
    • Only pull keys from db in lsun for faster cache. (#2544) · ea6b879e
      Harsh Rangwani authored
      * Only pull keys from db in lsun for faster cache.
      
      This pull request speeds up cache creation for the LSUN dataset. For "kitchen_train", cache creation was taking more than two hours; with this change it completes within minutes. The issue was that the loop pulled the large image values each time only to drop them.
      
      For more details, please refer to this issue: https://github.com/jnwatson/py-lmdb/issues/195. A sketch of the keys-only iteration is below.
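      
      A sketch of the keys-only iteration this change relies on, using the py-lmdb cursor API (the database path is hypothetical):
      
          import lmdb
          
          env = lmdb.open("kitchen_train_lmdb", readonly=True, lock=False,
                          readahead=False, meminit=False)
          with env.begin(write=False) as txn:
              # iternext(keys=True, values=False) yields only the keys, so the
              # large image blobs are never read while building the cache.
              keys = [key for key in txn.cursor().iternext(keys=True, values=False)]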
      
      * Fixed bug in lsun.py when loading multiple categories
      
      * Make linter happy
  17. 03 Aug, 2020 11 commits
  18. 31 Jul, 2020 9 commits
  19. 30 Jul, 2020 1 commit
  20. 03 Jul, 2020 1 commit
  21. 22 Jun, 2020 1 commit