- 26 Jan, 2021 1 commit
-
-
Nicolas Hug authored
Co-authored-by:Vasilis Vryniotis <datumbox@users.noreply.github.com>
-
- 01 Dec, 2020 1 commit
-
-
Francisco Massa authored
* concatenate small tensors into big ones to reduce the use of shared file descriptor (#1694) Summary: Pull Request resolved: https://github.com/pytorch/vision/pull/1694 - PT dataloader forks worker process to speed up the fetching of dataset example. The recommended way of multiprocess context is `forkserver` rather than `fork`. - Main process and worker processes will share the dataset class instance, which avoid duplicating the dataset and save memory. In this process, `ForkPickler(..).dumps(...)` will be called to serialize the objects, including objects within dataset instance recursively. `VideoClips` instance internally uses O(N) `torch.Tensor` to store per-video information, such as pts, and possible clips, where N is the No. of videos. - During dumping, each `torch.Tensor` will use one File Descriptor (FD). The OS default max limit of FD is 65K by using `ulimit -n` to query. The number of tensors in `VideoClips` often exceeds the limit. - To resolve this issue, we use a few big tensors by concatenating small tensors in the `__getstate__()` method, which will be called during pickling. This will only require O(1) tensors. - When this diff is landed, we can abondon D19173248 In D19173397, in ClassyVision, we change the mp context from `fork` to `forkserver`, and finally can run the PT dataloader without hanging issues. Reviewed By: fmassa Differential Revision: D19179991 fbshipit-source-id: c8716775c7c154aa33d93b25d112d2a59ea688a9 * Try to fix Windows * Try fix Windows v2 * Disable tests on Windows * Add back necessary part * Try fix OSX (and maybe Windows) * Fix * Try enabling Windows Co-authored-by:
Zhicheng Yan <zyan3@fb.com>
-
- 30 Mar, 2020 1 commit
-
-
Fernando Pérez-García authored
* Improve code readability and docstring * Remove unused argument * Improve make_dataset() readability
-
- 07 Oct, 2019 1 commit
-
-
Jaesun Park authored
* Fix hmdb51.py typo * Fix ucf101.py typo
-
- 03 Oct, 2019 1 commit
-
-
Zhicheng Yan authored
* add metadata to video dataset classes. bug fix. more robustness * query video backend within VideoClips class * Fix tests * Fix lint
-
- 20 Sep, 2019 1 commit
-
-
Francisco Massa authored
-
- 28 Aug, 2019 1 commit
-
-
JMistele authored
Fixed video labelling after `subset` call for HMDB51 dataset (hmdb51.py) (EDIT: UCF101 as well) (#1240) * Fixed video labelling after subset for HMDB51 dataset * Fixed video labelling after subset for HMDB51 dataset Co-authored-by:
Eric Tang <etang21@stanford.edu> Co-authored-by:
Ryan Cao <ryancao@stanford.edu> * UCF 101 Labeling fixes - Analogous fix to HMDB51 to maintain correct labels after the train-test split - Additional change to the `select_fold` method in `ucf101.py` to correctly reflect the annotation format Co-authored-by:
Ryan Cao <ryancao@stanford.edu> Co-authored-by:
Eric Tang <etang21@stanford.edu>
-
- 01 Aug, 2019 1 commit
-
-
Francisco Massa authored
* Add docs to UCF101 and HMDB51 datasets * Remove extra line * Lint
-
- 24 Jul, 2019 1 commit
-
-
Francisco Massa authored
* Add HMDB51 and UCF101 * Remove debug code
-