Commits · b93d5ee2ddcf2a9876b5871cbb958016e263336b · OpenDAS / vision

30 Oct, 2020 5 commits

PSROIPool + Dispatcher + Autocast + Code Cleanup (#2926) · b93d5ee2

Vasilis Vryniotis authored Oct 30, 2020

* Fixing types.

* Dispatcher + Autocast.

* + Autograd.

* Formatting.

* Clean up and refactor PSROIPool implementation:
- Remove primitive const declaration from method names.
- Using references when possible.
- Fix variable naming.

* Restore include headers.

* New line at end of file.

* Resolving conflict, final cleanup, ordering method consistently across files.

b93d5ee2

ROIPool + Dispatcher + Autocast + Code Cleanup (#2922) · 0125a7dc

Vasilis Vryniotis authored Oct 30, 2020

* Fixing types.

* Dispatcher + Autocast.

* + Autograd.

* Formatting.

* Fixing return casting with autocast.

* Clean up and refactor ROIPool implementation:
- Remove primitive const declaration from method names.
- Using references when possible.

* Restore include headers.

* New line at end of file.

0125a7dc

ROIAlign code cleanup (#2906) · f0c92d85

Vasilis Vryniotis authored Oct 30, 2020

* Clean up and refactor ROIAlign implementation:
- Remove primitive const declaration from method names.
- Passing as const ref instead of value where possible.
- Remove unnecessary headers.

* Adding back include for cpu.

* Restore include headers.

f0c92d85

PSROIAlign + Dispatcher + Autocast + Code Cleanup (#2928) · b06e43d6

Vasilis Vryniotis authored Oct 30, 2020

* Fixing types.

* Dispatcher + Autocast.

* + Autograd.

* Clean up and refactor PSROIAlign implementation:
- Remove primitive const declaration from method names.
- Using references when possible.
- Sync naming of internal methods with other ops.

* Restoring names of internal methods to avoid conflicts.

* Restore include headers.

b06e43d6

NMS code cleanup (#2907) · 455cd57c

Vasilis Vryniotis authored Oct 30, 2020

* Clean up and refactor NMS implementation:
- Remove primitive const declaration from method names.
- Remove unnecessary headers.
- Aligning method names between cpu and cuda.

* Adding back include for cpu.

* Restoring method names of private methods to avoid conflicts.

* Restore include headers.

455cd57c

27 Oct, 2020 1 commit

Port DeformConv to use the Dispatcher and support Autocast (#2898) · e8b6e3f0

Vasilis Vryniotis authored Oct 27, 2020

* Splitting tuples of stride, padding and dilation of DeformConv.

* Fixing types.

* Dispatcher + Autocast.

* + Autograd.

* Moving contiguous() conversions away from the dispatcher and into the implementations.

* Removing rvalue references.

e8b6e3f0

16 Oct, 2020 1 commit

Ensure torchvision operators are added in C++ (#2798) · adfc15c4

bmanga authored Oct 16, 2020

* Ensure torchvision operators are registered in C++ via weak symbols

* Add note to README on how to ensure that torchvision operators are available in C++

* Fix dllimport/dllexport on windows, format files

* Factor out common macros in single file

* Expose cuda_version in the API, use it to avoid pruning of ops initializer

adfc15c4

14 Sep, 2020 1 commit

PR: Add CMake build and function tracing tests (#2577) · a075d629

Edgar Andrés Margffoy Tuay authored Sep 14, 2020



* Add CMake build pipeline

* Add CMake build workflow

* Add executable permissions to script

* Install cmake on Windows/MacOS

* Install conda-build before setting up MSVC

* Install PyTorch from nightly

* Do not use conda-build variables

* Add path to CMake

* Install libpng and libjpeg

* Perform make

* Call msbuild on Windows

* Add missing yq

* Use vc_env_helper

* Use string instruction

* Escape configuration option

* Remove configuration flag

* Try to pass -p

* Use caret to escape equal sign

* Escape string option in Windows

* Try to call other bat

* Remove Windows/GPU CMake

* Add tracing cpp test

* Script model instead of tracing it

* Try to register operators manually

* Use manylinux-cuda102

* Activate conda env on Linux

* Build and run sample tracing test

* Add empty echo

* Remove unnecessary register

* Copy headers on Mac

* Revert to 2xlarge

* Include /usr/local/include on Mac

* Install pillow on Windows

* Install future

* Install torchvision on Windows

* Set include flag

* Add torchlib to PATH

* Normalize path via cygpath

* Register ops on Windows

* Minor error correction

* Register CPU/GPU ops on DLL library and register ops via reference

* Install dataclasses

* Install dataclasses using pip

* Address clang formatting issue

* Try to use an actual GPU instance on Linux

* Remove extra environment section

* Declare environment explicitly

* Regenerate

* Pass env variables to Docker

* Regenerate circleci

* Test tracing on GPU

* Use GPU medium

* Regenerate

* Use cuda101

* Regenerate

* Do not use pre-trained weights

Avoids having to download pretrained files, which could cause flaky tests.

Co-authored-by: Francisco Massa <fvsmassa@gmail.com>

a075d629

09 Jul, 2020 1 commit

[WIP] Allow autocast for 1.6 (#2384) · 0a8586c9

mcarilli authored Jul 09, 2020



* Fixes Xiao's repro

* Ports nms to use full dispatcher

* Move HIPGuard to nms_cuda

* clang-format

* run models in test_models.py on GPU if available

* Francisco's comment, also disable cuda model tests to see if CPU alone still passes

* cuda tests now pass locally, although still not comparing to saved numerics

* add note for thing to ask francisco

* Allow cuda and cpu tests to share a data file

* ignore suffix if unneeded

* Skip autocast numerics checks for a few models

* Add roi_align test

Co-authored-by: Michael Carilli <mcarilli@nvidia.com>

0a8586c9

30 Jun, 2020 1 commit

Port roi_align to actually use dispatcher (#2366) · 44806038

Edward Z. Yang authored Jun 30, 2020



* Switch torchvision registrations to new operator registration API.

This is still registering everything as catchalls, so we're really just
moving deck chairs around, but payoff is coming soon.

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

* Port roi_align to actually use dispatcher

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

44806038

04 Mar, 2020 1 commit

`aligned` flag in ROIAlign (#1908) · e1e975f9

AhnDW authored Mar 04, 2020

* Aligned flag in the interfaces

* Aligned flag in the impl, and remove unused comments

* Handling empty bin in forward

* Remove raise error in roi_width

* Aligned flag in the Testcodes

e1e975f9
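To illustrate what an `aligned` flag of this kind changes, here is a minimal C++ sketch of the coordinate handling for one axis of a box. It is an illustration under stated assumptions, not the code from this commit: the half-pixel shift and the minimum-size clamp reflect the commonly described ROIAlign behaviour, and the names `RoiSpan` and `scale_roi_axis` are hypothetical.

```cpp
#include <algorithm>

// Hypothetical helper type: scaled extent of a box along one axis.
struct RoiSpan {
  float start;
  float end;
  float size;
};

// Sketch (assumption): with aligned == true the scaled coordinates are
// shifted by half a pixel and the size is used as-is; with aligned == false
// there is no shift and the size is forced to be at least 1.
inline RoiSpan scale_roi_axis(float lo, float hi, float spatial_scale,
                              bool aligned) {
  const float offset = aligned ? 0.5f : 0.0f;
  const float start = lo * spatial_scale - offset;
  const float end = hi * spatial_scale - offset;
  float size = end - start;
  if (!aligned) {
    // Legacy path: clamp tiny or empty boxes to one pixel.
    size = std::max(size, 1.0f);
  }
  return RoiSpan{start, end, size};
}
```

Under these assumptions, a box spanning 2.0 to 2.5 keeps a size of 0.5 when `aligned` is true and is widened to 1.0 when it is false, which is why the aligned path has to cope with very small or empty bins.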

04 Dec, 2019 1 commit

Add Deformable Convolution operation. (#1586) · 52b8685b

pedrofreire authored Dec 04, 2019

* Add Deformable Convolution operation.

This adds the deformable convolution operation, as described in Deformable Convolutional Networks (https://arxiv.org/abs/1703.06211).

- The code is based on https://github.com/open-mmlab/mmdetection/blob/master/mmdet/ops/dcn/src/deform_conv_cuda.cpp ; the whole code was modified and refactored to remove redundancies and increase clarity, and to adapt it to torchvision.

- The CPU part is a direct copy of the CUDA code; it might make sense to do follow-up adjustments in the CPU code to simplify or optimize it, or to reuse functionality between CPU and CUDA.

- We also add tests (with a non-trivial set of parameters); they can be made more robust by randomizing the parameters and executing multiple times.

* Update DeformConv to be more consistent w/ Conv2d

- rename some variables and arguments to match Conv2d;
- add optional bias;
- add weight, offset and bias as module parameters;
- remove the n_parallel_imgs parameter;
- Fix __repr__;
- etc.

Initialization of weight and bias is the same as in Conv2d, and
initialization of offsets to zero is the same as in the paper.

This also includes some other small unrelated fixes/improvements.

* Apply clang-format in DeformConv files.

* Import Optional type annotation

* Remove offset param from DeformConv2d module

- We pass the offset in the forward of DeformConv2d, instead of having
an internal parameter. This adds some complexity to creating the module
(e.g. now you have to worry about the output size, to create the
offset), but it gives more flexibility.
- We also use make_tuple for tuple creation, in an attempt to fix error
w/ older compilers.

* Replace abs by std::abs

Old gcc versions were giving wrong results here, because they would
resolve abs as int -> int, thus causing undesired truncation. Replacing
abs by std::abs should allow for correct overloading of abs as float -> float.

* Reorder declarations for clarity

* Reorder weight and offset args in deform_conv2d

We place offset arg before the weight arg, to be more
consistent with DeformConv2d.forward(input, offset)

* Replace abs by std::abs in DeformConv_cuda

52b8685b
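The "Replace abs by std::abs" note above describes a truncation bug caused by overload resolution. The following C++ sketch reproduces the effect in a compiler-independent way. The function `truncating_abs` is a hypothetical stand-in that forces the int -> int path explicitly; it is not code from the commit.

```cpp
#include <cmath>    // std::abs overloads for floating-point types
#include <cstdlib>  // std::abs overloads for integer types

// Hypothetical stand-in for what happens when abs resolves as int -> int:
// the argument is converted to int first, dropping the fractional part.
inline float truncating_abs(float v) {
  return static_cast<float>(std::abs(static_cast<int>(v)));
}

// With <cmath> included, std::abs selects the float overload, so the
// fractional part is preserved.
inline float correct_abs(float v) {
  return std::abs(v);
}
```

For an input of -2.5f the truncating variant yields 2 while the `std::abs` variant yields 2.5, which matches the kind of wrong result the commit message attributes to old gcc versions.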

16 Oct, 2019 1 commit

Implementation for Position-sensitive ROI Pool/Align [updated] (#1410) · 896d7ec7

Lukas Bommes authored Oct 16, 2019

* added PSRoiAlign and PSRoiPool with C++ autograd and torch ops

* fixed linter errors

* fixed linter errors 2

* fixed linter errors 3

896d7ec7

25 Jun, 2019 1 commit

Renamed vision.h files to vision_cpu.h and vision_cuda.h (#1051) · 44c66270

Shahriar authored Jun 26, 2019

44c66270

23 May, 2019 1 commit

nms_cuda signature update (#945) · 249cfbf5

Varun Agrawal authored May 23, 2019

Updated nms_cuda signature to accept detections and scores as separate tensors.
This also required updating the indexing in the NMS CUDA kernel.

Also made the iou_threshold parameter name consistent across implementations.

249cfbf5

07 May, 2019 1 commit

Add C++ ops to torchvision (#826) · dc3ac290

Francisco Massa authored May 07, 2019

* Initial layout for layers with cpp extensions

* Move files around

* Fix import after move

* Add support for multiple types to ROIAlign

* Different organization

CUDA extensions work now

* Cleanups

* Reduce memory requirements for backwards

* Replace runtime_error by AT_ERROR

* Add nms test

* Add support for compilation using CPP extensions

* Change folder structure

* Add ROIPool cuda

* Cleanups

* Add roi_pool.py

* Fix lint

* Add initial structures folder for bounding boxes

* Assertion macros compatible with pytorch master (#540)

* Support for ROI Pooling (#592)

* ROI Pooling with tests. Fix for cuda context in ROI Align.

* renamed bottom and top to follow torch conventions

* remove .type().tensor() calls in favor of the new approach to tensor initialization (#626)

* Consistent naming for rois variable (#627)

* remove .type().tensor() calls in favor of the new approach to tensor initialization

* Consistent naming for rois variable in ROIPool

* ROIPool: Support for all datatypes (#632)

* Use of torch7 naming scheme for ROIAlign forward and backward

* use common cuda helpers in ROIAlign

* use .options() in favor of .type() where applicable

* Added tests for forward pass of ROIAlign, as well as more consistent naming scheme for CPU vs CUDA

* working ROIAlign cuda backwards pass

* working ROIAlign backwards pass for CPU

* added relevant headers for ROIAlign backwards

* tests for ROIAlign layer

* replace .type() with .options() for tensor initialization in ROIAlign layers

* support for Half types in ROIAlign

* gradcheck tests for ROIAlign

* updated ROIPool on CPU to work with all datatypes

* updated and cleaned tests for ROI Pooling

* Fix rebase problem

* Remove structures folder

* Improve cleanup and bugfix in test_layers

* Update C++ headers

* Add CUDAGuard to cu files

* Add more checks to layers

* Add CUDA NMS and tests

* Add multi-type support for NMS CUDA

* Avoid using THCudaMalloc

* Add clang-format and reformat c++ code

* Remove THC includes

* Rename layers to ops

* Add documentation and rename functions

* Improve the documentation a bit

* Fix some lint errors

* Fix remaining lint issues

* Area computation doesn't add +1 in NMS

* Update CI to use PyTorch nightly

* Make NMS return indices sorted according to the score

* Address reviewer comments

* Lint fixes

* Improve doc for roi_align and roi_pool

* move to xenial

* Fix bug pointed out by @lopuhin

* Fix RoIPool reference implementation in Python 2

Also fixes a bug in the clip_boxes_to_image -- this function needs a test!

* Remove change in .travis

dc3ac290
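Two of the NMS notes in the commit above ("Area computation doesn't add +1 in NMS" and "Make NMS return indices sorted according to the score") can be made concrete with a small CPU sketch. This is a plain greedy NMS written for illustration only, assuming boxes in (x1, y1, x2, y2) form and one score per box; `Box`, `box_area`, `box_iou` and `nms_sketch` are hypothetical names, not the torchvision implementation.

```cpp
#include <algorithm>
#include <cstddef>
#include <numeric>
#include <vector>

// Hypothetical box type in (x1, y1, x2, y2) form.
struct Box {
  float x1, y1, x2, y2;
};

// Area as width * height, with no "+1" added to either side.
inline float box_area(const Box& b) {
  return (b.x2 - b.x1) * (b.y2 - b.y1);
}

inline float box_iou(const Box& a, const Box& b) {
  const float iw = std::max(0.0f, std::min(a.x2, b.x2) - std::max(a.x1, b.x1));
  const float ih = std::max(0.0f, std::min(a.y2, b.y2) - std::max(a.y1, b.y1));
  const float inter = iw * ih;
  const float uni = box_area(a) + box_area(b) - inter;
  return uni > 0.0f ? inter / uni : 0.0f;
}

// Greedy NMS sketch. Assumes scores.size() == boxes.size().
// Returns kept indices ordered by decreasing score.
std::vector<std::size_t> nms_sketch(const std::vector<Box>& boxes,
                                    const std::vector<float>& scores,
                                    float iou_threshold) {
  std::vector<std::size_t> order(boxes.size());
  std::iota(order.begin(), order.end(), std::size_t{0});
  std::stable_sort(order.begin(), order.end(),
                   [&](std::size_t a, std::size_t b) {
                     return scores[a] > scores[b];
                   });

  std::vector<bool> suppressed(boxes.size(), false);
  std::vector<std::size_t> keep;
  for (std::size_t i = 0; i < order.size(); ++i) {
    const std::size_t cur = order[i];
    if (suppressed[cur]) continue;
    keep.push_back(cur);
    // Suppress every lower-scored box that overlaps the kept one too much.
    for (std::size_t j = i + 1; j < order.size(); ++j) {
      const std::size_t other = order[j];
      if (!suppressed[other] &&
          box_iou(boxes[cur], boxes[other]) > iou_threshold) {
        suppressed[other] = true;
      }
    }
  }
  return keep;
}
```

Because candidates are visited in descending score order and appended as they are kept, the returned indices come out sorted by score without a separate sorting pass at the end.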