* Add tests for NMS * Fix linting errors * Add DeviceGaurd to nms_cuda
* port nms extension from maskrcnn-benchmark * fix linting error