"src/vscode:/vscode.git/clone" did not exist on "f794432e814416120a6464a48696fd97912e64e1"
  1. 08 Jun, 2022 1 commit
    • Nicolas Hug's avatar
      [FBcode->GH] [quant][core][better-engineering] Rename files in quantized directory… (#6133) · a7e4fbdc
      Nicolas Hug authored
      * [quant][core][better-engineering] Rename files in quantized directory to conform with non-quantized countertpart filenames (#77037)
      
      Summary:
      X-link: https://github.com/pytorch/pytorch/pull/77037
      
      
      
      Names of analogous files in quantized directory (previously snake case) were inconsistent with
      their non-quantized filename counterparts (pascal case). This is the first of a series of PRs that changes
      all files in quantized (and sub-directories) dir to have pascal case.
      
      `aten/src/ATen/native/quantized/qconv_unpack.cpp` has not been renamed yet
      because (for reasons currently unknown) after making the name change, `import torch` produces the below error (`qlinear_unpack.cpp` renaming also seems to fail some phabricator CI tests for similar reasons). We suspect that these may be undefined errors and will revisit naming these files in a future PR.
      
      ```
      terminate called after throwing an instance of 'c10::Error'
        what():  Type c10::intrusive_ptr<ConvPackedParamsBase<2> > could not be converted to any of the known types.
      Exception raised from operator() at ../aten/src/ATen/core/jit_type.h:1735 (most recent call first):
      frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x55 (0x7f26745c0c65 in /data/users/dzdang/pytorch/torch/lib/libc10.so)
      frame #1: c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) + 0xb1 (0x7f26745bdcd1 in /data/users/dzdang/pytorch/torch/lib/libc10.so)
      frame #2: <unknown function> + 0x1494e24 (0x7f2663b14e24 in /data/users/dzdang/pytorch/torch/lib/libtorch_cpu.so)
      frame #3: <unknown function> + 0xfed0bc (0x7f266366d0bc in /data/users/dzdang/pytorch/torch/lib/libtorch_cpu.so)
      frame #4: c10::detail::infer_schema::make_function_schema(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >&&, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >&&, c10::ArrayRef<c10::detail::infer_schema::ArgumentDef>, c10::ArrayRef<c10::detail::infer_schema::ArgumentDef>) + 0x5a (0x7f266366d71a in /data/users/dzdang/pytorch/torch/lib/libtorch_cpu.so)
      frame #5: c10::detail::infer_schema::make_function_schema(c10::ArrayRef<c10::detail::infer_schema::ArgumentDef>, c10::ArrayRef<c10::detail::infer_schema::ArgumentDef>) + 0x7b (0x7f266366e06b in /data/users/dzdang/pytorch/torch/lib/libtorch_cpu.so)
      frame #6: <unknown function> + 0x1493f32 (0x7f2663b13f32 in /data/users/dzdang/pytorch/torch/lib/libtorch_cpu.so)
      frame #7: <unknown function> + 0xe227dd (0x7f26634a27dd in /data/users/dzdang/pytorch/torch/lib/libtorch_cpu.so)
      frame #8: <unknown function> + 0x14e0a (0x7f268c934e0a in /lib64/ld-linux-x86-64.so.2)
      ..........................truncated.............
      ```
      
      Reviewed By: malfet
      
      Differential Revision: D36862332
      
      Pulled By: dzdang
      
      fbshipit-source-id: 598c36656b4e71f906d940e7ff19ecf82d43031d
      
      * empty commit
      
      * empty commit
      
      * empty commit
      Co-authored-by: default avatardzdang <dzdang@umich.edu>
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      a7e4fbdc
  2. 22 Nov, 2021 1 commit
  3. 13 Apr, 2021 1 commit
  4. 30 Mar, 2021 1 commit
    • Nicolas Hug's avatar
      Add quantized version of nms (#3601) · f74bfab6
      Nicolas Hug authored
      * Add quantized version of nms
      
      * Added tests
      
      * Compute areas only once
      
      * remove calls to dequantize_val
      
      * fix return type for empty tensor
      
      * flake8
      
      * remove use of scale as it gets cancelled out
      
      * simpler int convertion in tests
      
      * explicitly set ovr to double
      
      * add tests for more values of scale and zero_point
      
      * comment about underflow
      
      * remove unnecessary accessor
      
      * properly convert to float for division
      
      * Add comments about underflow
      
      * explicitely cast coordinates to float to allow vectorization
      
      * clang
      
      * clang  again
      
      * hopefully OK now
      f74bfab6
  5. 04 Jan, 2021 1 commit
  6. 10 Dec, 2020 1 commit
    • Vasilis Vryniotis's avatar
      Restructuring C++ project: (#3146) · 7d831a2f
      Vasilis Vryniotis authored
      Summary:
      * Reduce unnecessary header inclusions in models and io.
      
      * Move autocast to separate folder and hide autograd implementation in an anonymous namespace.
      
      * Moving files in subfolders.
      
      Reviewed By: fmassa
      
      Differential Revision: D25461523
      
      fbshipit-source-id: 756eeb6848aacaa474de4825ed4c1045d17e2cea
      7d831a2f
  7. 08 Dec, 2020 1 commit
    • Vasilis Vryniotis's avatar
      Per file C++ Operator registration (#3135) · 3c33f367
      Vasilis Vryniotis authored
      * Moving deform_conv2d op registration.
      
      * Moving nms op registration.
      
      * Moving new_empty_tensor op registration.
      
      * Moving ps_roi_align op registration.
      
      * Moving ps_roi_pool op registration.
      
      * Moving roi_align op registration.
      
      * Moving roi_pool op registration.
      
      * Restoring headers for forward/backward and fixing styles.
      
      * Restoring the test hack on windows.
      
      * Stricter header inclusion.
      3c33f367
  8. 02 Dec, 2020 1 commit
    • Vasilis Vryniotis's avatar
      Encapsulate and Standardise C++ Ops (#3097) · 0ebbb0ab
      Vasilis Vryniotis authored
      * Encapsulate and standardize deform_conv2d (#3074)
      
      * Rename files.
      
      * Standardizing method names.
      
      * Adding anonymous namespaces.
      
      * Applying C++ naming rules and alinging variable names across headers and cpp files.
      
      * Syncing names across implementations.
      
      * Rename deform_conv2d.h to deform_conv2d.cpp
      
      * Use header files:
      - Create header files for kernel implementation and remove definitions from vision_*.h files.
      - Eliminate unnecessary headers and ensure all cpp include their headers.
      
      * Change the naming convention for kernel implementations.
      
      * Remove the _param postfix from the variables and standardizing names.
      
      * Exposing public forward/backward methods to the C++ API and moving methods around to minimize git blame changes.
      
      * Encapsulate and standardize nms (#3081)
      
      * Syncing, where possible, the names of functions across devices.
      
      * Adding all internal functions in anonymous namespaces.
      
      * Renaming C++/CUDA kernel files and moving operator code from header to cpp file.
      
      * Create foreach cpp file a separate header file with "public" functions.
      
      * Removing unnecessary repeated includes.
      
      * Update CMakeLists.txt to include all headers.
      
      * Encapsulate and standardize ps_roi_align (#3082)
      
      * Renaming C++ files & methods according to recommended naming conventions and aligning them with Python's API.
      Syncing, where possible, the names of functions across devices.
      
      * Adding all internal functions in anonymous namespaces.
      
      * Renaming C++/CUDA kernel files and moving operator code from header to cpp file.
      
      * Create foreach cpp file a separate header file with "public" functions.
      
      * Removing unnecessary repeated includes.
      
      * Encapsulate and standardize ps_roi_pool (#3084)
      
      * Renaming C++ files & methods according to recommended naming conventions and aligning them with Python's API.
      
      * Adding all internal functions in anonymous namespaces.
      
      * Renaming C++/CUDA kernel files and moving operator code from header to cpp file.
      
      * Create foreach cpp file a separate header file with "public" functions.
      
      * Removing unnecessary repeated includes.
      
      * Encapsulate and standardize roi_align (#3085)
      
      * Renaming C++ files & methods according to recommended naming conventions and aligning them with Python's API.
      
      * Adding all internal functions in anonymous namespaces.
      
      * Renaming C++/CUDA kernel files and moving operator code from header to cpp file.
      
      * Create foreach cpp file a separate header file with "public" functions.
      
      * Removing unnecessary repeated includes.
      
      * Encapsulate and standardize roi_pool  (#3088)
      
      * Renaming C++ files & methods according to recommended naming conventions and aligning them with Python's API.
      
      * Adding all internal functions in anonymous namespaces.
      
      * Syncing variable names between the cpp files and their header files.
      
      * Renaming C++/CUDA kernel files and moving operator code from header to cpp file.
      
      * Create foreach cpp file a separate header file with "public" functions.
      
      * Removing unnecessary repeated includes.
      
      * Encapsulate and standardize new_empty_tensor_op (#3089)
      
      * Renaming C++ files & methods according to recommended naming conventions and aligning them with Python's API.
      
      * Create foreach cpp file a separate header file with "public" functions.
      
      * Adding all internal functions in anonymous namespaces.
      
      * Convert to const ref all possible parameters.
      
      * Removing unnecessary repeated includes.
      
      * Encapsulate and standardize C++ Ops - Clean up (#3094)
      
      * Removing unnecessary repeated includes.
      
      * Remove unnecessary vision_cpu.h, vision_cuda.h, autocast.h.
      
      * Fixing naming convention and correcting method names on macros.
      
      * Turn on clang formatter for cu files and fixing broken styles.
      
      * Replace "#ifndef ... #define ... #endif" with "#pragma once" on header files.
      
      * Adding operator methods in vision::ops namespace. (#3096)
      
      * Adding operator methods in vision::ops namespace.
      
      * Replace general.h with macros.h
      
      * Adding vision.h to the necessary cpp files.
      0ebbb0ab
  9. 30 Oct, 2020 1 commit
    • Vasilis Vryniotis's avatar
      NMS code cleanup (#2907) · 455cd57c
      Vasilis Vryniotis authored
      * Clean up and refactor ROIAlign implementation:
      - Remove primitive const declaration from method names.
      - Remove unnecessary headers.
      - Aligning method names between cpu and cuda.
      
      * Adding back include for cpu.
      
      * Restoring method names of private methods to avoid conflicts.
      
      * Restore include headers.
      455cd57c
  10. 13 Oct, 2020 1 commit
    • vfdev's avatar
      Added rois shape check in C++ (#2794) · b8e93084
      vfdev authored
      * Added rois shape check in C++
      
      * Fixes code formatting
      
      * Remove accidental include
      
      * - Updated code according to the review
      - Replaced old AT_ASSERT/ERROR by new TORCH_CHECK
      b8e93084
  11. 09 Jul, 2020 1 commit
    • mcarilli's avatar
      [WIP] Allow autocast for 1.6 (#2384) · 0a8586c9
      mcarilli authored
      
      
      * Fixes Xiao's repro
      
      * Ports nms to use full dispatcher
      
      * Move HIPGuard to nms_cuda
      
      * clang-format
      
      * run models in test_models.py on GPU if available
      
      * Francisco's comment, also disable cuda model tests to see if CPU alone still passes
      
      * cuda tests now pass locally, although still not comparing to saved numerics
      
      * add note for thing to ask francisco
      
      * Allow cuda and cpu tests to share a data file
      
      * ignore suffix if unneeded
      
      * Skip autocast numerics checks for a few models
      
      * Add roi_align test
      Co-authored-by: default avatarMichael Carilli <mcarilli@nvidia.com>
      0a8586c9
  12. 07 Apr, 2020 1 commit
  13. 03 Apr, 2020 2 commits
  14. 06 Dec, 2019 1 commit
  15. 05 Nov, 2019 1 commit
  16. 29 Aug, 2019 1 commit
  17. 25 Jun, 2019 1 commit
  18. 23 May, 2019 1 commit
    • Varun Agrawal's avatar
      nms_cuda signature update (#945) · 249cfbf5
      Varun Agrawal authored
      Updated nms_cuda signature to accept detections and scores as separate tensors.
      This also required updating the indexing in the NMS CUDA kernel.
      
      Also made the iou_threshold parameter name consistent across implementations.
      249cfbf5
  19. 07 May, 2019 1 commit
    • Francisco Massa's avatar
      Add C++ ops to torchvision (#826) · dc3ac290
      Francisco Massa authored
      * Initial layout for layers with cpp extensions
      
      * Move files around
      
      * Fix import after move
      
      * Add support for multiple types to ROIAlign
      
      * Different organization
      
      CUDA extensions work now
      
      * Cleanups
      
      * Reduce memory requirements for backwards
      
      * Replace runtime_error by AT_ERROR
      
      * Add nms test
      
      * Add support for compilation using CPP extensions
      
      * Change folder structure
      
      * Add ROIPool cuda
      
      * Cleanups
      
      * Add roi_pool.py
      
      * Fix lint
      
      * Add initial structures folder for bounding boxes
      
      * Assertion macros compatible with pytorch master (#540)
      
      * Support for ROI Pooling (#592)
      
      * ROI Pooling with tests. Fix for cuda context in ROI Align.
      
      * renamed bottom and top to follow torch conventions
      
      * remove .type().tensor() calls in favor of the new approach to tensor initialization (#626)
      
      * Consistent naming for rois variable (#627)
      
      * remove .type().tensor() calls in favor of the new approach to tensor initialization
      
      * Consistent naming for rois variable in ROIPool
      
      * ROIPool: Support for all datatypes (#632)
      
      * Use of torch7 naming scheme for ROIAlign forward and backward
      
      * use common cuda helpers in ROIAlign
      
      * use .options() in favor of .type() where applicable
      
      * Added tests for forward pass of ROIAlign, as well as more consistent naming scheme for CPU vs CUDA
      
      * working ROIAlign cuda backwards pass
      
      * working ROIAlign backwards pass for CPU
      
      * added relevant headers for ROIAlign backwards
      
      * tests for ROIAlign layer
      
      * replace .type() with .options() for tensor initialization in ROIAlign layers
      
      * support for Half types in ROIAlign
      
      * gradcheck tests for ROIAlign
      
      * updated ROIPool on CPU to work with all datatypes
      
      * updated and cleaned tests for ROI Pooling
      
      * Fix rebase problem
      
      * Remove structures folder
      
      * Improve cleanup and bugfix in test_layers
      
      * Update C++ headers
      
      * Add CUDAGuard to cu files
      
      * Add more checks to layers
      
      * Add CUDA NMS and tests
      
      * Add multi-type support for NMS CUDA
      
      * Avoid using THCudaMalloc
      
      * Add clang-format and reformat c++ code
      
      * Remove THC includes
      
      * Rename layers to ops
      
      * Add documentation and rename functions
      
      * Improve the documentation a bit
      
      * Fix some lint errors
      
      * Fix remaining lint inssues
      
      * Area computation doesn't add +1 in NMS
      
      * Update CI to use PyTorch nightly
      
      * Make NMS return indices sorted according to the score
      
      * Address reviewer comments
      
      * Lint fixes
      
      * Improve doc for roi_align and roi_pool
      
      * move to xenial
      
      * Fix bug pointed by @lopuhin
      
      * Fix RoIPool reference implementation in Python 2
      
      Also fixes a bug in the clip_boxes_to_image -- this function needs a test!
      
      * Remove change in .travis
      dc3ac290