1. 04 Oct, 2021 1 commit
    • Philip Meier's avatar
      Add ufmt (usort + black) as code formatter (#4384) · 5f0edb97
      Philip Meier authored
      
      
      * add ufmt as code formatter
      
      * cleanup
      
      * quote ufmt requirement
      
      * split imports into more groups
      
      * regenerate circleci config
      
      * fix CI
      
      * clarify local testing utils section
      
      * use ufmt pre-commit hook
      
      * split relative imports into local category
      
      * Revert "split relative imports into local category"
      
      This reverts commit f2e224cde2008c56c9347c1f69746d39065cdd51.
      
      * pin black and usort dependencies
      
      * fix local test utils detection
      
      * fix ufmt rev
      
      * add reference utils to local category
      
      * fix usort config
      
      * remove custom categories sorting
      
      * Run pre-commit without fixing flake8
      
      * got a double import in merge
      Co-authored-by: default avatarNicolas Hug <nicolashug@fb.com>
      5f0edb97
  2. 29 Sep, 2021 1 commit
  3. 24 Sep, 2021 1 commit
  4. 20 Aug, 2021 1 commit
  5. 14 Jul, 2021 1 commit
  6. 17 Jun, 2021 1 commit
  7. 14 Jun, 2021 1 commit
  8. 04 Jun, 2021 1 commit
  9. 01 Jun, 2021 1 commit
  10. 21 May, 2021 1 commit
  11. 17 May, 2021 1 commit
  12. 11 May, 2021 1 commit
  13. 10 May, 2021 1 commit
  14. 08 Apr, 2021 1 commit
    • Nicolas Hug's avatar
      Add Quantized version of RoIAlign (#3624) · ad9cc62a
      Nicolas Hug authored
      * WIP
      
      * clang
      
      * docs
      
      * extracted out common utils
      
      * Use better quantization function and pass tensors as parameters
      
      * proper dequantization
      
      * Some tests
      
      * Dequantization optimization, seems to gain a few ms
      
      * clang-format
      
      * again
      
      * more correct test. Had to remove optimization although it almost works
      
      * Also test aligned=True
      
      * remove useless part
      
      * more docs and comments
      
      * Put back optimization with more robust test
      
      * Added check for index upper bound
      
      * avoid possible overflow
      
      * Move common function into common.h
      
      * oops
      
      * scale=1,zero_point=0 makes more sense
      
      * Force batch size of 1 to prevent any indexingbug
      
      * format
      
      * format again
      
      * updated docstring
      
      * put back description comment for pre_calc_bilinear_interpolate
      
      * revert most changes to docstring as it's taken care of in another PR
      ad9cc62a
  15. 30 Mar, 2021 1 commit
    • Nicolas Hug's avatar
      Add quantized version of nms (#3601) · f74bfab6
      Nicolas Hug authored
      * Add quantized version of nms
      
      * Added tests
      
      * Compute areas only once
      
      * remove calls to dequantize_val
      
      * fix return type for empty tensor
      
      * flake8
      
      * remove use of scale as it gets cancelled out
      
      * simpler int convertion in tests
      
      * explicitly set ovr to double
      
      * add tests for more values of scale and zero_point
      
      * comment about underflow
      
      * remove unnecessary accessor
      
      * properly convert to float for division
      
      * Add comments about underflow
      
      * explicitely cast coordinates to float to allow vectorization
      
      * clang
      
      * clang  again
      
      * hopefully OK now
      f74bfab6
  16. 10 Mar, 2021 2 commits
  17. 08 Mar, 2021 1 commit
    • Nicolas Hug's avatar
      [OPS, IMP] New batched_nms implementation (#3426) · 414427dd
      Nicolas Hug authored
      
      
      * new batched_nms implem
      
      * flake8
      
      * hopefully fix torchscipt tests
      
      * Use where instead of nonzero
      
      * Use same threshold (4k) for CPU and GPU
      
      * Remove use of argsort
      
      * use views again
      
      * remove print
      
      * trying stuff, I don't know what's going on
      
      * previous passed onnx checks so the error isn't in _vanilla func. Trying to return vanilla now
      
      * add tracing decorators
      
      * cleanup
      
      * wip
      
      * ignore new path with ONNX
      
      * use vanilla if tracing...????
      
      * Remove script_if_tracing decorator as it was conflicting with _is_tracing
      
      * flake8
      
      * Improve coverage
      Co-authored-by: default avatarFrancisco Massa <fvsmassa@gmail.com>
      414427dd
  18. 15 Feb, 2021 1 commit
  19. 28 Jan, 2021 1 commit
  20. 04 Jan, 2021 1 commit
    • Vasilis Vryniotis's avatar
      Speed up CI runtime (#3189) · 4d2d8bb0
      Vasilis Vryniotis authored
      * Speedup test_ucf101 (#2623 
      
      * Speedup Cmake builds (#3186)
      
      * Speedup test_autoaugment (#3190)
      
      * Speedup DeformConvTester (#3191)
      
      * Speedup InceptionV3 and GoogleNet on Windows (#3196)
      4d2d8bb0
  21. 15 Dec, 2020 1 commit
  22. 11 Dec, 2020 1 commit
  23. 27 Nov, 2020 1 commit
  24. 09 Nov, 2020 1 commit
  25. 30 Oct, 2020 2 commits
  26. 21 Oct, 2020 2 commits
    • F-G Fernandez's avatar
      Added eps in the __repr__ of FrozenBN (#2852) · 005355bd
      F-G Fernandez authored
      * feat: Updated FrozenBN eps to align with BatchNorm
      
      * feat: Added eps to __repr__ of FrozenBN
      
      * test: Updated unittest of __repr__ for FrozenBN
      
      * test: Updated unittest for eps value in BN and FrozenBN
      
      * fix: Revert FrozenBN eps value
      
      * test: Revert test on eps alignment between FrozenBN and BN
      005355bd
    • F-G Fernandez's avatar
      Added __repr__ to MultiScaleRoIAlign (#2840) · e280f614
      F-G Fernandez authored
      
      
      * feat: Added __repr__ to MultiScaleRoIAlign
      
      * test: Added unittest for __repr__ of MultiScaleRoIAlign
      
      * feat: Added feature map names in __repr__
      
      * test: Updated unittest
      Co-authored-by: default avatarvfdev <vfdev.5@gmail.com>
      e280f614
  27. 11 Oct, 2020 1 commit
  28. 07 Oct, 2020 1 commit
  29. 01 Oct, 2020 1 commit
    • Aditya Oke's avatar
      Adds bounding boxes conversion (#2710) · e70c91a9
      Aditya Oke authored
      
      
      * adds boxes conversion
      
      * adds documentation
      
      * adds xywh tests
      
      * fixes small typo
      
      * adds tests
      
      * Remove sphinx theme
      
      * corrects assertions
      
      * cleans code as per suggestion
      Signed-off-by: default avatarAditya Oke <okeaditya315@gmail.com>
      
      * reverts assertion
      
      * fixes to assertEqual
      
      * fixes inplace operations
      
      * Adds docstrings
      
      * added documentation
      
      * changes tests
      
      * moves code to box_convert
      
      * adds more tests
      
      * Apply suggestions from code review
      
      Let's leave those changes to a separate PR
      
      * fixes documentation
      Co-authored-by: default avatarFrancisco Massa <fvsmassa@gmail.com>
      e70c91a9
  30. 24 Sep, 2020 1 commit
    • Aditya Oke's avatar
      Adds Generalized IOU (#2642) · 6e10e3f8
      Aditya Oke authored
      * tries adding genaralized_iou
      
      * fixes linting
      
      * Adds docs for giou, iou and box area
      
      * fixes lint
      
      * removes docs to fixup in other PR
      
      * linter fix
      
      * Cleans comments
      
      * Adds tests for box area, iou and giou
      
      * typo fix for testCase
      
      * fixes typo
      
      * fixes box area test
      
      * fixes implementation
      
      * updates tests to tolerance
      6e10e3f8
  31. 24 Aug, 2020 1 commit
    • vfdev's avatar
      Fixes crash in deformable convolutions (2598) (#2604) · 8c32666b
      vfdev authored
      * [WIP] Fixes #2598
      - Adjusted num_kernels and batch_size according to kMaxGridNum * CUDA_NUM_THREADS
      - Tests to add
      
      * - Redefined kMaxGridNum as max grid according to current CUDA device
      - Added test to check the code from issue and compared grads CPU/CUDA
      
      * Fixed static kMaxGridNum evaluation to dynamic
      8c32666b
  32. 09 Jul, 2020 1 commit
    • mcarilli's avatar
      [WIP] Allow autocast for 1.6 (#2384) · 0a8586c9
      mcarilli authored
      
      
      * Fixes Xiao's repro
      
      * Ports nms to use full dispatcher
      
      * Move HIPGuard to nms_cuda
      
      * clang-format
      
      * run models in test_models.py on GPU if available
      
      * Francisco's comment, also disable cuda model tests to see if CPU alone still passes
      
      * cuda tests now pass locally, although still not comparing to saved numerics
      
      * add note for thing to ask francisco
      
      * Allow cuda and cpu tests to share a data file
      
      * ignore suffix if unneeded
      
      * Skip autocast numerics checks for a few models
      
      * Add roi_align test
      Co-authored-by: default avatarMichael Carilli <mcarilli@nvidia.com>
      0a8586c9
  33. 01 Jun, 2020 1 commit
  34. 11 May, 2020 2 commits
    • F-G Fernandez's avatar
      Added eps attribute to FrozenBatchNorm2d (#2190) · 7a2d0618
      F-G Fernandez authored
      * feat: Added eps argument to FrozenBatchNorm2d
      
      * test: Added unittest for eps addition in FrozenBatchNorm2d
      
      See #2169
      
      * fix: Reverted forward changes for JIT fuser
      
      * fix: Added back n argument for backward-compatibility
      
      * fix: Fixed FrozenBatchNorm2d forward
      
      Added back eps
      
      * feat: Specified deprecation warnings in FrozenBatchNorm2d
      
      * test: Added unittest for deprecation warninig in FrozenBatchNorm2d
      
      * style: Fixed lint
      
      * style: Fixed block comment lint
      7a2d0618
    • F-G Fernandez's avatar
      fix: Restored support of tuple of Tensors for region pooling ops (#2199) · a09d129c
      F-G Fernandez authored
      * feat: Restored support of tuple of Tensors for roi_align & roi_pool
      
      * test: Added unittest for Tensor sequence support by region pooling
      
      * test: Fixed typo in unittest
      
      * test: Fixed data type
      
      * test: Fixed roi pooling tensor unittest
      
      * test: Fixed box format conversion
      a09d129c
  35. 05 May, 2020 1 commit
    • F-G Fernandez's avatar
      Added number of features in FrozenBatchNorm2d __repr__ (#2168) · 5db8998a
      F-G Fernandez authored
      * feat: Added number of features in FrozenBatchNorm2d repr
      
      While BatchNorm layers have extensive information in their repr, FrozenBatchNorm2d has one
      
      * refactor: Refactored FrozenBatchNorm2d __repr__
      
      * test: Added unittest for FrozenBatchNorm2d __repr__
      
      * style: Removed blank lines in test_ops
      
      * refactor: Avoids creating an extra attribute for __repr__
      
      * style: Switched __repr__ to f-string
      
      Since support of Python version ealier than 3.6 have been dropped, f-string can be used.
      
      * fix: Fixed typo in __repr__
      
      * style: Switched unittest .format to f-string
      5db8998a
  36. 03 Apr, 2020 1 commit
    • Brian Hart's avatar
      improve stability of test_nms_cuda (#2044) · e61538cb
      Brian Hart authored
      * improve stability of test_nms_cuda
      
      This change addresses two issues:
      
      _create_tensors_with_iou() creates test data for the NMS tests. It
      takes care to ensure at least one pair of boxes (1st and last) have
      IoU around the threshold for the test. However, the constructed
      IoU for that pair is _so_ close to the threshold that rounding
      differences (presumably) between CPU and CUDA implementations may
      result in one suppressing a box in the pair and the other not.
      Adjust the construction to ensure the IoU for the box pair is
      near the threshold, but far-enough above that both implementations
      should agree.
      
      Where 2 boxes have nearly or exactly the same score, the CPU and
      CUDA implementations may order them differently. Adjust
      test_nms_cuda() to check only that the non-suppressed box lists
      include the same members, without regard for ordering.
      
      * adjust assertion in test_nms_cuda
      
      The CPU and CUDA nms implementations each sort the box scores
      as part of their work, but the sorts they use are not stable. So
      boxes with the same score maybe be processed in opposite order
      by the two implmentations.
      
      Relax the assertion in test_nms_cuda (following the model in
      pytorch's test_topk()) to allow the test to pass if the output
      differences are caused by similarly-scored boxes.
      
      * improve stability of test_nms_cuda
      
      Adjust _create_tensors_with_iou() to ensure we create at least
      one box just over threshold that should be suppressed.
      e61538cb