1. 30 Aug, 2023 1 commit
  2. 27 Jul, 2023 1 commit
  3. 13 Jul, 2023 1 commit
  4. 08 Aug, 2022 1 commit
  5. 22 Jul, 2022 1 commit
  6. 21 Jun, 2022 1 commit
  7. 15 Jun, 2022 1 commit
    • Lezwon Castelino's avatar
      Add SimpleCopyPaste augmentation (#5825) · bbc1aac8
      Lezwon Castelino authored
      
      
      * added simple POC
      
      * added jitter and crop options
      
      * added references
      
      * moved simplecopypaste to detection module
      
      * working POC for simple copy paste in detection
      
      * added comments
      
      * remove transforms from class
      updated the labels
      added gaussian blur
      
      * removed loop for mask calculation
      
      * replaced Gaussian blur with functional api
      
      * added inplace operations
      
      * added changes to accept tuples instead of tensors
      
      * - make copy paste functional
      - make only one copy of batch and target
      
      * add inplace support within copy paste functional
      
      * Updated code for copy-paste transform
      
      * Fixed code formatting
      
      * [skip ci] removed manual thresholding
      
      * Replaced cropping by resizing data to paste
      
      * Removed inplace arg (as useless) and put a check on iscrowd target
      
      * code-formatting
      
      * Updated copypaste op to make it torch scriptable
      Added fallbacks to support LSJ
      
      * Fixed flake8
      
      * Updates according to the review
      Co-authored-by: default avatarvfdev-5 <vfdev.5@gmail.com>
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      bbc1aac8
  8. 10 May, 2022 1 commit
  9. 03 May, 2022 1 commit
  10. 01 Apr, 2022 1 commit
  11. 22 Mar, 2022 1 commit
    • Vasilis Vryniotis's avatar
      Port Multi-weight support from prototype to main (#5618) · 11bd2eaa
      Vasilis Vryniotis authored
      
      
      * Moving basefiles outside of prototype and porting Alexnet, ConvNext, Densenet and EfficientNet.
      
      * Porting googlenet
      
      * Porting inception
      
      * Porting mnasnet
      
      * Porting mobilenetv2
      
      * Porting mobilenetv3
      
      * Porting regnet
      
      * Porting resnet
      
      * Porting shufflenetv2
      
      * Porting squeezenet
      
      * Porting vgg
      
      * Porting vit
      
      * Fix docstrings
      
      * Fixing imports
      
      * Adding missing import
      
      * Fix mobilenet imports
      
      * Fix tests
      
      * Fix prototype tests
      
      * Exclude get_weight from models on test
      
      * Fix init files
      
      * Porting googlenet
      
      * Porting inception
      
      * porting mobilenetv2
      
      * porting mobilenetv3
      
      * porting resnet
      
      * porting shufflenetv2
      
      * Fix test and linter
      
      * Fixing docs.
      
      * Porting Detection models (#5617)
      
      * fix inits
      
      * fix docs
      
      * Port faster_rcnn
      
      * Port fcos
      
      * Port keypoint_rcnn
      
      * Port mask_rcnn
      
      * Port retinanet
      
      * Port ssd
      
      * Port ssdlite
      
      * Fix linter
      
      * Fixing tests
      
      * Fixing tests
      
      * Fixing vgg test
      
      * Porting Optical Flow, Segmentation, Video models (#5619)
      
      * Porting raft
      
      * Porting video resnet
      
      * Porting deeplabv3
      
      * Porting fcn and lraspp
      
      * Fixing the tests and linter
      
      * Porting docs, examples, tutorials and galleries (#5620)
      
      * Fix examples, tutorials and gallery
      
      * Update gallery/plot_optical_flow.py
      Co-authored-by: default avatarNicolas Hug <contact@nicolas-hug.com>
      
      * Fix import
      
      * Revert hardcoded normalization
      
      * fix uncommitted changes
      
      * Fix bug
      
      * Fix more bugs
      
      * Making resize optional for segmentation
      
      * Fixing preset
      
      * Fix mypy
      
      * Fixing documentation strings
      
      * Fix flake8
      
      * minor refactoring
      Co-authored-by: default avatarNicolas Hug <contact@nicolas-hug.com>
      
      * Resolve conflict
      
      * Porting model tests (#5622)
      
      * Porting tests
      
      * Remove unnecessary variable
      
      * Fix linter
      
      * Move prototype to extended tests
      
      * Fix download models job
      
      * Update CI on Multiweight branch to use the new weight download approach (#5628)
      
      * port Pad to prototype transforms (#5621)
      
      * port Pad to prototype transforms
      
      * use literal
      
      * Bump up LibTorchvision version number for Podspec to release Cocoapods (#5624)
      Co-authored-by: default avatarAnton Thomma <anton@pri.co.nz>
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      
      * pre-download model weights in CI docs build (#5625)
      
      * pre-download model weights in CI docs build
      
      * move changes into template
      
      * change docs image
      
      * Regenerated config.yml
      Co-authored-by: default avatarPhilip Meier <github.pmeier@posteo.de>
      Co-authored-by: default avatarAnton Thomma <11010310+thommaa@users.noreply.github.com>
      Co-authored-by: default avatarAnton Thomma <anton@pri.co.nz>
      
      * Porting reference scripts and updating presets (#5629)
      
      * Making _preset.py classes
      
      * Remove support of targets on presets.
      
      * Rewriting the video preset
      
      * Adding tests to check that the bundled transforms are JIT scriptable
      
      * Rename all presets from *Eval to *Inference
      
      * Minor refactoring
      
      * Remove --prototype and --pretrained from reference scripts
      
      * remove  pretained_backbone refs
      
      * Corrections and simplifications
      
      * Fixing bug
      
      * Fixing linter
      
      * Fix flake8
      
      * restore documentation example
      
      * minor fixes
      
      * fix optical flow missing param
      
      * Fixing commands
      
      * Adding weights_backbone support in detection and segmentation
      
      * Updating the commands for InceptionV3
      
      * Setting `weights_backbone` to its fully BC value (#5653)
      
      * Replace default `weights_backbone=None` with its BC values.
      
      * Fixing tests
      
      * Fix linter
      
      * Update docs.
      
      * Update preprocessing on reference scripts.
      
      * Change qat/ptq to their full values.
      
      * Refactoring preprocessing
      
      * Fix video preset
      
      * No initialization on VGG if pretrained
      
      * Fix warning messages for backbone utils.
      
      * Adding star to all preset constructors.
      
      * Fix mypy.
      Co-authored-by: default avatarNicolas Hug <contact@nicolas-hug.com>
      Co-authored-by: default avatarPhilip Meier <github.pmeier@posteo.de>
      Co-authored-by: default avatarAnton Thomma <11010310+thommaa@users.noreply.github.com>
      Co-authored-by: default avatarAnton Thomma <anton@pri.co.nz>
      11bd2eaa
  12. 07 Mar, 2022 1 commit
  13. 21 Jan, 2022 1 commit
  14. 30 Nov, 2021 1 commit
    • Vasilis Vryniotis's avatar
      Refactor the `get_weights` API (#5006) · 3d8723d5
      Vasilis Vryniotis authored
      * Change the `default` weights mechanism to sue Enum aliases.
      
      * Change `get_weights` to work with full Enum names and make it public.
      
      * Applying improvements from code review.
      3d8723d5
  15. 15 Nov, 2021 1 commit
  16. 03 Nov, 2021 2 commits
  17. 28 Oct, 2021 1 commit
  18. 24 Oct, 2021 1 commit
  19. 04 Oct, 2021 1 commit
    • Philip Meier's avatar
      Add ufmt (usort + black) as code formatter (#4384) · 5f0edb97
      Philip Meier authored
      
      
      * add ufmt as code formatter
      
      * cleanup
      
      * quote ufmt requirement
      
      * split imports into more groups
      
      * regenerate circleci config
      
      * fix CI
      
      * clarify local testing utils section
      
      * use ufmt pre-commit hook
      
      * split relative imports into local category
      
      * Revert "split relative imports into local category"
      
      This reverts commit f2e224cde2008c56c9347c1f69746d39065cdd51.
      
      * pin black and usort dependencies
      
      * fix local test utils detection
      
      * fix ufmt rev
      
      * add reference utils to local category
      
      * fix usort config
      
      * remove custom categories sorting
      
      * Run pre-commit without fixing flake8
      
      * got a double import in merge
      Co-authored-by: default avatarNicolas Hug <nicolashug@fb.com>
      5f0edb97
  20. 11 May, 2021 1 commit
    • Vasilis Vryniotis's avatar
      Add SSDlite architecture with MobileNetV3 backbones (#3757) · 43d77206
      Vasilis Vryniotis authored
      * Partial implementation of SSDlite.
      
      * Add normal init and BN hyperparams.
      
      * Refactor to keep JIT happy
      
      * Completed SSDlite.
      
      * Fix lint
      
      * Update todos
      
      * Add expected file in repo.
      
      * Use C4 expansion instead of C4 output.
      
      * Change scales formula for Default Boxes.
      
      * Add cosine annealing on trainer.
      
      * Make T_max count epochs.
      
      * Fix test and handle corner-case.
      
      * Add support of support width_mult
      
      * Add ssdlite presets.
      
      * Change ReLU6, [-1,1] rescaling, backbone init & no pretraining.
      
      * Use _reduced_tail=True.
      
      * Add sync BN support.
      
      * Adding the best config along with its weights and documentation.
      
      * Make mean/std configurable.
      
      * Fix not implemented for half exception
      43d77206
  21. 07 May, 2021 1 commit
  22. 06 May, 2021 1 commit
  23. 30 Apr, 2021 1 commit
    • Vasilis Vryniotis's avatar
      Add SSD architecture with VGG16 backbone (#3403) · 730c5e1e
      Vasilis Vryniotis authored
      * Early skeleton of API.
      
      * Adding MultiFeatureMap and vgg16 backbone.
      
      * Making vgg16 backbone same as paper.
      
      * Making code generic to support all vggs.
      
      * Moving vgg's extra layers a separate class + L2 scaling.
      
      * Adding header vgg layers.
      
      * Fix maxpool patching.
      
      * Refactoring code to allow for support of different backbones & sizes:
      - Skeleton for Default Boxes generator class
      - Dynamic estimation of configuration when possible
      - Addition of types
      
      * Complete the implementation of DefaultBox generator.
      
      * Replace randn with empty.
      
      * Minor refactoring
      
      * Making clamping between 0 and 1 optional.
      
      * Change xywh to xyxy encoding.
      
      * Adding parameters and reusing objects in constructor.
      
      * Temporarily inherit from Retina to avoid dup code.
      
      * Implement forward methods + temp workarounds to inherit from retina.
      
      * Inherit more methods from retinanet.
      
      * Fix type error.
      
      * Add Regression loss.
      
      * Fixing JIT issues.
      
      * Change JIT workaround to minimize new code.
      
      * Fixing initialization bug.
      
      * Add classification loss.
      
      * Update todos.
      
      * Add weight loading support.
      
      * Support SSD512.
      
      * Change kernel_size to get output size 1x1
      
      * Add xavier init and refactoring.
      
      * Adding unit-tests and fixing JIT issues.
      
      * Add a test for dbox generator.
      
      * Remove unnecessary import.
      
      * Workaround on GeneralizedRCNNTransform to support fixed size input.
      
      * Remove unnecessary random calls from the test.
      
      * Remove more rand calls from the test.
      
      * change mapping and handling of empty labels
      
      * Fix JIT warnings.
      
      * Speed up loss.
      
      * Convert 0-1 dboxes to original size.
      
      * Fix warning.
      
      * Fix tests.
      
      * Update comments.
      
      * Fixing minor bugs.
      
      * Introduce a custom DBoxMatcher.
      
      * Minor refactoring
      
      * Move extra layer definition inside feature extractor.
      
      * handle no bias on init.
      
      * Remove fixed image size limitation
      
      * Change initialization values for bias of classification head.
      
      * Refactoring and update test file.
      
      * Adding ResNet backbone.
      
      * Minor refactoring.
      
      * Remove inheritance of retina and general refactoring.
      
      * SSD should fix the input size.
      
      * Fixing messages and comments.
      
      * Silently ignoring exception if test-only.
      
      * Update comments.
      
      * Update regression loss.
      
      * Restore Xavier init everywhere, update the negative sampling method, change the clipping approach.
      
      * Fixing tests.
      
      * Refactor to move the losses from the Head to the SSD.
      
      * Removing resnet50 ssd version.
      
      * Adding support for best performing backbone and its config.
      
      * Refactor and clean up the API.
      
      * Fix lint
      
      * Update todos and comments.
      
      * Adding RandomHorizontalFlip and RandomIoUCrop transforms.
      
      * Adding necessary checks to our tranforms.
      
      * Adding RandomZoomOut.
      
      * Adding RandomPhotometricDistort.
      
      * Moving Detection transforms to references.
      
      * Update presets
      
      * fix lint
      
      * leave compose and object
      
      * Adding scaling for completeness.
      
      * Adding params in the repr
      
      * Remove unnecessary import.
      
      * minor refactoring
      
      * Remove unnecessary call.
      
      * Give better names to DBox* classes
      
      * Port num_anchors estimation in generator
      
      * Remove rescaling and fix presets
      
      * Add the ability to pass a custom head and refactoring.
      
      * fix lint
      
      * Fix unit-test
      
      * Update todos.
      
      * Change mean values.
      
      * Change the default parameter of SSD to train the full VGG16 and remove the catch of exception for eval only.
      
      * Adding documentation
      
      * Adding weights and updating readmes.
      
      * Update the model weights with a more performing model.
      
      * Adding doc for head.
      
      * Restore import.
      730c5e1e
  24. 28 Jan, 2021 1 commit
    • Vasilis Vryniotis's avatar
      Adding Preset Transforms in reference scripts (#3317) · 1703e4ca
      Vasilis Vryniotis authored
      * Adding presets in the classification reference scripts.
      
      * Adding presets in the object detection reference scripts.
      
      * Adding presets in the segmentation reference scripts.
      
      * Adding presets in the video classification reference scripts.
      
      * Moving flip at the end to align with image classification signature.
      1703e4ca
  25. 18 Jan, 2021 1 commit
    • Vasilis Vryniotis's avatar
      Add MobileNetV3 architecture for Detection (#3253) · bf211dac
      Vasilis Vryniotis authored
      * Minor refactoring of a private method to make it reusuable.
      
      * Adding a FasterRCNN + MobileNetV3 with & w/o FPN models.
      
      * Reducing Resolution to 320-640 and anchor sizes to 16-256.
      
      * Increase anchor sizes.
      
      * Adding rpn score threshold param on the train script.
      
      * Adding trainable_backbone_layers param on the train script.
      
      * Adding rpn_score_thresh param directly in fasterrcnn_mobilenet_v3_large_fpn.
      
      * Remove fasterrcnn_mobilenet_v3_large prototype and update expected file.
      
      * Update documentation and adding weights.
      
      * Use buildin Identity.
      
      * Fix spelling.
      bf211dac
  26. 14 Jan, 2021 1 commit
  27. 07 Jan, 2021 1 commit
  28. 19 Dec, 2019 3 commits
  29. 25 Nov, 2019 1 commit
  30. 29 Aug, 2019 1 commit
  31. 12 Aug, 2019 1 commit
  32. 12 Jul, 2019 1 commit
    • flauted's avatar
      Clean det ref (#1109) · b7615843
      flauted authored
      * Doc multigpu and propagate data path.
      
      * Use raw doc because of backslash.
      b7615843
  33. 14 Jun, 2019 1 commit
  34. 21 May, 2019 1 commit
  35. 19 May, 2019 1 commit
    • Francisco Massa's avatar
      Add Faster R-CNN and Mask R-CNN (#898) · ccd1b27d
      Francisco Massa authored
      * [Remove] Use stride in 1x1 in resnet
      
      This is temporary
      
      * Move files to torchvision
      
      Inference works
      
      * Now seems to give same results
      
      Was using the wrong number of total iterations in the end...
      
      * Distributed evaluation seems to work
      
      * Factor out transforms into its own file
      
      * Enabling horizontal flips
      
      * MultiStepLR and preparing for launches
      
      * Add warmup
      
      * Clip gt boxes to images
      
      Seems to be crucial to avoid divergence. Also reduces the losses over different processes for better logging
      
      * Single-GPU batch-size 1 of CocoEvaluator works
      
      * Multi-GPU CocoEvaluator works
      
      Gives the exact same results as the other one, and also supports batch size > 1
      
      * Silence prints from pycocotools
      
      * Commenting unneeded code for run
      
      * Fixes
      
      * Improvements and cleanups
      
      * Remove scales from Pooler
      
      It was not a free parameter, and depended only on the feature map dimensions
      
      * Cleanups
      
      * More cleanups
      
      * Add misc ops and totally remove maskrcnn_benchmark
      
      * nit
      
      * Move Pooler to ops
      
      * Make FPN slightly more generic
      
      * Minor improvements or FPN
      
      * Move FPN to ops
      
      * Move functions to utils
      
      * Lint fixes
      
      * More lint
      
      * Minor cleanups
      
      * Add FasterRCNN
      
      * Remove modifications to resnet
      
      * Fixes for Python2
      
      * More lint fixes
      
      * Add aspect ratio grouping
      
      * Move functions around
      
      * Make evaluation use all images for mAP, even those without annotations
      
      * Bugfix with DDP introduced in last commit
      
      * [Check] Remove category mapping
      
      * Lint
      
      * Make GroupedBatchSampler prioritize largest clusters in the end of iteration
      
      * Bugfix for selecting the iou_types during evaluation
      
      Also switch to using the torchvision normalization now on, given that we are using torchvision base models
      
      * More lint
      
      * Add barrier after init_process_group
      
      Better be safe than sorry
      
      * Make evaluation only use one CPU thread per process
      
      When doing multi-gpu evaluation, paste_masks_in_image is multithreaded and throttles evaluation altogether. Also change default for aspect ratio group to match Detectron
      
      * Fix bug in GroupedBatchSampler
      
      After the first epoch, the number of batch elements could be larger than batch_size, because they got accumulated from the previous iteration. Fix this and also rename some variables for more clarity
      
      * Start adding KeypointRCNN
      
      Currently runs and perform inference, need to do full training
      
      * Remove use of opencv in keypoint inference
      
      PyTorch 1.1 adds support for bicubic interpolation which matches opencv (except for empty boxes, where one of the dimensions is 1, but that's fine)
      
      * Remove Masker
      
      Towards having mask postprocessing done inside the model
      
      * Bugfixes in previous change plus cleanups
      
      * Preparing to run keypoint training
      
      * Zero initialize bias for mask heads
      
      * Minor improvements on print
      
      * Towards moving resize to model
      
      Also remove class mapping specific to COCO
      
      * Remove zero init in bias for mask head
      
      Checking if it decreased accuracy
      
      * [CHECK] See if this change brings back expected accuracy
      
      * Cleanups on model and training script
      
      * Remove BatchCollator
      
      * Some cleanups in coco_eval
      
      * Move postprocess to transform
      
      * Revert back scaling and start adding conversion to coco api
      
      The scaling didn't seem to matter
      
      * Use decorator instead of context manager in evaluate
      
      * Move training and evaluation functions to a separate file
      
      Also adds support for obtaining a coco API object from our dataset
      
      * Remove unused code
      
      * Update location of lr_scheduler
      
      Its behavior has changed in PyTorch 1.1
      
      * Remove debug code
      
      * Typo
      
      * Bugfix
      
      * Move image normalization to model
      
      * Remove legacy tensor constructors
      
      Also move away from Int and instead use int64
      
      * Bugfix in MultiscaleRoiAlign
      
      * Move transforms to its own file
      
      * Add missing file
      
      * Lint
      
      * More lint
      
      * Add some basic test for detection models
      
      * More lint
      ccd1b27d