Commits · d5f4cc38dc96bd14e5cab0893c3203af9a8a9685 · OpenDAS / vision

30 Aug, 2023 1 commit
- Datapoint -> TVTensor; datapoint[s] -> tv_tensor[s] (#7894) · d5f4cc38
  Nicolas Hug authored Aug 30, 2023
  
  d5f4cc38
27 Jul, 2023 1 commit
- Properly handle maskrcnn and keypoints w.r.t. V2 in detection references (#7742) · 8233c9cd
  Nicolas Hug authored Jul 27, 2023
```
Co-authored-by: Philip Meier <github.pmeier@posteo.de>
```
  8233c9cd
13 Jul, 2023 1 commit
- Add --backend and --use-v2 support to detection refs (#7732) · bb3aae7b
  Nicolas Hug authored Jul 13, 2023
```
Co-authored-by: Philip Meier <github.pmeier@posteo.de>
```
  bb3aae7b
08 Aug, 2022 1 commit

Update references to use the new Model Registration API (#6369) · 1d0786b0

Vasilis Vryniotis authored Aug 08, 2022

* Expose on Hub the public methods of the registration API

* Limit methods and update docs.

* Update references to use the new Model Registration API

1d0786b0

22 Jul, 2022 1 commit

Upgrade usort to `1.0.2` and black to 22.3.0 (#5106) · 6ca9c76a

Philip Meier authored Jul 22, 2022



* upgrade usort to

* Also update black

* Actually use 1.0.2

* Apply pre-commit
Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>

6ca9c76a

21 Jun, 2022 1 commit
- Fix copypaste collate pickle issues (#6181) · 28557e0c
  Vasilis Vryniotis authored Jun 21, 2022
  
  28557e0c
15 Jun, 2022 1 commit

Add SimpleCopyPaste augmentation (#5825) · bbc1aac8

Lezwon Castelino authored Jun 15, 2022



* added simple POC

* added jitter and crop options

* added references

* moved simplecopypaste to detection module

* working POC for simple copy paste in detection

* added comments

* remove transforms from class
updated the labels
added gaussian blur

* removed loop for mask calculation

* replaced Gaussian blur with functional api

* added inplace operations

* added changes to accept tuples instead of tensors

* - make copy paste functional
- make only one copy of batch and target

* add inplace support within copy paste functional

* Updated code for copy-paste transform

* Fixed code formatting

* [skip ci] removed manual thresholding

* Replaced cropping by resizing data to paste

* Removed inplace arg (as useless) and put a check on iscrowd target

* code-formatting

* Updated copypaste op to make it torch scriptable
Added fallbacks to support LSJ

* Fixed flake8

* Updates according to the review
Co-authored-by: vfdev-5 <vfdev.5@gmail.com>
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

bbc1aac8

10 May, 2022 1 commit
- Fix regression on Detection training script (#5985) · 3ec4b949
  Vasilis Vryniotis authored May 10, 2022
  
  3ec4b949
03 May, 2022 1 commit

Reduce variance of evaluation in reference (#5819) · e556640b

YosuaMichael authored May 03, 2022

* Change code to reduce variance in eval

* Remove unnecessary new line

* Fix missing import warnings

* Fix the warning on video_classification

* Fix bug to get len of UniformClipSampler

e556640b

01 Apr, 2022 1 commit
- Detection recipe enhancements (#5715) · d59398b5
  Vasilis Vryniotis authored Apr 01, 2022
```
* Detection recipe enhancements

* Add back nesterov momentum
```
  d59398b5
22 Mar, 2022 1 commit

Port Multi-weight support from prototype to main (#5618) · 11bd2eaa

Vasilis Vryniotis authored Mar 22, 2022



* Moving basefiles outside of prototype and porting Alexnet, ConvNext, Densenet and EfficientNet.

* Porting googlenet

* Porting inception

* Porting mnasnet

* Porting mobilenetv2

* Porting mobilenetv3

* Porting regnet

* Porting resnet

* Porting shufflenetv2

* Porting squeezenet

* Porting vgg

* Porting vit

* Fix docstrings

* Fixing imports

* Adding missing import

* Fix mobilenet imports

* Fix tests

* Fix prototype tests

* Exclude get_weight from models on test

* Fix init files

* Porting googlenet

* Porting inception

* porting mobilenetv2

* porting mobilenetv3

* porting resnet

* porting shufflenetv2

* Fix test and linter

* Fixing docs.

* Porting Detection models (#5617)

* fix inits

* fix docs

* Port faster_rcnn

* Port fcos

* Port keypoint_rcnn

* Port mask_rcnn

* Port retinanet

* Port ssd

* Port ssdlite

* Fix linter

* Fixing tests

* Fixing tests

* Fixing vgg test

* Porting Optical Flow, Segmentation, Video models (#5619)

* Porting raft

* Porting video resnet

* Porting deeplabv3

* Porting fcn and lraspp

* Fixing the tests and linter

* Porting docs, examples, tutorials and galleries (#5620)

* Fix examples, tutorials and gallery

* Update gallery/plot_optical_flow.py
Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>

* Fix import

* Revert hardcoded normalization

* fix uncommitted changes

* Fix bug

* Fix more bugs

* Making resize optional for segmentation

* Fixing preset

* Fix mypy

* Fixing documentation strings

* Fix flake8

* minor refactoring
Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>

* Resolve conflict

* Porting model tests (#5622)

* Porting tests

* Remove unnecessary variable

* Fix linter

* Move prototype to extended tests

* Fix download models job

* Update CI on Multiweight branch to use the new weight download approach (#5628)

* port Pad to prototype transforms (#5621)

* port Pad to prototype transforms

* use literal

* Bump up LibTorchvision version number for Podspec to release Cocoapods (#5624)
Co-authored-by: Anton Thomma <anton@pri.co.nz>
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

* pre-download model weights in CI docs build (#5625)

* pre-download model weights in CI docs build

* move changes into template

* change docs image

* Regenerated config.yml
Co-authored-by: Philip Meier <github.pmeier@posteo.de>
Co-authored-by: Anton Thomma <11010310+thommaa@users.noreply.github.com>
Co-authored-by: Anton Thomma <anton@pri.co.nz>

* Porting reference scripts and updating presets (#5629)

* Making _preset.py classes

* Remove support of targets on presets.

* Rewriting the video preset

* Adding tests to check that the bundled transforms are JIT scriptable

* Rename all presets from *Eval to *Inference

* Minor refactoring

* Remove --prototype and --pretrained from reference scripts

* remove  pretained_backbone refs

* Corrections and simplifications

* Fixing bug

* Fixing linter

* Fix flake8

* restore documentation example

* minor fixes

* fix optical flow missing param

* Fixing commands

* Adding weights_backbone support in detection and segmentation

* Updating the commands for InceptionV3

* Setting `weights_backbone` to its fully BC value (#5653)

* Replace default `weights_backbone=None` with its BC values.

* Fixing tests

* Fix linter

* Update docs.

* Update preprocessing on reference scripts.

* Change qat/ptq to their full values.

* Refactoring preprocessing

* Fix video preset

* No initialization on VGG if pretrained

* Fix warning messages for backbone utils.

* Adding star to all preset constructors.

* Fix mypy.
Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>
Co-authored-by: Philip Meier <github.pmeier@posteo.de>
Co-authored-by: Anton Thomma <11010310+thommaa@users.noreply.github.com>
Co-authored-by: Anton Thomma <anton@pri.co.nz>

11bd2eaa

07 Mar, 2022 1 commit
- Refactor preset transforms (#5562) · d8654bb0
  Vasilis Vryniotis authored Mar 07, 2022
```
* Refactor preset transforms

* Making presets public.
```
  d8654bb0
21 Jan, 2022 1 commit

Adding prototype flag on reference scripts (#5248) · 4bf6c6e4

Vasilis Vryniotis authored Jan 21, 2022

* Adding prototype flag on reference scripts.

* Import prototype instead of models/transforms.

* Correcting exception type.

* fixing none referencing

4bf6c6e4

30 Nov, 2021 1 commit

Refactor the `get_weights` API (#5006) · 3d8723d5

Vasilis Vryniotis authored Nov 30, 2021

* Change the `default` weights mechanism to sue Enum aliases.

* Change `get_weights` to work with full Enum names and make it public.

* Applying improvements from code review.

3d8723d5

15 Nov, 2021 1 commit

support amp training for detection models (#4933) · 59ec1dfd

Hu Ye authored Nov 16, 2021



* support amp training

* support amp training

* support amp training

* Update references/detection/train.py
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

* Update references/detection/engine.py
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

* fix lint issues
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

59ec1dfd

03 Nov, 2021 2 commits
- Moving the check for prototype support in all references. (#4849) · 3300692c
  Vasilis Vryniotis authored Nov 03, 2021
  
  3300692c
- Adding multiweight support to FasterRCNN (#4847) · dd1adb07
  Vasilis Vryniotis authored Nov 03, 2021
```
* Aligning exception with all other models.

* Adding prototype preprocessing on video references.

* Adding the rest of model builders on faster_rcnn.
```
  dd1adb07
28 Oct, 2021 1 commit
- Use f-strings almost everywhere, and other cleanups by applying pyupgrade (#4585) · d367a01a
  Jirka Borovec authored Oct 28, 2021
```
Co-authored-by: Nicolas Hug <nicolashug@fb.com>
```
  d367a01a
24 Oct, 2021 1 commit

Add types and improve descriptions to ArgumentParser parameters (#4724) · 9ae833af

puhuk authored Oct 24, 2021



* Add type to default argument

To resolve issue #4694

* Resolve issue #4694

Add missing types on argument parser

* Update with ufmt

formatted with ufmt

* Updated with review

Updated with review

* Update type of arguments

Add train.py from video_classification, similarity and train_quantization.py
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

9ae833af

04 Oct, 2021 1 commit

Add ufmt (usort + black) as code formatter (#4384) · 5f0edb97

Philip Meier authored Oct 04, 2021



* add ufmt as code formatter

* cleanup

* quote ufmt requirement

* split imports into more groups

* regenerate circleci config

* fix CI

* clarify local testing utils section

* use ufmt pre-commit hook

* split relative imports into local category

* Revert "split relative imports into local category"

This reverts commit f2e224cde2008c56c9347c1f69746d39065cdd51.

* pin black and usort dependencies

* fix local test utils detection

* fix ufmt rev

* add reference utils to local category

* fix usort config

* remove custom categories sorting

* Run pre-commit without fixing flake8

* got a double import in merge
Co-authored-by: Nicolas Hug <nicolashug@fb.com>

5f0edb97

11 May, 2021 1 commit

Add SSDlite architecture with MobileNetV3 backbones (#3757) · 43d77206

Vasilis Vryniotis authored May 11, 2021

* Partial implementation of SSDlite.

* Add normal init and BN hyperparams.

* Refactor to keep JIT happy

* Completed SSDlite.

* Fix lint

* Update todos

* Add expected file in repo.

* Use C4 expansion instead of C4 output.

* Change scales formula for Default Boxes.

* Add cosine annealing on trainer.

* Make T_max count epochs.

* Fix test and handle corner-case.

* Add support of support width_mult

* Add ssdlite presets.

* Change ReLU6, [-1,1] rescaling, backbone init & no pretraining.

* Use _reduced_tail=True.

* Add sync BN support.

* Adding the best config along with its weights and documentation.

* Make mean/std configurable.

* Fix not implemented for half exception

43d77206

07 May, 2021 1 commit
- Add checkpoints used for preemption. (#3789) · a78d0d83
  Vasilis Vryniotis authored May 07, 2021
  
  a78d0d83
06 May, 2021 1 commit

Make reference scripts compatible with submitit (#3785) · c2ab0c59

Vasilis Vryniotis authored May 06, 2021

* Add submitit script, partition param and parser on its own method.

* Fix method names, handle add_help correctly and refactoring.

* Delete run_with_submitit.py file

c2ab0c59

30 Apr, 2021 1 commit

Add SSD architecture with VGG16 backbone (#3403) · 730c5e1e

Vasilis Vryniotis authored Apr 30, 2021

* Early skeleton of API.

* Adding MultiFeatureMap and vgg16 backbone.

* Making vgg16 backbone same as paper.

* Making code generic to support all vggs.

* Moving vgg's extra layers a separate class + L2 scaling.

* Adding header vgg layers.

* Fix maxpool patching.

* Refactoring code to allow for support of different backbones & sizes:
- Skeleton for Default Boxes generator class
- Dynamic estimation of configuration when possible
- Addition of types

* Complete the implementation of DefaultBox generator.

* Replace randn with empty.

* Minor refactoring

* Making clamping between 0 and 1 optional.

* Change xywh to xyxy encoding.

* Adding parameters and reusing objects in constructor.

* Temporarily inherit from Retina to avoid dup code.

* Implement forward methods + temp workarounds to inherit from retina.

* Inherit more methods from retinanet.

* Fix type error.

* Add Regression loss.

* Fixing JIT issues.

* Change JIT workaround to minimize new code.

* Fixing initialization bug.

* Add classification loss.

* Update todos.

* Add weight loading support.

* Support SSD512.

* Change kernel_size to get output size 1x1

* Add xavier init and refactoring.

* Adding unit-tests and fixing JIT issues.

* Add a test for dbox generator.

* Remove unnecessary import.

* Workaround on GeneralizedRCNNTransform to support fixed size input.

* Remove unnecessary random calls from the test.

* Remove more rand calls from the test.

* change mapping and handling of empty labels

* Fix JIT warnings.

* Speed up loss.

* Convert 0-1 dboxes to original size.

* Fix warning.

* Fix tests.

* Update comments.

* Fixing minor bugs.

* Introduce a custom DBoxMatcher.

* Minor refactoring

* Move extra layer definition inside feature extractor.

* handle no bias on init.

* Remove fixed image size limitation

* Change initialization values for bias of classification head.

* Refactoring and update test file.

* Adding ResNet backbone.

* Minor refactoring.

* Remove inheritance of retina and general refactoring.

* SSD should fix the input size.

* Fixing messages and comments.

* Silently ignoring exception if test-only.

* Update comments.

* Update regression loss.

* Restore Xavier init everywhere, update the negative sampling method, change the clipping approach.

* Fixing tests.

* Refactor to move the losses from the Head to the SSD.

* Removing resnet50 ssd version.

* Adding support for best performing backbone and its config.

* Refactor and clean up the API.

* Fix lint

* Update todos and comments.

* Adding RandomHorizontalFlip and RandomIoUCrop transforms.

* Adding necessary checks to our tranforms.

* Adding RandomZoomOut.

* Adding RandomPhotometricDistort.

* Moving Detection transforms to references.

* Update presets

* fix lint

* leave compose and object

* Adding scaling for completeness.

* Adding params in the repr

* Remove unnecessary import.

* minor refactoring

* Remove unnecessary call.

* Give better names to DBox* classes

* Port num_anchors estimation in generator

* Remove rescaling and fix presets

* Add the ability to pass a custom head and refactoring.

* fix lint

* Fix unit-test

* Update todos.

* Change mean values.

* Change the default parameter of SSD to train the full VGG16 and remove the catch of exception for eval only.

* Adding documentation

* Adding weights and updating readmes.

* Update the model weights with a more performing model.

* Adding doc for head.

* Restore import.

730c5e1e

28 Jan, 2021 1 commit

Adding Preset Transforms in reference scripts (#3317) · 1703e4ca

Vasilis Vryniotis authored Jan 28, 2021

* Adding presets in the classification reference scripts.

* Adding presets in the object detection reference scripts.

* Adding presets in the segmentation reference scripts.

* Adding presets in the video classification reference scripts.

* Moving flip at the end to align with image classification signature.

1703e4ca

18 Jan, 2021 1 commit

Add MobileNetV3 architecture for Detection (#3253) · bf211dac

Vasilis Vryniotis authored Jan 18, 2021

* Minor refactoring of a private method to make it reusuable.

* Adding a FasterRCNN + MobileNetV3 with & w/o FPN models.

* Reducing Resolution to 320-640 and anchor sizes to 16-256.

* Increase anchor sizes.

* Adding rpn score threshold param on the train script.

* Adding trainable_backbone_layers param on the train script.

* Adding rpn_score_thresh param directly in fasterrcnn_mobilenet_v3_large_fpn.

* Remove fasterrcnn_mobilenet_v3_large prototype and update expected file.

* Update documentation and adding weights.

* Use buildin Identity.

* Fix spelling.

bf211dac

14 Jan, 2021 1 commit

Improve speed/accuracy of FasterRCNN by introducing a score threshold on RPN (#3205) · 8ebfd2f5

Vasilis Vryniotis authored Jan 14, 2021



* Introduce small score threshold on rpn

* Adding docs and fixing keypoint and mask.

* Making value 0.0 by default for BC.

* Fixing for onnx.

* Update threshold.

* Removing non-default threshold from reference scripts.
Co-authored-by: Francisco Massa <fvsmassa@gmail.com>

8ebfd2f5

07 Jan, 2021 1 commit

Remove unused imports after manual review (#3229) · 7536e298

Ben Weinstein authored Jan 07, 2021



* remove unused imports after manual review

* Update torchvision/datasets/voc.py
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

* remove two more instances
Co-authored-by: Ben Weinstein <benweinstein@Bens-MacBook-Pro.local>
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

7536e298

19 Dec, 2019 3 commits
- Fix lint (#1693) · d2c763e1
  Francisco Massa authored Dec 19, 2019
  
  d2c763e1
- Fix lint (#1692) · 013d1322
  Francisco Massa authored Dec 19, 2019
  
  013d1322
- fix a little bug about resume (#1628) · f4a82243
  MultiK authored Dec 20, 2019
```
* fix a little bug about resume

When resuming, we need to start from the last epoch not 0.

* the second way for resuming

the second way for resuming
```
  f4a82243
25 Nov, 2019 1 commit
- update default parameters (#1611) · 537f0df7
  Yoshitomo Matsubara authored Nov 25, 2019
  
  537f0df7
29 Aug, 2019 1 commit
- Fix comment in default arguments (#1243) · 80cfc2c8
  Joaquín Alori authored Aug 29, 2019
  
  80cfc2c8
12 Aug, 2019 1 commit
- Better explain lr and batch size in references/detection/train.py (#1233) · 19315e31
  Gu Wang authored Aug 13, 2019
```
* explain lr and batch size in references/detection/train.py

* fix typo
```
  19315e31
12 Jul, 2019 1 commit

Clean det ref (#1109) · b7615843

flauted authored Jul 12, 2019

* Doc multigpu and propagate data path.

* Use raw doc because of backslash.

b7615843

14 Jun, 2019 1 commit
- Misc lint fixes (#1020) · f052c53f
  Francisco Massa authored Jun 14, 2019
  
  f052c53f
21 May, 2019 1 commit
- Add pretrained arg to reference scripts (#935) · 115d2eb7
  Francisco Massa authored May 21, 2019
```
Allows for easily evaluating the pre-trained models in the modelzoo
```
  115d2eb7
19 May, 2019 1 commit

Add Faster R-CNN and Mask R-CNN (#898) · ccd1b27d

Francisco Massa authored May 19, 2019

* [Remove] Use stride in 1x1 in resnet

This is temporary

* Move files to torchvision

Inference works

* Now seems to give same results

Was using the wrong number of total iterations in the end...

* Distributed evaluation seems to work

* Factor out transforms into its own file

* Enabling horizontal flips

* MultiStepLR and preparing for launches

* Add warmup

* Clip gt boxes to images

Seems to be crucial to avoid divergence. Also reduces the losses over different processes for better logging

* Single-GPU batch-size 1 of CocoEvaluator works

* Multi-GPU CocoEvaluator works

Gives the exact same results as the other one, and also supports batch size > 1

* Silence prints from pycocotools

* Commenting unneeded code for run

* Fixes

* Improvements and cleanups

* Remove scales from Pooler

It was not a free parameter, and depended only on the feature map dimensions

* Cleanups

* More cleanups

* Add misc ops and totally remove maskrcnn_benchmark

* nit

* Move Pooler to ops

* Make FPN slightly more generic

* Minor improvements or FPN

* Move FPN to ops

* Move functions to utils

* Lint fixes

* More lint

* Minor cleanups

* Add FasterRCNN

* Remove modifications to resnet

* Fixes for Python2

* More lint fixes

* Add aspect ratio grouping

* Move functions around

* Make evaluation use all images for mAP, even those without annotations

* Bugfix with DDP introduced in last commit

* [Check] Remove category mapping

* Lint

* Make GroupedBatchSampler prioritize largest clusters in the end of iteration

* Bugfix for selecting the iou_types during evaluation

Also switch to using the torchvision normalization now on, given that we are using torchvision base models

* More lint

* Add barrier after init_process_group

Better be safe than sorry

* Make evaluation only use one CPU thread per process

When doing multi-gpu evaluation, paste_masks_in_image is multithreaded and throttles evaluation altogether. Also change default for aspect ratio group to match Detectron

* Fix bug in GroupedBatchSampler

After the first epoch, the number of batch elements could be larger than batch_size, because they got accumulated from the previous iteration. Fix this and also rename some variables for more clarity

* Start adding KeypointRCNN

Currently runs and perform inference, need to do full training

* Remove use of opencv in keypoint inference

PyTorch 1.1 adds support for bicubic interpolation which matches opencv (except for empty boxes, where one of the dimensions is 1, but that's fine)

* Remove Masker

Towards having mask postprocessing done inside the model

* Bugfixes in previous change plus cleanups

* Preparing to run keypoint training

* Zero initialize bias for mask heads

* Minor improvements on print

* Towards moving resize to model

Also remove class mapping specific to COCO

* Remove zero init in bias for mask head

Checking if it decreased accuracy

* [CHECK] See if this change brings back expected accuracy

* Cleanups on model and training script

* Remove BatchCollator

* Some cleanups in coco_eval

* Move postprocess to transform

* Revert back scaling and start adding conversion to coco api

The scaling didn't seem to matter

* Use decorator instead of context manager in evaluate

* Move training and evaluation functions to a separate file

Also adds support for obtaining a coco API object from our dataset

* Remove unused code

* Update location of lr_scheduler

Its behavior has changed in PyTorch 1.1

* Remove debug code

* Typo

* Bugfix

* Move image normalization to model

* Remove legacy tensor constructors

Also move away from Int and instead use int64

* Bugfix in MultiscaleRoiAlign

* Move transforms to its own file

* Add missing file

* Lint

* More lint

* Add some basic test for detection models

* More lint

ccd1b27d