Commits · ba0d665bbbbd8587777a33d22b059ed40c0d9866 · OpenDAS / vision

25 Jul, 2022 1 commit

Vectorize box encoding in FCOS (#6278) · ba0d665b

Abhijit Deo authored Jul 25, 2022



* intial structure

* fixed types of few variables

* remove the commented code

* list -> List

* encode method will take input as tensors instead of list of tensor
Co-authored-by: Joao Gomes <jdsgomes@fb.com>

ba0d665b

22 Jul, 2022 1 commit

Upgrade usort to `1.0.2` and black to 22.3.0 (#5106) · 6ca9c76a

Philip Meier authored Jul 22, 2022



* upgrade usort to

* Also update black

* Actually use 1.0.2

* Apply pre-commit
Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>

6ca9c76a

14 Jul, 2022 1 commit

fix Swin Transformer inplace mutation (#6266) · 418d8a6f

Local State authored Jul 14, 2022



* fix inplace mutation

* Different attn shouldn't share the same attribute

* a simpler solution
Co-authored-by: YosuaMichael <yosuamichaelm@gmail.com>

418d8a6f

08 Jul, 2022 1 commit
- Move out the pad operation from PatchMerging in swin transformer to make it fx compatible (#6252) · e75a3337
  YosuaMichael authored Jul 08, 2022
  
  e75a3337
07 Jul, 2022 1 commit

Adding video accuracy for video_classification reference script (#6241) · 8a45147f

YosuaMichael authored Jul 07, 2022

* Add ensembled video accuracy on video reference script

* Change the parser func to be similar with classification reference

* Fix typo type->dtype

* Use custom kinetics

* Fix dataset to not getting start_pts

* Change dataset name, and put video_idx at the back

* Ufmt format

* Use functional softmax, updating meta and use it to overwrite eval param

* Fix typo

* Put the eval parameters on the docs for now

* Change meta for video resnet to use frame-rate 15, also change wording on docs

8a45147f

05 Jul, 2022 1 commit

Vectorize box decoding in FCOS (#6203) · b3b74481

Abhijit Deo authored Jul 05, 2022



* basic structure

* added constrains

* fixed errors

* thanks to vadim!

* addressing the comments and added docstrign

* Apply suggestions from code review
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

b3b74481

24 Jun, 2022 1 commit

Add MViT architecture in TorchVision (#6198) · fb7f9a16

Vasilis Vryniotis authored Jun 24, 2022

* Adding MViT v2 architecture (#6105)

* Adding mvitv2 architecture

* Fixing memory issues on tests and minor refactorings.

* Adding input validation

* Adding docs and minor refactoring

* Add `min_temporal_size` in the supported meta-data.

* Switch Tuple[int, int, int] with List[int] to support easier the 2D case

* Adding more docs and references

* Change naming conventions of classes to follow the same pattern as MobileNetV3

* Fix test breakage.

* Update todos

* Performance optimizations.

* Add support to MViT v1 (#6179)

* Switch implementation to v1 variant.

* Fix docs

* Adding back a v2 pseudovariant

* Changing the way the network are configured.

* Temporarily removing v2

* Adding weights.

* Expand _squeeze/_unsqueeze to support arbitrary dims.

* Update references script.

* Fix tests.

* Fixing frames and preprocessing.

* Fix std/mean values in transforms.

* Add permanent Dropout and update the weights.

* Update accuracies.

* Fix documentation

* Remove unnecessary expected file.

* Skip big model test

* Rewrite the configuration logic to reduce LOC.

* Fix mypy

fb7f9a16

23 Jun, 2022 1 commit

Add raft-stereo model to prototype/models (#6107) · 11caf37a

YosuaMichael authored Jun 23, 2022

* Add rough raft-stereo implementation on prototype/models

* Add standard raft_stereo builder, and modify context_encoder to be more similar with original implementation

* Follow original implementation on pre-convolve context

* Fix to make sure we can load original implementation weight and got same output

* reusing component from raft

* Make the raft_stereo_fast able to load original weight implementation

* Format with ufmt and update some comment

* Use raft FlowHead

* clean up comments

* Remove unnecessary import and use ufmt format

* Add __all__ and more docs for RaftStereo class

* Only accept param and not module for raft stereo builder

* Cleanup comment

* Adding typing to raft_stereo

* Update some of raft code and reuse on raft stereo

* Use bool instead of int

* Make standard raft_stereo model jit scriptable

* Make the function _make_out_layer using boolean with_block and init the block_layer with identity

* Separate corr_block into two modules for pyramid and building corr features

* Use tuple if input is not variable size, also remove default value if using List

* Format using ufmt and update ConvGRU to not inherit from raft in order to satisfy both jit script and mypy

* Change RaftStereo docs input type

* Ufmt format raft

* revert back convgru to see mypy errors, add test for jit and fx, make the model fx compatible

* ufmt format

* Specify device for new tensor, dont init module then overwrite and put if-else instead

* Ignore mypy problem on override, put back num_iters on forward

* Revert some effort to make it fx compatible but unnecessary now

* refactor code and remove num_iters from RaftStereo constructor

* Change to raft_stereo_realtime, and specify device directly for tensor creation

* Add description for raft_stereo_realtime

* Update the test for raft_stereo

* Fix raft stereo prototype test to properly test jit script

* Ufmt format

* Test against expected file, change name from raft_stereo to raft_stereo_builder to prevent import error

* Revert __init__.py changes

* Add default value for non-list param on model builder

* Add checking on out_with_block length, add more docs on the encoder

* Use base instead of basic since it is more commonly used

* rename expect file to base as well

* rename on test

* Revert the revert of __init__.py, also revert the adding default value to _raft_stereo to follow the standard pattern

* ufmt format __init__.py

11caf37a

16 Jun, 2022 2 commits
- Fix all broken URLs (#6176) · 12bb8873
  Nicolas Hug authored Jun 16, 2022
  
  12bb8873
- Adding `_log_api_usage_once` to Swin's reusable components. (#6174) · ac5dc51a
  Vasilis Vryniotis authored Jun 16, 2022
  
  ac5dc51a
14 Jun, 2022 1 commit

Add new `.. betastatus::` directive and document Beta APIs (#6115) · 0e688ce0

Nicolas Hug authored Jun 14, 2022

* Add new .. betastatus:: directive to document Beta APIs

* Also add it for the fine-grained video API

* Add directive for all builders and pages of Detection module

* Also segmentation and video models

0e688ce0

10 Jun, 2022 1 commit
- Fix ViT and Resnext docs (#6150) · fee6d12c
  Nicolas Hug authored Jun 10, 2022
  
  fee6d12c
31 May, 2022 1 commit
- Add missing `_version` to the MLPBlock (#6113) · ba4b0db5
  Vasilis Vryniotis authored May 31, 2022
```
* Add missing `_version` to the MLPBlock

* fix linter
```
  ba4b0db5
26 May, 2022 2 commits

Change weights return type to Mapping (#6097) · 6aaa2b00
Vasilis Vryniotis authored May 26, 2022

6aaa2b00

Refactor swin transfomer so later we can reuse component for 3d version (#6088) · 952f4806

YosuaMichael authored May 26, 2022

* Use List[int] instead of int for window_size and shift_size

* Make PatchMerging and SwinTransformerBlock able to handle 2d and 3d cases

* Separate patch embedding from SwinTransformer and enable to get model without head by specifying num_heads=None

* Dont use if before padding so it is fx friendly

* Put the handling on window_size edge cases on separate function and wrap with torch.fx.wrap so it is excluded from tracing

* Update the weight url to the converted weight with new structure

* Update the accuracy of swin_transformer

* Change assert to Exception and nit

* Make num_classes optional

* Add typing output for _fix_window_and_shift_size function

* init head to None to make it jit scriptable

* Revert the change to make num_classes optional

* Revert unneccesarry changes that might be risky

* Remove self.head declaration

952f4806

25 May, 2022 1 commit
- Fix bug by checking if norm_layer weight is None before init (#6082) · 0f971f64
  YosuaMichael authored May 25, 2022
  
  0f971f64
20 May, 2022 2 commits

Document ResNet architecture tweak (#5977) · 37665a0b

puhuk authored May 20, 2022



* To resolve issue #5964

Add note for resnet architecture

* Update resnet.py

* Update resnet.py

* Update resnet.rst

* Fix stylings

* Add the same notes on model builders

* Improve description

* Apply the change everywhere

* Remove trailing space
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

37665a0b

Move Permute layer to ops. (#6055) · d57f929d
Vasilis Vryniotis authored May 20, 2022

d57f929d

19 May, 2022 2 commits

Adding multi-layer perceptron in ops (#6053) · 77cad127

Vasilis Vryniotis authored May 19, 2022

* Adding an MLP block.

* Adding documentation

* Update typos.

* Fix inplace for Dropout.

* Apply recommendations from code review.

* Making changes on pre-trained models.

* Fix linter

77cad127

add swin_s and swin_b variants and improved swin_t (#6048) · 9d9cfab2

Joao Gomes authored May 19, 2022



* add swin_s and swin_b variants

* fix swin_b params

* fix n parameters and acc numbers

* adding missing acc numbers

* apply ufmt

* Updating `_docs` to reflect training recipe

* Fix exted for swin_b
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

9d9cfab2

18 May, 2022 4 commits

Doc revamp for optical flow models (#5895) · 5985504c
Nicolas Hug authored May 18, 2022
```
* Doc revamp for optical flow models

* Some more
```
5985504c

New schema for metrics in weights meta-data (#6047) · 2ec0e847

Nicolas Hug authored May 18, 2022

* Classif models

* Detection

* Segmentation

* quantization

* Video

* optical flow

* tests

* Fix docs

* Fix Video dataset

* Consistency for RAFT dataset names

* use ImageNet-1K

* Use COCO-val2017-VOC-labels for segmentation

* formatting

2ec0e847

Fix resnext docs (#6044) · ff8fae92

Aditya Oke authored May 18, 2022


Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

ff8fae92

Document all remaining pre-trained weights (#6039) · b52f2331

Vasilis Vryniotis authored May 18, 2022

* Adding docs for quantized models.

* Adding docs for video models.

* Adding docs for segmentation models.

* Adding docs for optical flow models.

* Adding docs for detection models.

* Fix typo.

* Make changes from code-review.

b52f2331

17 May, 2022 4 commits

Document all pre-trained Classification weights (#6036) · edb7bbbd

Vasilis Vryniotis authored May 17, 2022

* Improving the auto-gen doc.

* Adding details for AlexNet, ConvNext, DenseNet, EfficientNets, GoogLeNet and InceptionV3.

* Fixing location of `_docs`

* Adding docs in the remaining classification models.

* Fix linter

edb7bbbd

update paper link for FCOS refence (#6035) · 202ecfd5

WuZhe authored May 17, 2022



* update paper link for FCOS refence

* remove 'updated version'
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

202ecfd5

docs: Added quantized ResNext to the new doc (#6032) · 10acc822

F-G Fernandez authored May 17, 2022



* docs: Added quantized ResNeXt to new docs

* docs: Fixed docstring
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

10acc822

docs: Fixed quantized resnet docstring (#6033) · b50aaf0f
F-G Fernandez authored May 17, 2022

b50aaf0f

16 May, 2022 7 commits
- Revamp docs for Quantized ShuffleNetV2 (#6028) · a1232c21
  Zhiqiang Wang authored May 17, 2022
  
  a1232c21
- Add weight for mnasnet0_75 and mnasnet1_3 (#6019) · 4176556e
  YosuaMichael authored May 16, 2022
```
* Add weight for mnasnet0_75 and mnasnet1_3

* Fix missing comma

* Add PR url as recipe, and update the metrics

* Add weights to legacy handler

* Update docs to specify there are weights available
```
  4176556e
- Expose `get_weight` to Torch Hub (#6026) · 9e788719
  Vasilis Vryniotis authored May 16, 2022
```
* Prefixing `_get_enum_from_fn` with underscore

* Exposing `get_weight` to Torch Hub
```
  9e788719
- Fix ConvNext weight links (#6023) · 8e5844fc
  Nicolas Hug authored May 16, 2022
  
  8e5844fc
- Update ShuffleNetV2 annotations for x1_5 and x2_0 variants (#6022) · a161098e
  Vasilis Vryniotis authored May 16, 2022
  
  a161098e
- Add `.. note::` about quantize parameter in quantized models builders (#6021) · 01664a8e
  Abhijit Deo authored May 16, 2022
  
  01664a8e
- Revamp docs for Quantized MobileNetV3 (#6016) · f176fb0d
  Abhijit Deo authored May 16, 2022
```
* added note

* quantize = True higlighted in the note.

* Keep "Large" in docstring
Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>
```
  f176fb0d
13 May, 2022 2 commits

Added revamped quantized resnet docs (#6012) · d585f86d

Hu Ye authored May 14, 2022



* Create resnet_quant.rst

* add resnet quant

* refactor docs

* Minor fix

* Nit
Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>

d585f86d

[DOC] Add quantized inception updated documentation. (#6005) · e7abd3bb

Yassine Alouini authored May 13, 2022



* [DOC] Add quantized inception updated documentation.

* Add missing file.

* Apply suggestions from code review

[ENH] Improve doc thanks to various code review comments.
Co-authored-by: Aditya Oke <47158509+oke-aditya@users.noreply.github.com>
Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>
Co-authored-by: Aditya Oke <47158509+oke-aditya@users.noreply.github.com>
Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>

e7abd3bb

12 May, 2022 3 commits

Clean up model documentation (#6003) · c67a5839

Vasilis Vryniotis authored May 12, 2022

* Remove old "minimum input size" from docstrings.

* Remove "currently only XYZ weights available"

* Fix description of wide_resnet101_2

* Make display license URLs as links.

* Clarify the order of dims of min_size.

* Remove lengthy keypoint_names from meta-table.

c67a5839

added revamped quantized mobilenetv2 docs (#6004) · ac016599

Lezwon Castelino authored May 12, 2022



* added quantized mobilenetv2 docs

* remove quotes
Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>

ac016599

Handle empty weights in doc generation (#6006) · 3414322d
Nicolas Hug authored May 12, 2022

3414322d