Commits · fb6bc29b236f5eb1cbb1dd3d250b1a84d0327704 · ModelZoo / ResNet50_tensorflow

01 Aug, 2018 1 commit

Refactor object detection box predictors and fix some issues with model_main. (#4965) · 02a9969e

pkulzc authored Aug 01, 2018

* Merged commit includes the following changes:
206852642 by Zhichao Lu:

Build the balanced_positive_negative_sampler in the model builder for FasterRCNN. Also adds an option to use the static implementation of the sampler.

--
206803260 by Zhichao Lu:

Fixes a misplaced argument in resnet fpn feature extractor.

--
206682736 by Zhichao Lu:

This CL modifies the SSD meta architecture to support both Slim-based and Keras-based box predictors, and begins preparation for Keras box predictor support in the other meta architectures.

Concretely, this CL adds a new `KerasBoxPredictor` base class and makes the meta architectures appropriately call whichever box predictors they are using.

We can switch the non-ssd meta architectures to fully support Keras box predictors once the Keras Convolutional Box Predictor CL is submitted.

--
206669634 by Zhichao Lu:

Adds an alternate method for balanced positive negative sampler using static shapes.

--
206643278 by Zhichao Lu:

This CL adds a Keras layer hyperparameter configuration object to the hyperparams_builder.

It automatically converts from Slim layer hyperparameter configs to Keras layer hyperparameters. Namely, it:
- Builds Keras initializers/regularizers instead of Slim ones
- sets weights_regularizer/initializer to kernel_regularizer/initializer
- converts batchnorm decay to momentum
- converts Slim l2 regularizer weights to the equivalent Keras l2 weights

This will be used in the conversion of object detection feature extractors & box predictors to newer Tensorflow APIs.
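
The L2 weight conversion in the last bullet can be illustrated with a minimal sketch. It assumes that Slim's `l2_regularizer(scale)` computes `scale * sum(w**2) / 2` (through `tf.nn.l2_loss`), while Keras's `l2(l)` computes `l * sum(w**2)`, so the equivalent Keras weight would be half the Slim weight. The function names below are hypothetical stand-ins, not the hyperparams_builder API.

```python
# Illustrative only: plain-Python stand-ins for the two L2 penalties.
# Assumption: Slim halves the sum of squares (tf.nn.l2_loss); Keras does not.


def slim_l2_penalty(weights, scale):
    """Penalty as assumed for Slim: scale * sum(w^2) / 2."""
    return scale * sum(w * w for w in weights) / 2.0


def keras_l2_penalty(weights, l2):
    """Penalty as assumed for Keras: l2 * sum(w^2)."""
    return l2 * sum(w * w for w in weights)


def slim_to_keras_l2(slim_scale):
    """Hypothetical helper: Keras weight that yields the same penalty."""
    return slim_scale * 0.5


weights = [0.5, -1.25, 2.0]
slim_scale = 0.0004
keras_weight = slim_to_keras_l2(slim_scale)
```

Under these assumptions, both penalty functions return the same value for `weights` once the Slim scale has been halved.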

--
206611681 by Zhichao Lu:

Internal changes.

--
206591619 by Zhichao Lu:

Clip to the shape when the input tensors are larger than the expected padded static shape.

--
206517644 by Zhichao Lu:

Make MultiscaleGridAnchorGenerator more consistent with MultipleGridAnchorGenerator.

--
206415624 by Zhichao Lu:

Make the hardcoded feature pyramid network (FPN) levels configurable for both SSD
Resnet and SSD Mobilenet.

--
206398204 by Zhichao Lu:

This CL modifies the SSD meta architecture to support both Slim-based and Keras-based feature extractors.

This allows us to begin the conversion of object detection to newer Tensorflow APIs.

--
206213448 by Zhichao Lu:

Adding a method to compute the expected classification loss by background/foreground weighting.

--
206204232 by Zhichao Lu:

Adding the keypoint head to the Mask RCNN pipeline.

--
206200352 by Zhichao Lu:

- Create Faster R-CNN target assigner in the model builder. This allows configuring matchers in Target assigner to use TPU compatible ops (tf.gather in this case) without any change in meta architecture.
- As a positive side effect of the refactoring, we can now re-use a single target assigner for all of the second stage heads in Faster R-CNN.

--
206178206 by Zhichao Lu:

Force ssd feature extractor builder to use keyword arguments so values won't be passed to wrong arguments.

--
206168297 by Zhichao Lu:

Updating exporter to use freeze_graph.freeze_graph_with_def_protos rather than a homegrown version.

--
206080748 by Zhichao Lu:

Merge external contributions.

--
206074460 by Zhichao Lu:

Update the preprocessor to apply temperature and softmax to the multiclass scores on read.

--
205960802 by Zhichao Lu:

Fixing a bug in hierarchical label expansion script.

--
205944686 by Zhichao Lu:

Update exporter to support exporting quantized model.

--
205912529 by Zhichao Lu:

Add a two stage matcher to allow for thresholding by one criterion and then argmaxing on the other.
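
A rough sketch of the threshold-then-argmax idea follows. It is not the matcher added in this change: the function name, the row/column layout (rows as groundtruth boxes, columns as anchors) and the `-1` "no match" marker are all assumptions made for illustration.

```python
# Hypothetical two-stage matching: filter rows by one score matrix,
# then pick the best remaining row according to a second score matrix.


def two_stage_match(threshold_scores, argmax_scores, threshold):
    """For each column, return the row that maximizes argmax_scores among
    rows whose threshold_scores entry is >= threshold, or -1 if none pass."""
    num_rows = len(threshold_scores)
    num_cols = len(threshold_scores[0]) if num_rows else 0
    matches = []
    for col in range(num_cols):
        candidates = [row for row in range(num_rows)
                      if threshold_scores[row][col] >= threshold]
        if not candidates:
            matches.append(-1)  # nothing passed the first stage
        else:
            best = max(candidates, key=lambda row: argmax_scores[row][col])
            matches.append(best)
    return matches


# Stage 1 criterion (for example an overlap measure), rows x columns.
overlap = [[0.7, 0.2, 0.6],
           [0.6, 0.1, 0.9]]
# Stage 2 criterion (for example a confidence score), same layout.
score = [[0.3, 0.9, 0.8],
         [0.8, 0.5, 0.4]]
matches = two_stage_match(overlap, score, threshold=0.5)
```

Here the second column has no row passing the threshold, so it stays unmatched even though its second-stage scores are high.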

--
205909017 by Zhichao Lu:

Add test for grayscale image_resizer

--
205892801 by Zhichao Lu:

Add flag to decide whether to apply batch norm to conv layers of weight shared box predictor.

--
205824449 by Zhichao Lu:

Make sure that by default the mask rcnn box predictor predicts 2 stages.

--
205730139 by Zhichao Lu:

Updating warning message to be more explicit about variable size mismatch.

--
205696992 by Zhichao Lu:

Remove utils/ops.py's dependency on core/box_list_ops.py. This will allow re-using TPU compatible ops from utils/ops.py in core/box_list_ops.py.

--
205696867 by Zhichao Lu:

Refactor the mask rcnn predictor so that each head is in a separate file.
This CL lets us add new heads to mask rcnn more easily in the future.

--
205492073 by Zhichao Lu:

Refactor R-FCN box predictor to be TPU compliant.

- Change utils/ops.py:position_sensitive_crop_regions to operate on single image and set of boxes without `box_ind`
- Add a batch version that operates on batches of images and batches of boxes.
- Refactor R-FCN box predictor to use the batched version of position sensitive crop regions.

--
205453567 by Zhichao Lu:

Fix bug where the inference graph cannot be exported when the write_inference_graph flag is True.

--
205316039 by Zhichao Lu:

Changing input tensor name.

--
205256307 by Zhichao Lu:

Fix model zoo links for quantized model.

--
205164432 by Zhichao Lu:

Fixes eval error when label map contains non-ascii characters.

--
205129842 by Zhichao Lu:

Adds an option to clip the anchors to the window size without filtering the overlapped boxes in Faster-RCNN.

--
205094863 by Zhichao Lu:

Update the label map util to allow the option of adding a background class and filling in gaps in the label map. Useful for multiclass scores, which require a complete label map with an explicit background label.

--
204989032 by Zhichao Lu:

Add tf.prof support to exporter.

--
204825267 by Zhichao Lu:

Modify mask rcnn box predictor tests for TPU compatibility.

--
204778749 by Zhichao Lu:

Remove score filtering from postprocessing.py and rely on filtering logic in tf.image.non_max_suppression

--
204775818 by Zhichao Lu:

Python3 fixes for object_detection.

--
204745920 by Zhichao Lu:

Object Detection Dataset visualization tool (documentation).

--
204686993 by Zhichao Lu:

Internal changes.

--
204559667 by Zhichao Lu:

Refactor box_predictor.py into multiple files.
The abstract base class remains in object_detection/core; the other classes have each moved to a separate file in object_detection/predictors.

--
204552847 by Zhichao Lu:

Update blog post link.

--
204508028 by Zhichao Lu:

Bump down the batch size to 1024 to be a bit more tolerant to OOM and double the number of iterations. This job still converges to 20.5 mAP in 3 hours.

PiperOrigin-RevId: 206852642

* Add original post-processing back.

02a9969e

01 May, 2018 1 commit

Internal changes to slim and object detection (#4100) · 505f554c

pkulzc authored May 01, 2018

* Adding option for one_box_for_all_classes to the box_predictor

PiperOrigin-RevId: 192813444

* Extend to accept different ratios of conv channels.

PiperOrigin-RevId: 192837477

* Remove inaccurate caveat from proto file.

PiperOrigin-RevId: 192850747

* Add option to set dropout for classification net in weight shared box predictor.

PiperOrigin-RevId: 192922089

* fix flakiness in testSSDRandomCropWithMultiClassScores due to randomness.

PiperOrigin-RevId: 193067658

* Post-process now works again in train mode.

PiperOrigin-RevId: 193087707

* Adding support for reading in logits as groundtruth labels and applying an optional temperature (scaling) before softmax in support of distillation.

PiperOrigin-RevId: 193119411
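
A minimal sketch of temperature scaling before softmax, as used for distillation targets, is given below. It assumes the common convention of dividing the logits by the temperature; the function name is hypothetical and this is plain Python rather than the preprocessor's TensorFlow code.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Divide logits by temperature, then apply a numerically stable softmax."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    shift = max(scaled)  # subtract the max so exp() cannot overflow
    exps = [math.exp(x - shift) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.0, 0.1]
sharp = softmax_with_temperature(logits, temperature=1.0)
soft = softmax_with_temperature(logits, temperature=4.0)
```

A temperature above 1 flattens the distribution (`soft` is closer to uniform than `sharp`) while preserving the ranking of the classes.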

* Add a util function to visualize value histogram as a tf.summary.image.

PiperOrigin-RevId: 193137342

* Do not add batch norm parameters to final conv2d ops that predict boxes encodings and class scores in weight shared conv box predictor.

This allows us to set proper bias and force initial predictions to be background when using focal loss.

PiperOrigin-RevId: 193204364

* Make sure the final layers are also resized proportional to conv_depth_ratio.

PiperOrigin-RevId: 193228972

* Remove deprecated batch_norm_trainable field from ssd mobilenet v2 config

PiperOrigin-RevId: 193244778

* Updating coco evaluation metrics to allow for a batch of image info, rather than a single image.

PiperOrigin-RevId: 193382651

* Update protobuf requirements to 3+ in installation docs.

PiperOrigin-RevId: 193409179

* Add support for training keypoints.

PiperOrigin-RevId: 193576336

* Fix data augmentation functions.

PiperOrigin-RevId: 193737238

* Read the default batch size from config file.

PiperOrigin-RevId: 193959861

* Fixing a bug in the coco evaluator.

PiperOrigin-RevId: 193974479

* Fix incorrect num_gt_boxes_per_image and num_det_boxes_per_image values.
The values should not use the expanded dim.

PiperOrigin-RevId: 194122420

* Add option to evaluate any checkpoint (without requiring write access to that directory and overwriting any existing logs there).

PiperOrigin-RevId: 194292198

* PiperOrigin-RevId: 190346687

* - Expose slim arg_scope function to compute keys to enable testing.
- Add is_training=None option to mobilenet arg_scopes. This allows the users to set is_training from an outer scope.

PiperOrigin-RevId: 190997959

* Add an option to not set slim arg_scope for batch_norm is_training parameter. This enables users to set the is_training parameter from an outer scope.

PiperOrigin-RevId: 191611934

* PiperOrigin-RevId: 191955231

* PiperOrigin-RevId: 193254125

* PiperOrigin-RevId: 193371562

* PiperOrigin-RevId: 194085628

505f554c

13 Apr, 2018 1 commit

Add option to override base feature extractor hyperparams in SSD models. This... · decbad8a

Zhichao Lu authored Apr 05, 2018

Add option to override base feature extractor hyperparams in SSD models. This would allow us to use the same set of hyperparams for the complete feature extractor (base + new layers) if desired.

PiperOrigin-RevId: 191787921

decbad8a

04 Apr, 2018 1 commit

Merged commit includes the following changes: · 6b72b5cd

Zhichao Lu authored Apr 04, 2018

191649512  by Zhichao Lu:

    Introduce two parameters in ssd.proto - freeze_batchnorm, inplace_batchnorm_update - and set up slim arg_scopes in ssd_meta_arch.py so that they apply to all batchnorm ops in the predict() method.

    This centralizes the control of freezing and doing inplace batchnorm updates.

--
191620303  by Zhichao Lu:

    Modifications to the preprocessor to support multiclass scores

--
191610773  by Zhichao Lu:

    Adding multiclass_scores to InputDataFields and adding padding for multiclass_scores.

--
191595011  by Zhichao Lu:

    Contains implementation of the detection metric for the Open Images Challenge.

--
191449408  by Zhichao Lu:

    Change hyperparams_builder to return a callable so the users can inherit values from outer arg_scopes. This allows us to easily set batch_norm parameters like "is_training" and "inplace_batchnorm_update" for all feature extractors from the base class and propagate it correctly to the nested scopes.

--
191437008  by Zhichao Lu:

    Contains implementation of the Recall@N and MedianRank@N metrics.

--
191385254  by Zhichao Lu:

    Add config rewrite flag to eval.py

--
191382500  by Zhichao Lu:

    Fix bug for config_util.

--

PiperOrigin-RevId: 191649512

6b72b5cd

03 Apr, 2018 1 commit

Provide option to perform in-place batch norm updates for ssd feature extractors. · d2c5bfac

Zhichao Lu authored Mar 27, 2018

PiperOrigin-RevId: 190688309

d2c5bfac

22 Mar, 2018 1 commit

Internal changes for object detection. (#3656) · 001a2a61

pkulzc authored Mar 22, 2018

* Force cast of num_classes to integer

PiperOrigin-RevId: 188335318

* Updating config util to allow overwriting of cosine decay learning rates.

PiperOrigin-RevId: 188338852

* Make box_list_ops.py and box_list_ops_test.py work with C API enabled.

The C API has improved shape inference over the original Python
code. This causes some previously-working conds to fail. Switching to smart_cond fixes this.

Another effect of the improved shape inference is that one of the
failures tested gets caught earlier, so I modified the test to reflect
this.

PiperOrigin-RevId: 188409792

* Fix parallel event file writing issue.

Without this change, the event files might get corrupted when multiple evaluations are run in parallel.

PiperOrigin-RevId: 188502560

* Deprecating the boolean flag of from_detection_checkpoint.

Replace with a string field fine_tune_checkpoint_type to train_config to provide extensibility. The fine_tune_checkpoint_type can currently take value of `detection`, `classification`, or others when the restore_map is overwritten.

PiperOrigin-RevId: 188518685

* Automated g4 rollback of changelist 188502560

PiperOrigin-RevId: 188519969

* Introducing eval metrics specs for Coco Mask metrics. This allows metrics to be computed in tensorflow using the tf.learn Estimator.

PiperOrigin-RevId: 188528485

* Minor fix to make object_detection/metrics/coco_evaluation.py python3 compatible.

PiperOrigin-RevId: 188550683

* Updating eval_util to handle eval_metric_ops from multiple `DetectionEvaluator`s.

PiperOrigin-RevId: 188560474

* Allow tensor input for new_height and new_width for resize_image.

PiperOrigin-RevId: 188561908

* Fix typo in fine_tune_checkpoint_type name in trainer.

PiperOrigin-RevId: 188799033

* Adding mobilenet feature extractor to object detection.

PiperOrigin-RevId: 188916897

* Allow label maps to optionally contain an explicit background class with id zero.

PiperOrigin-RevId: 188951089

* Fix boundary conditions in random_pad_to_aspect_ratio to ensure that min_scale is always less than max_scale.

PiperOrigin-RevId: 189026868

* Fallback on from_detection_checkpoint option if fine_tune_checkpoint_type isn't set.

PiperOrigin-RevId: 189052833

* Add proper names for learning rate schedules so we don't see cryptic names on tensorboard.

PiperOrigin-RevId: 189069837

* Enforcing that all datasets are batched (and then unbatched in the model) with batch_size >= 1.

PiperOrigin-RevId: 189117178

* Adding regularization to total loss returned from DetectionModel.loss().

PiperOrigin-RevId: 189189123

* Standardize the names of loss scalars (for SSD, Faster R-CNN and R-FCN) in both training and eval so they can be compared on tensorboard.

Log localization and classification losses in evaluation.

PiperOrigin-RevId: 189189940

* Remove negative test from box list ops test.

PiperOrigin-RevId: 189229327

* Add an option to warmup learning rate in manual stepping schedule.

PiperOrigin-RevId: 189361039

* Replace tf.contrib.slim.tfexample_decoder.LookupTensor with object_detection.data_decoders.tf_example_decoder.LookupTensor.

PiperOrigin-RevId: 189388556

* Force regularization summary variables under specific family names.

PiperOrigin-RevId: 189393190

* Automated g4 rollback of changelist 188619139

PiperOrigin-RevId: 189396001

* Remove step 0 schedule since we do a hard check for it after cl/189361039

PiperOrigin-RevId: 189396697

* PiperOrigin-RevId: 189040463

* PiperOrigin-RevId: 189059229

* PiperOrigin-RevId: 189214402

* Make slim python3 compatible.

* Minor fixes.

* Add TargetAssignment summaries in a separate family.

PiperOrigin-RevId: 189407487

* 1. Setting `family` keyword arg prepends the summary names twice with the same name. Directly adding family suffix to the name gets rid of this problem.
2. Make sure the eval losses have the same name.

PiperOrigin-RevId: 189434618

* Minor fixes to make object detection tf 1.4 compatible.

PiperOrigin-RevId: 189437519

* Call the base of mobilenet_v1 feature extractor under the right arg scope and set batchnorm is_training based on the value passed in the constructor.

PiperOrigin-RevId: 189460890

* Automated g4 rollback of changelist 188409792

PiperOrigin-RevId: 189463882

* Update object detection syncing.

PiperOrigin-RevId: 189601955

* Add an option to warmup learning rate, hold it constant for a certain number of steps and cosine decay it.

PiperOrigin-RevId: 189606169
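
The warmup, hold and cosine decay phases can be sketched as a plain-Python function of the step. This is an illustration under assumptions: the function and parameter names are hypothetical, warmup is taken to be linear, and the cosine phase is assumed to decay to zero at `total_steps`.

```python
import math


def warmup_hold_cosine_lr(step, base_lr, total_steps, warmup_lr=0.0,
                          warmup_steps=0, hold_steps=0):
    """Learning rate at `step`: linear warmup, constant hold, cosine decay."""
    if total_steps < warmup_steps + hold_steps:
        raise ValueError("total_steps must cover warmup and hold phases")
    if step < warmup_steps:
        # Phase 1: ramp linearly from warmup_lr up to base_lr.
        slope = (base_lr - warmup_lr) / warmup_steps
        return warmup_lr + slope * step
    if step < warmup_steps + hold_steps:
        # Phase 2: hold the base rate constant.
        return base_lr
    decay_steps = total_steps - warmup_steps - hold_steps
    if decay_steps == 0:
        return base_lr
    # Phase 3: half-cosine decay from base_lr towards zero.
    progress = min(1.0, (step - warmup_steps - hold_steps) / decay_steps)
    return 0.5 * base_lr * (1.0 + math.cos(math.pi * progress))


config = dict(base_lr=0.01, total_steps=1000, warmup_lr=0.001,
              warmup_steps=100, hold_steps=50)
rates = [warmup_hold_cosine_lr(s, **config) for s in (0, 100, 149, 575, 1000)]
```

With this configuration the rate starts at `warmup_lr`, reaches `base_lr` at step 100, stays there through step 149, is about half of `base_lr` midway through the decay, and is effectively zero at the final step.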

* Let the proposal feature extractor function in faster_rcnn meta architectures return the activations (end_points).

PiperOrigin-RevId: 189619301

* Fixed bug which caused masks to be mostly zeros (caused by detection_boxes being in absolute coordinates if scale_to_absolute=True).

PiperOrigin-RevId: 189641294

* Open sourcing Mobilenetv2 + SSDLite.

PiperOrigin-RevId: 189654520

* Remove unused files.

001a2a61

27 Feb, 2018 1 commit

Merged commit includes the following changes: · 78d5f8f8

Zhichao Lu authored Feb 27, 2018

187187978  by Zhichao Lu:

    Only updating hyperparameters if they have non-null values.

--
187097690  by Zhichao Lu:

    Rewrite some conditions a bit more clearly.

--
187085190  by Zhichao Lu:

    More informative error message.

--
186935376  by Zhichao Lu:

    Added option to evaluator.evaluate to use custom evaluator objects.

--
186808249  by Zhichao Lu:

    Fix documentation re: number of stages.

--
186775014  by Zhichao Lu:

    Change anchor generator interface to return a list of BoxLists containing anchors for different feature map layers.

--
186729028  by Zhichao Lu:

    Minor fixes to object detection.

--
186723716  by Zhichao Lu:

    Fix tf_example_decoder.py initialization issue.

--
186668505  by Zhichao Lu:

    Remove unused import.

--
186475361  by Zhichao Lu:

    Update the box predictor interface to return list of predictions - one from each feature map - instead of stacking them into one large tensor.

--
186410844  by Zhichao Lu:

    Fix PythonPath Dependencies.

--
186365384  by Zhichao Lu:

    Made some of the functions in exporter public so they can be reused.

--
186341438  by Zhichao Lu:

    Re-introducing check that label-map-path must be a valid (non-empty) string prior to overwriting pipeline config.

--
186036984  by Zhichao Lu:

    Adding default hyperparameters and allowing for overriding them via flags.

--
186026006  by Zhichao Lu:

    Strip `eval_` prefix from name argument given to TPUEstimator.evaluate since it adds the same prefix internally.

--
186016042  by Zhichao Lu:

    Add an option to evaluate models on training data.

--
185944986  by Zhichao Lu:

    Let _update_label_map_path go through even if the path is empty.

--
185860781  by Zhichao Lu:

    Add random normal initializer option to hyperparams builder.

    Scale the regression losses outside of the box encoder by adjusting huber loss delta and regression loss weight.

--
185846325  by Zhichao Lu:

    Add an option to normalize localization loss by the code size (number of box coordinates) in SSD Meta architecture.

--
185761217  by Zhichao Lu:

    Change multiscale_grid_anchor_generator to return anchors in normalized coordinates by default and add option to configure it.

    In SSD meta architecture, TargetAssigner operates in normalized coordinate space (i.e, groundtruth boxes are in normalized coordinates) hence we need the option to generate anchors in normalized coordinates.

--
185747733  by Zhichao Lu:

    Change the smooth L1 localization implementation to use tf.losses.huber_loss and expose the delta parameter in the proto.
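
For reference, the Huber loss for a single residual can be sketched in plain Python as below. This is an illustration of the standard formula rather than the loss code in this change; with `delta=1.0` it coincides with the usual smooth L1 definition.

```python
def huber_loss(error, delta=1.0):
    """Quadratic near zero, linear beyond |error| > delta."""
    abs_error = abs(error)
    if abs_error <= delta:
        return 0.5 * error * error
    return delta * abs_error - 0.5 * delta * delta


small = huber_loss(0.5)            # quadratic region
large = huber_loss(2.0)            # linear region with delta = 1
tight = huber_loss(2.0, delta=0.5) # smaller delta switches to linear sooner
```

Exposing `delta` lets the transition point between the quadratic and linear regions be tuned from the config.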

--
185715309  by Zhichao Lu:

    Obviates the need for prepadding on mobilenet v1 and v2 for fully convolutional models.

--
185685695  by Zhichao Lu:

    Fix manual stepping schedule to return first rate when there are no boundaries
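
The fixed behavior can be illustrated with a small sketch. The function below is hypothetical and assumes that a rate takes effect once the step reaches its boundary; with an empty boundary list it simply returns the first (and only) rate.

```python
import bisect


def manual_stepping(step, boundaries, rates):
    """Piecewise-constant rate; needs exactly one more rate than boundaries."""
    if len(rates) != len(boundaries) + 1:
        raise ValueError("expected len(rates) == len(boundaries) + 1")
    # Count how many boundaries have been reached at this step.
    return rates[bisect.bisect_right(boundaries, step)]


no_boundaries = manual_stepping(500, [], [0.1])
before_first = manual_stepping(99, [100, 200], [0.1, 0.01, 0.001])
at_first = manual_stepping(100, [100, 200], [0.1, 0.01, 0.001])
```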

--
185621650  by Zhichao Lu:

    Added target assigner proto for configuring negative class weights.

--

PiperOrigin-RevId: 187187978

78d5f8f8

14 Feb, 2018 1 commit

Add encode_background_as_zeros option to the SSDMetaArch class --- now clients... · 930ccd92

Zhichao Lu authored Feb 09, 2018

Add encode_background_as_zeros option to the SSDMetaArch class --- now clients have the option of encoding background targets as an all zeros vector or a one-hot vector with the 0th dimension corresponding to a background prediction.

PiperOrigin-RevId: 185228281
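
The two background encodings can be sketched as follows. The helper is hypothetical, and the vector lengths are an assumption: `num_classes` entries when background is all zeros, and `num_classes + 1` entries when index 0 is reserved for background.

```python
def encode_class_target(class_id, num_classes, encode_background_as_zeros):
    """class_id is 0 for background and 1..num_classes for object classes."""
    if not 0 <= class_id <= num_classes:
        raise ValueError("class_id out of range")
    if encode_background_as_zeros:
        # Background is the all-zeros vector; no slot is reserved for it.
        target = [0.0] * num_classes
        if class_id > 0:
            target[class_id - 1] = 1.0
        return target
    # One-hot vector whose 0th entry stands for background.
    target = [0.0] * (num_classes + 1)
    target[class_id] = 1.0
    return target


background_zeros = encode_class_target(0, 3, encode_background_as_zeros=True)
background_one_hot = encode_class_target(0, 3, encode_background_as_zeros=False)
```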

930ccd92

10 Feb, 2018 1 commit

Merged commit includes the following changes: · 1efe98bb

Zhichao Lu authored Feb 09, 2018

185215255  by Zhichao Lu:

    Stop populating image/object/class/text field when generating COCO tf record.

--
185213306  by Zhichao Lu:

    Use the params batch size and not the one from train_config in input_fn

--
185209081  by Zhichao Lu:

    Handle the case when there are no ground-truth masks for an image.

--
185195531  by Zhichao Lu:

    Remove unstack and stack operations on features from third_party/object_detection/model.py.

--
185195017  by Zhichao Lu:

    Matrix multiplication based gather op implementation.
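
The idea behind a matrix multiplication based gather can be sketched in plain Python: build a one-hot selector row per index and multiply it with the parameter matrix. This is an illustration only; the helper names are hypothetical and the actual op operates on tensors.

```python
def one_hot(index, depth):
    """Row vector with a single 1.0 at position `index`."""
    return [1.0 if i == index else 0.0 for i in range(depth)]


def matmul(a, b):
    """Naive dense matrix product of two lists of rows."""
    inner, cols = len(b), len(b[0])
    return [[sum(row[k] * b[k][j] for k in range(inner)) for j in range(cols)]
            for row in a]


def matmul_gather(params, indices):
    """Gather rows of `params` using one-hot selection and a matmul."""
    selector = [one_hot(i, len(params)) for i in indices]
    return matmul(selector, params)


params = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
gathered = matmul_gather(params, [2, 0, 2])
```

The result matches ordinary row indexing (`[params[i] for i in indices]`), but uses only dense arithmetic, which is the property that matters for accelerators that handle matmul better than dynamic indexing.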

--
185187744  by Zhichao Lu:

    Fix eval_util minor issue.

--
185098733  by Zhichao Lu:

    Internal change

--
185076656  by Zhichao Lu:

    Increase the number of boxes for coco17.

--
185074199  by Zhichao Lu:

    Add config for SSD Resnet50 v1 with FPN.

--
185060199  by Zhichao Lu:

    Fix a bug in clear_detections.
    This method set detection_keys to an empty dictionary instead of an empty set. I've refactored so that this ...

1efe98bb

01 Feb, 2018 1 commit

Merged commit includes the following changes: · 7a9934df

Zhichao Lu authored Jan 31, 2018

184048729  by Zhichao Lu:

    Modify target_assigner so that it creates regression targets taking keypoints into account.

--
184027183  by Zhichao Lu:

    Resnet V1 FPN based feature extractors for SSD meta architecture in Object Detection V2 API.

--
184004730  by Zhichao Lu:

    Expose a lever to override the configured mask_type.

--
183933113  by Zhichao Lu:

    Weight shared convolutional box predictor as described in https://arxiv.org/abs/1708.02002

--
183929669  by Zhichao Lu:

    Expanding box list operations for future data augmentations.

--
183916792  by Zhichao Lu:

    Fix unrecognized assertion function in tests.

--
183906851  by Zhichao Lu:

    - Change ssd meta architecture to use regression weights to compute loss normalizer.

--
183871003  by Zhichao Lu:

    Fix config_util_test wrong dependency.

--
183782120  by Zhichao Lu:

    Add __init__ file to third_party directories.

--
183779109  by Zhichao Lu:

    Setup regular version s...

7a9934df

27 Oct, 2017 1 commit

update protos. · 9adf0242

Vivek Rathod authored Oct 27, 2017

9adf0242

21 Sep, 2017 1 commit

Move the research models into a research subfolder (#2430) · f87a58cd

Neal Wu authored Sep 21, 2017

f87a58cd

15 Jun, 2017 1 commit

Add Tensorflow Object Detection API. (#1561) · a4944a57

derekjchow authored Jun 14, 2017

For details see our paper:
"Speed/accuracy trade-offs for modern convolutional object detectors."
Huang J, Rathod V, Sun C, Zhu M, Korattikara A, Fathi A, Fischer I,
Wojna Z, Song Y, Guadarrama S, Murphy K, CVPR 2017
https://arxiv.org/abs/1611.10012

a4944a57