Commits · fe6536b78e6394cd318f9fbab9d594d87eee347b · ModelZoo / ResNet50_tensorflow

23 Aug, 2018 1 commit
- Updated model_lib to use min_score_threshold and max_num_boxes_to_visualize from the eval config. · fe6536b7
  Cameron Rudnick authored Aug 22, 2018
```
Updated model_lib to use min_score_threshold and max_num_boxes_to_visualize from the eval config.
```
  fe6536b7
08 Aug, 2018 1 commit

Update object detection post processing and fixes boxes padding/clipping issue. (#5026) · 59f7e80a

pkulzc authored Aug 07, 2018

* Merged commit includes the following changes:
207771702 by Zhichao Lu:

Refactoring evaluation utilities so that it is easier to introduce new DetectionEvaluators with eval_metric_ops.

--
207758641 by Zhichao Lu:

Require tensorflow version 1.9+ for running object detection API.

--
207641470 by Zhichao Lu:

Clip `num_groundtruth_boxes` in pad_input_data_to_static_shapes() to `max_num_boxes`. This prevents a scenario where tensors are sliced to an invalid range in model_lib.unstack_batch().

--
207621728 by Zhichao Lu:

This CL adds a FreezableBatchNorm that inherits from the Keras BatchNormalization layer, but supports freezing the `training` parameter at construction time instead of having to do it in the `call` method.

It also adds a method to the `KerasLayerHyperparams` class that will build an appropriate FreezableBatchNorm layer according to the hyperparameter configuration. If batch_norm is disabled, this method returns and Identity layer.

These will be used to simplify the conversion to Keras APIs.

--
207610524 by Zhichao Lu:

Update anchor generators and box predictors for python3 compatibility.

--
207585122 by Zhichao Lu:

Refactoring convolutional box predictor into separate prediction heads.

--
207549305 by Zhichao Lu:

Pass all 1s for batch weights if nothing is specified in GT.

--
207336575 by Zhichao Lu:

Move the new argument 'target_assigner_instance' to the end of the list of arguments to the ssd_meta_arch constructor for backwards compatibility.

--
207327862 by Zhichao Lu:

Enable support for float output in quantized custom op for postprocessing in SSD Mobilenet model.

--
207323154 by Zhichao Lu:

Bug fix: change dict.iteritems() to dict.items()

--
207301109 by Zhichao Lu:

Integrating expected_classification_loss_under_sampling op as an option in the ssd_meta_arch

--
207286221 by Zhichao Lu:

Adding an option to weight regression loss with foreground scores from the ground truth labels.

--
207231739 by Zhichao Lu:

Explicitly mentioning the argument names when calling the batch target assigner.

--
207206356 by Zhichao Lu:

Add include_trainable_variables field to train config to better handle trainable variables.

--
207135930 by Zhichao Lu:

Internal change.

--
206862541 by Zhichao Lu:

Do not unpad the outputs from batch_non_max_suppression before sampling.

Since BalancedPositiveNegativeSampler takes an indicator for valid positions to sample from we can pass the output from NMS directly into Sampler.

PiperOrigin-RevId: 207771702

* Remove unused doc.

59f7e80a

02 Aug, 2018 1 commit

Bug fix: change dict.iteritems() to dict.items() · e2ea9eb4

Abdullah Alrasheed authored Aug 02, 2018

`iteritems()` was removed from python3. `items()` does the same functionality so changing it will work in both python2 and python3. The only difference as far as I know is `iteritems()` returns a generator where `items` returns a list. But for this this code it will not make any difference where we are just changing the key of the dict to a string.

e2ea9eb4

01 Aug, 2018 1 commit

Refactor object detection box predictors and fix some issues with model_main. (#4965) · 02a9969e

pkulzc authored Aug 01, 2018

* Merged commit includes the following changes:
206852642 by Zhichao Lu:

Build the balanced_positive_negative_sampler in the model builder for FasterRCNN. Also adds an option to use the static implementation of the sampler.

--
206803260 by Zhichao Lu:

Fixes a misplaced argument in resnet fpn feature extractor.

--
206682736 by Zhichao Lu:

This CL modifies the SSD meta architecture to support both Slim-based and Keras-based box predictors, and begins preparation for Keras box predictor support in the other meta architectures.

Concretely, this CL adds a new `KerasBoxPredictor` base class and makes the meta architectures appropriately call whichever box predictors they are using.

We can switch the non-ssd meta architectures to fully support Keras box predictors once the Keras Convolutional Box Predictor CL is submitted.

--
206669634 by Zhichao Lu:

Adds an alternate method for balanced positive negative sampler using static shapes.

--
206643278 by Zhichao Lu:

This CL adds a Keras layer hyperparameter configuration object to the hyperparams_builder.

It automatically converts from Slim layer hyperparameter configs to Keras layer hyperparameters. Namely, it:
- Builds Keras initializers/regularizers instead of Slim ones
- sets weights_regularizer/initializer to kernel_regularizer/initializer
- converts batchnorm decay to momentum
- converts Slim l2 regularizer weights to the equivalent Keras l2 weights

This will be used in the conversion of object detection feature extractors & box predictors to newer Tensorflow APIs.

--
206611681 by Zhichao Lu:

Internal changes.

--
206591619 by Zhichao Lu:

Clip the to shape when the input tensors are larger than the expected padded static shape

--
206517644 by Zhichao Lu:

Make MultiscaleGridAnchorGenerator more consistent with MultipleGridAnchorGenerator.

--
206415624 by Zhichao Lu:

Make the hardcoded feature pyramid network (FPN) levels configurable for both SSD
Resnet and SSD Mobilenet.

--
206398204 by Zhichao Lu:

This CL modifies the SSD meta architecture to support both Slim-based and Keras-based feature extractors.

This allows us to begin the conversion of object detection to newer Tensorflow APIs.

--
206213448 by Zhichao Lu:

Adding a method to compute the expected classification loss by background/foreground weighting.

--
206204232 by Zhichao Lu:

Adding the keypoint head to the Mask RCNN pipeline.

--
206200352 by Zhichao Lu:

- Create Faster R-CNN target assigner in the model builder. This allows configuring matchers in Target assigner to use TPU compatible ops (tf.gather in this case) without any change in meta architecture.
- As a +ve side effect of the refactoring, we can now re-use a single target assigner for all of second stage heads in Faster R-CNN.

--
206178206 by Zhichao Lu:

Force ssd feature extractor builder to use keyword arguments so values won't be passed to wrong arguments.

--
206168297 by Zhichao Lu:

Updating exporter to use freeze_graph.freeze_graph_with_def_protos rather than a homegrown version.

--
206080748 by Zhichao Lu:

Merge external contributions.

--
206074460 by Zhichao Lu:

Update to preprocessor to apply temperature and softmax to the multiclass scores on read.

--
205960802 by Zhichao Lu:

Fixing a bug in hierarchical label expansion script.

--
205944686 by Zhichao Lu:

Update exporter to support exporting quantized model.

--
205912529 by Zhichao Lu:

Add a two stage matcher to allow for thresholding by one criteria and then argmaxing on the other.

--
205909017 by Zhichao Lu:

Add test for grayscale image_resizer

--
205892801 by Zhichao Lu:

Add flag to decide whether to apply batch norm to conv layers of weight shared box predictor.

--
205824449 by Zhichao Lu:

make sure that by default mask rcnn box predictor predicts 2 stages.

--
205730139 by Zhichao Lu:

Updating warning message to be more explicit about variable size mismatch.

--
205696992 by Zhichao Lu:

Remove utils/ops.py's dependency on core/box_list_ops.py. This will allow re-using TPU compatible ops from utils/ops.py in core/box_list_ops.py.

--
205696867 by Zhichao Lu:

Refactoring mask rcnn predictor so have each head in a separate file.
This CL lets us to add new heads more easily in the future to mask rcnn.

--
205492073 by Zhichao Lu:

Refactor R-FCN box predictor to be TPU compliant.

- Change utils/ops.py:position_sensitive_crop_regions to operate on single image and set of boxes without `box_ind`
- Add a batch version that operations on batches of images and batches of boxes.
- Refactor R-FCN box predictor to use the batched version of position sensitive crop regions.

--
205453567 by Zhichao Lu:

Fix bug that cannot export inference graph when write_inference_graph flag is True.

--
205316039 by Zhichao Lu:

Changing input tensor name.

--
205256307 by Zhichao Lu:

Fix model zoo links for quantized model.

--
205164432 by Zhichao Lu:

Fixes eval error when label map contains non-ascii characters.

--
205129842 by Zhichao Lu:

Adds a option to clip the anchors to the window size without filtering the overlapped boxes in Faster-RCNN

--
205094863 by Zhichao Lu:

Update to label map util to allow the option of adding a background class and fill in gaps in the label map. Useful for using multiclass scores which require a complete label map with explicit background label.

--
204989032 by Zhichao Lu:

Add tf.prof support to exporter.

--
204825267 by Zhichao Lu:

Modify mask rcnn box predictor tests for TPU compatibility.

--
204778749 by Zhichao Lu:

Remove score filtering from postprocessing.py and rely on filtering logic in tf.image.non_max_suppression

--
204775818 by Zhichao Lu:

Python3 fixes for object_detection.

--
204745920 by Zhichao Lu:

Object Detection Dataset visualization tool (documentation).

--
204686993 by Zhichao Lu:

Internal changes.

--
204559667 by Zhichao Lu:

Refactor box_predictor.py into multiple files.
The abstract base class remains in the object_detection/core, The other classes have moved to a separate file each in object_detection/predictors

--
204552847 by Zhichao Lu:

Update blog post link.

--
204508028 by Zhichao Lu:

Bump down the batch size to 1024 to be a bit more tolerant to OOM and double the number of iterations. This job still converges to 20.5 mAP in 3 hours.

PiperOrigin-RevId: 206852642

* Add original post-processing back.

02a9969e

13 Jul, 2018 1 commit

Merged commit includes the following changes: · 85dd5fa4

Zhichao Lu authored Jul 13, 2018

204489224  by Zhichao Lu:

    Modify ssd mobilenet v1 fpn config to be a bit more tolerant to OOM failure by bumping down the batch size to 64 and doubling the number of iterations to 25k. It now converges in 2.5 hours.

--
204488942  by Zhichao Lu:

    Internal change

204480631  by Zhichao Lu:

    This CL makes sure that num_steps parameter are not updated to 0 if num_steps field is not mentioned in config.

    The default behavior for number of steps parameter for training is infinite (train forever). The default value num_steps in train.proto is 0 (for training indefinitely). However the estimator/training function expects the num_steps to be set to None to train indefinitely.

--
204437217  by Zhichao Lu:

    Create a Docker image to support TensorFlow Lite / Object Detection blog post.

--
204317570  by Zhichao Lu:

    Internal change

PiperOrigin-RevId: 204489224

85dd5fa4

02 Jul, 2018 1 commit

Open Images Challenge 2018 tools, minor fixes and refactors. (#4661) · 32e7d660

pkulzc authored Jul 02, 2018

* Merged commit includes the following changes:
202804536 by Zhichao Lu:

Return tf.data.Dataset from input_fn that goes into the estimator and use PER_HOST_V2 option for tpu input pipeline config.

This change shaves off 100ms per step resulting in 25 minutes of total reduced training time for ssd mobilenet v1 (15k steps to convergence).

--
202769340 by Zhichao Lu:

Adding as_matrix() transformation for image-level labels.

--
202768721 by Zhichao Lu:

Challenge evaluation protocol modification: adding labelmaps creation.

--
202750966 by Zhichao Lu:

Add the explicit names to two output nodes.

--
202732783 by Zhichao Lu:

Enforcing that batch size is 1 for evaluation, and no original images are retained during evaluation when use_tpu=False (to avoid dynamic shapes).

--
202425430 by Zhichao Lu:

Refactor input pipeline to improve performance.

--
202406389 by Zhichao Lu:

Only check the validity of `warmup_learning_rate` if it will be used.

--
202330450 by Zhichao Lu:

Adding the description of the flag input_image_label_annotations_csv to add
image-level labels to tf.Example.

--
202029012 by Zhichao Lu:

Enabling displaying relationship name in the final metrics output.

--
202024010 by Zhichao Lu:

Update to the public README.

--
201999677 by Zhichao Lu:

Fixing the way negative labels are handled in VRD evaluation.

--
201962313 by Zhichao Lu:

Fix a bug in resize_to_range.

--
201808488 by Zhichao Lu:

Update ssd_inception_v2_pets.config to use right filename of pets dataset tf records.

--
201779225 by Zhichao Lu:

Update object detection API installation doc

--
201766518 by Zhichao Lu:

Add shell script to create pycocotools package for CMLE.

--
201722377 by Zhichao Lu:

Removes verified_labels field and uses groundtruth_image_classes field instead.

--
201616819 by Zhichao Lu:

Disable eval_on_tpu since eval_metrics is not setup to execute on TPU.
Do not use run_config.task_type to switch tpu mode for EVAL,
since that won't work in unit test.
Expand unit test to verify that the same instantiation of the Estimator can independently disable eval on TPU whereas training is enabled on TPU.

--
201524716 by Zhichao Lu:

Disable export model to TPU, inference is not compatible with TPU.
Add GOOGLE_INTERNAL support in object detection copy.bara.sky

--
201453347 by Zhichao Lu:

Fixing bug when evaluating the quantized model.

--
200795826 by Zhichao Lu:

Fixing parsing bug: image-level labels are parsed as tuples instead of numpy
array.

--
200746134 by Zhichao Lu:

Adding image_class_text and image_class_label fields into tf_example_decoder.py

--
200743003 by Zhichao Lu:

Changes to model_main.py and model_tpu_main to enable training and continuous eval.

--
200736324 by Zhichao Lu:

Replace deprecated squeeze_dims argument.

--
200730072 by Zhichao Lu:

Make detections only during predict and eval mode while creating model function

--
200729699 by Zhichao Lu:

Minor correction to internal documentation (definition of Huber loss)

--
200727142 by Zhichao Lu:

Add command line parsing as a set of flags using argparse and add header to the
resulting file.

--
200726169 by Zhichao Lu:

A tutorial on running evaluation for the Open Images Challenge 2018.

--
200665093 by Zhichao Lu:

Cleanup on variables_helper_test.py.

--
200652145 by Zhichao Lu:

Add an option to write (non-frozen) graph when exporting inference graph.

--
200573810 by Zhichao Lu:

Update ssd_mobilenet_v1_coco and ssd_inception_v2_coco download links to point to a newer version.

--
200498014 by Zhichao Lu:

Add test for groundtruth mask resizing.

--
200453245 by Zhichao Lu:

Cleaning up exporting_models.md along with exporting scripts

--
200311747 by Zhichao Lu:

Resize groundtruth mask to match the size of the original image.

--
200287269 by Zhichao Lu:

Having a option to use custom MatMul based crop_and_resize op as an alternate to the TF op in Faster-RCNN

--
200127859 by Zhichao Lu:

Updating the instructions to run locally with new binary. Also updating pets configs since file path naming has changed.

--
200127044 by Zhichao Lu:

A simpler evaluation util to compute Open Images Challenge
2018 metric (object detection track).

--
200124019 by Zhichao Lu:

Freshening up configuring_jobs.md

--
200086825 by Zhichao Lu:

Make merge_multiple_label_boxes work for ssd model.

--
199843258 by Zhichao Lu:

Allows inconsistent feature channels to be compatible with WeightSharedConvolutionalBoxPredictor.

--
199676082 by Zhichao Lu:

Enable an override for `InputReader.shuffle` for object detection pipelines.

--
199599212 by Zhichao Lu:

Markdown fixes.

--
199535432 by Zhichao Lu:

Pass num_additional_channels to tf.example decoder in predict_input_fn.

--
199399439 by Zhichao Lu:

Adding `num_additional_channels` field to specify how many additional channels to use in the model.

PiperOrigin-RevId: 202804536

* Add original model builder and docs back.

32e7d660

06 Jun, 2018 1 commit

Merged commit includes the following changes: · 9fce9c64

Zhichao Lu authored Jun 05, 2018

199348852 by Zhichao Lu:

Small typos fixes in VRD evaluation.

--
199315191 by Zhichao Lu:

Change padding shapes when additional channels are available.

--
199309180 by Zhichao Lu:

Adds minor fixes to the Object Detection API implementation.

--
199298605 by Zhichao Lu:

Force num_readers to be 1 when only input file is not sharded.

--
199292952 by Zhichao Lu:

Adds image-level labels parsing into TfExampleDetectionAndGTParser.

--
199259866 by Zhichao Lu:

Visual Relationships Evaluation executable.

--
199208330 by Zhichao Lu:

Infer train_config.batch_size as the effective batch size. Therefore we need to divide the effective batch size in trainer by train_config.replica_to_aggregate to get per worker batch size.

--
199207842 by Zhichao Lu:

Internal change.

--
199204222 by Zhichao Lu:

In case the image has more than three channels, we only take the first three channels for visualization.

--
199194388 by Zhichao Lu:

Correcting protocols description: VOC 2007 -> VOC 2012.

--
199188290 by Zhichao Lu:

Adds per-relationship APs and mAP computation to VRD evaluation.

--
199158801 by Zhichao Lu:

If available, additional channels are merged with input image.

--
199099637 by Zhichao Lu:

OpenImages Challenge metric support:
-adding verified labels standard field for TFExample;
-adding tfrecord creation functionality.

--
198957391 by Zhichao Lu:

Allow tf record sharding when creating pets dataset.

--
198925184 by Zhichao Lu:

Introduce moving average support for evaluation. Also adding the ability to override this configuration via config_util.

--
198918186 by Zhichao Lu:

Handles the case where there are 0 box masks.

--
198809009 by Zhichao Lu:

Plumb groundtruth weights into target assigner for Faster RCNN.

--
198759987 by Zhichao Lu:

Fix object detection test broken by shape inference.

--
198668602 by Zhichao Lu:

Adding a new input field in data_decoders/tf_example_decoder.py for storing additional channels.

--
198530013 by Zhichao Lu:

An util for hierarchical expandion of boxes and labels of OID dataset.

--
198503124 by Zhichao Lu:

Fix dimension mismatch error introduced by
https://github.com/tensorflow/tensorflow/pull/18251, or cl/194031845.
After above change, conv2d strictly checks for conv_dims + 2 == input_rank.

--
198445807 by Zhichao Lu:

Enabling Object Detection Challenge 2018 metric in evaluator.py framework for
running eval job.
Renaming old OpenImages V2 metric.

--
198413950 by Zhichao Lu:

Support generic configuration override using namespaced keys

Useful for adding custom hyper-parameter tuning fields without having to add custom override methods to config_utils.py.

--
198106437 by Zhichao Lu:

Enable fused batchnorm now that quantization is supported.

--
198048364 by Zhichao Lu:

Add support for keypoints in tf sequence examples and some util ops.

--
198004736 by Zhichao Lu:

Relax postprocessing unit tests that are based on assumption that tf.image.non_max_suppression are stable with respect to input.

--
197997513 by Zhichao Lu:

More lenient validation for normalized box boundaries.

--
197940068 by Zhichao Lu:

A couple of minor updates/fixes:
- Updating input reader proto with option to use display_name when decoding data.
- Updating visualization tool to specify whether using absolute or normalized box coordinates. Appropriate boxes will now appear in TB when using model_main.py

--
197920152 by Zhichao Lu:

Add quantized training support in the new OD binaries and a config for SSD Mobilenet v1 quantized training that is TPU compatible.

--
197213563 by Zhichao Lu:

Do not share batch_norm for classification and regression tower in weight shared box predictor.

--
197196757 by Zhichao Lu:

Relax the box_predictor api to return box_prediction of shape [batch_size, num_anchors, code_size] in addition to [batch_size, num_anchors, (1|q), code_size].

--
196898361 by Zhichao Lu:

Allow per-channel scalar value to pad input image with when using keep aspect ratio resizer (when pad_to_max_dimension=True).

In Object Detection Pipeline, we pad image before normalization and this skews batch_norm statistics during training. The option to set per channel pad value lets us truly pad with zeros.

--
196592101 by Zhichao Lu:

Fix bug regarding tfrecord shuffling in object_detection

--
196320138 by Zhichao Lu:

Fix typo in exporting_models.md

PiperOrigin-RevId: 199348852

9fce9c64

11 May, 2018 1 commit

Merged commit includes the following changes: · 324d6dc3

Zhichao Lu authored May 10, 2018

196161788  by Zhichao Lu:

    Add eval_on_train_steps parameter.

    Since the number of samples in train dataset is usually different to the number of samples in the eval dataset.

--
196151742  by Zhichao Lu:

    Add an optional random sampling process for SSD meta arch and update mean stddev coder to use default std dev when corresponding tensor is not added to boxlist field.

--
196148940  by Zhichao Lu:

    Release ssdlite mobilenet v2 coco trained model.

--
196058528  by Zhichao Lu:

    Apply FPN feature map generation before we add additional layers on top of resnet feature extractor.

--
195818367  by Zhichao Lu:

    Add support for exporting detection keypoints.

--
195745420  by Zhichao Lu:

    Introduce include_metrics_per_category option to Object Detection eval_config.

--
195734733  by Zhichao Lu:

    Rename SSDLite config to be more explicit.

--
195717383  by Zhichao Lu:

    Add quantized training to object_detection.

--
195683542  by...

324d6dc3

03 May, 2018 1 commit

Merged commit includes the following changes: · 63054210

Zhichao Lu authored May 03, 2018

195269567 by Zhichao Lu:

Removing image summaries during train mode.

--
195147413 by Zhichao Lu:

SSDLite config for mobilenet v2.

--
194883585 by Zhichao Lu:

Simplify TPU compatible nearest neighbor upsampling using reshape and broadcasting.

--
194851009 by Zhichao Lu:

Include ava v2.1 detection models in model zoo.

--
194292198 by Zhichao Lu:

Add option to evaluate any checkpoint (without requiring write access to that directory and overwriting any existing logs there).

--
194122420 by Zhichao Lu:

num_gt_boxes_per_image and num_det_boxes_per_image value incorrect.
Should be not the expand dim.

--
193974479 by Zhichao Lu:

Fixing a bug in the coco evaluator.

--
193959861 by Zhichao Lu:

Read the default batch size from config file.

--
193737238 by Zhichao Lu:

Fix data augmentation functions.

--
193576336 by Zhichao Lu:

Add support for training keypoints.

--
193409179 by Zhichao Lu:

Update protobuf requirements to 3+ in installation docs.

--
193382651 by Zhichao Lu:

Updating coco evaluation metrics to allow for a batch of image info, rather than a single image.

--
193244778 by Zhichao Lu:

Remove deprecated batch_norm_trainable field from ssd mobilenet v2 config

--
193228972 by Zhichao Lu:

Make sure the final layers are also resized proportional to conv_depth_ratio.

--
193204364 by Zhichao Lu:

Do not add batch norm parameters to final conv2d ops that predict boxes encodings and class scores in weight shared conv box predictor.

This allows us to set proper bias and force initial predictions to be background when using focal loss.

--
193137342 by Zhichao Lu:

Add a util function to visualize value histogram as a tf.summary.image.

--
193119411 by Zhichao Lu:

Adding support for reading in logits as groundtruth labels and applying an optional temperature (scaling) before softmax in support of distillation.

--
193087707 by Zhichao Lu:

Post-process now works again in train mode.

--
193067658 by Zhichao Lu:

fix flakiness in testSSDRandomCropWithMultiClassScores due to randomness.

--
192922089 by Zhichao Lu:

Add option to set dropout for classification net in weight shared box predictor.

--
192850747 by Zhichao Lu:

Remove inaccurate caveat from proto file.

--
192837477 by Zhichao Lu:

Extend to accept different ratios of conv channels.

--
192813444 by Zhichao Lu:

Adding option for one_box_for_all_classes to the box_predictor

--
192624207 by Zhichao Lu:

Update to trainer to allow for reading multiclass scores

--
192583425 by Zhichao Lu:

Contains implementation of Visual Relations Detection evaluation metric (per
image evaluation).

--
192529600 by Zhichao Lu:

Modify the ssd meta arch to allow the option of not adding an implicit background class.

--
192512429 by Zhichao Lu:

Refactor model_tpu_main.py files and move continuous eval loop into model_lib.py

--
192494267 by Zhichao Lu:

Update create_pascal_tf_record.py and create_pet_tf_record.py

--
192485456 by Zhichao Lu:

Enforcing that all eval metric ops have valid python strings.

--
192472546 by Zhichao Lu:

Set regularize_depthwise to true in mobilenet_v1_argscope.

--
192421843 by Zhichao Lu:

Refactoring of Mask-RCNN to put all mask prediction code in third stage.

--
192320460 by Zhichao Lu:

Returning eval_on_train_input_fn from create_estimator_and_inputs(), rather than using train_input_fn in EVAL mode (which will still have data augmentation).

--
192226678 by Zhichao Lu:

Access TPUEstimator and CrossShardOptimizer from tf namesspace.

--
192195514 by Zhichao Lu:

Fix test that was flaky due to randomness

--
192166224 by Zhichao Lu:

Minor fixes to match git repo.

--
192147130 by Zhichao Lu:

use shape utils for assertion in feature extractor.

--
192132440 by Zhichao Lu:

Class agnostic masks for mask_rcnn

--
192006190 by Zhichao Lu:

Add learning rate summary in EVAL mode in model.py

--
192004845 by Zhichao Lu:

Migrating away from Experiment class, as it is now deprecated. Also, refactoring into a separate model library and binaries.

--
191957195 by Zhichao Lu:

Add classification_loss and localiztion_loss metrics for TPU jobs.

--
191932855 by Zhichao Lu:

Add an option to skip the last striding in mobilenet. The modified network has nominal output stride 16 instead of 32.

--
191787921 by Zhichao Lu:

Add option to override base feature extractor hyperparams in SSD models. This would allow us to use the same set of hyperparams for the complete feature extractor (base + new layers) if desired.

--
191743097 by Zhichao Lu:

Adding an attribute to SSD model to indicate which fields in prediction dictionary have a batch dimension. This will be useful for future video models.

--
191668425 by Zhichao Lu:

Internal change.

--
191649512 by Zhichao Lu:

Introduce two parameters in ssd.proto - freeze_batchnorm, inplace_batchnorm_update - and set up slim arg_scopes in ssd_meta_arch.py such that applies it to all batchnorm ops in the predict() method.

This centralizes the control of freezing and doing inplace batchnorm updates.

--
191620303 by Zhichao Lu:

Modifications to the preprocessor to support multiclass scores

PiperOrigin-RevId: 195269567

63054210

13 Apr, 2018 7 commits
- Refactor model_tpu_main.py files and move continuous eval loop into model_lib.py · a60dd985
  Zhichao Lu authored Apr 11, 2018
```
PiperOrigin-RevId: 192512429
```
  a60dd985
- Enforcing that all eval metric ops have valid python strings. · 2151c447
  Zhichao Lu authored Apr 11, 2018
```
PiperOrigin-RevId: 192485456
```
  2151c447
- Returning eval_on_train_input_fn from create_estimator_and_inputs(), rather... · 227f41e9
  Zhichao Lu authored Apr 10, 2018
```
Returning eval_on_train_input_fn from create_estimator_and_inputs(), rather than using train_input_fn in EVAL mode (which will still have data augmentation).

PiperOrigin-RevId: 192320460
```
  227f41e9
- Access TPUEstimator and CrossShardOptimizer from tf namesspace. · 7e810001
  Zhichao Lu authored Apr 09, 2018
```
PiperOrigin-RevId: 192226678
```
  7e810001
- Add learning rate summary in EVAL mode in model.py · bfd15ec1
  Zhichao Lu authored Apr 07, 2018
```
PiperOrigin-RevId: 192006190
```
  bfd15ec1
- Migrating away from Experiment class, as it is now deprecated. Also,... · 90cc9baa
  Zhichao Lu authored Apr 07, 2018
```
Migrating away from Experiment class, as it is now deprecated. Also, refactoring into a separate model library and binaries.

PiperOrigin-RevId: 192004845
```
  90cc9baa
- Add classification_loss and localiztion_loss metrics for TPU jobs. · 8deba73f
  Zhichao Lu authored Apr 06, 2018
```
PiperOrigin-RevId: 191957195
```
  8deba73f
03 Apr, 2018 6 commits
- Updating model to infer when labels are padded, rather than relying solely on... · e9f85e83
  Zhichao Lu authored Mar 29, 2018
```
Updating model to infer when labels are padded, rather than relying solely on the mode. This is necessary for evaluating on train data.

PiperOrigin-RevId: 190960173
```
  e9f85e83
- Removing flag defaults for train steps and eval steps. If not provided, will... · 056bea45
  Zhichao Lu authored Mar 27, 2018
```
Removing flag defaults for train steps and eval steps. If not provided, will revert to the values specified in the config.

PiperOrigin-RevId: 190706470
```
  056bea45
- Write groundtruth weights from input pipeline into model. · bb43ed96
  Zhichao Lu authored Mar 27, 2018
```
PiperOrigin-RevId: 190636417
```
  bb43ed96
- Minor change to remove redundant check for finetune_checkpoint_type. · bda5126e
  Zhichao Lu authored Mar 21, 2018
```
PiperOrigin-RevId: 189997094
```
  bda5126e
- Apply fix for image summary in eval metrics, so that eval on train works appropriately. · e46021a7
  Zhichao Lu authored Mar 20, 2018
```
PiperOrigin-RevId: 189815553
```
  e46021a7
- Enabling both train and eval image summaries. Note that eval summaries are not... · 6f1756bc
  Zhichao Lu authored Mar 19, 2018
```
Enabling both train and eval image summaries. Note that eval summaries are not created tf-learn environment. To get them to show up, added the summary image into the eval_metric_ops.

PiperOrigin-RevId: 189658259
```
  6f1756bc
22 Mar, 2018 1 commit

Internal changes for object detection. (#3656) · 001a2a61

pkulzc authored Mar 22, 2018

* Force cast of num_classes to integer

PiperOrigin-RevId: 188335318

* Updating config util to allow overwriting of cosine decay learning rates.

PiperOrigin-RevId: 188338852

* Make box_list_ops.py and box_list_ops_test.py work with C API enabled.

The C API has improved shape inference over the original Python
code. This causes some previously-working conds to fail. Switching to smart_cond fixes this.

Another effect of the improved shape inference is that one of the
failures tested gets caught earlier, so I modified the test to reflect
this.

PiperOrigin-RevId: 188409792

* Fix parallel event file writing issue.

Without this change, the event files might get corrupted when multiple evaluations are run in parallel.

PiperOrigin-RevId: 188502560

* Deprecating the boolean flag of from_detection_checkpoint.

Replace with a string field fine_tune_checkpoint_type to train_config to provide extensibility. The fine_tune_checkpoint_type can currently take value of `detection`, `classification`, or others when the restore_map is overwritten.

PiperOrigin-RevId: 188518685

* Automated g4 rollback of changelist 188502560

PiperOrigin-RevId: 188519969

* Introducing eval metrics specs for Coco Mask metrics. This allows metrics to be computed in tensorflow using the tf.learn Estimator.

PiperOrigin-RevId: 188528485

* Minor fix to make object_detection/metrics/coco_evaluation.py python3 compatible.

PiperOrigin-RevId: 188550683

* Updating eval_util to handle eval_metric_ops from multiple `DetectionEvaluator`s.

PiperOrigin-RevId: 188560474

* Allow tensor input for new_height and new_width for resize_image.

PiperOrigin-RevId: 188561908

* Fix typo in fine_tune_checkpoint_type name in trainer.

PiperOrigin-RevId: 188799033

* Adding mobilenet feature extractor to object detection.

PiperOrigin-RevId: 188916897

* Allow label maps to optionally contain an explicit background class with id zero.

PiperOrigin-RevId: 188951089

* Fix boundary conditions in random_pad_to_aspect_ratio to ensure that min_scale is always less than max_scale.

PiperOrigin-RevId: 189026868

* Fallback on from_detection_checkpoint option if fine_tune_checkpoint_type isn't set.

PiperOrigin-RevId: 189052833

* Add proper names for learning rate schedules so we don't see cryptic names on tensorboard.

PiperOrigin-RevId: 189069837

* Enforcing that all datasets are batched (and then unbatched in the model) with batch_size >= 1.

PiperOrigin-RevId: 189117178

* Adding regularization to total loss returned from DetectionModel.loss().

PiperOrigin-RevId: 189189123

* Standardize the names of loss scalars (for SSD, Faster R-CNN and R-FCN) in both training and eval so they can be compared on tensorboard.

Log localization and classification losses in evaluation.

PiperOrigin-RevId: 189189940

* Remove negative test from box list ops test.

PiperOrigin-RevId: 189229327

* Add an option to warmup learning rate in manual stepping schedule.

PiperOrigin-RevId: 189361039

* Replace tf.contrib.slim.tfexample_decoder.LookupTensor with object_detection.data_decoders.tf_example_decoder.LookupTensor.

PiperOrigin-RevId: 189388556

* Force regularization summary variables under specific family names.

PiperOrigin-RevId: 189393190

* Automated g4 rollback of changelist 188619139

PiperOrigin-RevId: 189396001

* Remove step 0 schedule since we do a hard check for it after cl/189361039

PiperOrigin-RevId: 189396697

* PiperOrigin-RevId: 189040463

* PiperOrigin-RevId: 189059229

* PiperOrigin-RevId: 189214402

* Force regularization summary variables under specific family names.

PiperOrigin-RevId: 189393190

* Automated g4 rollback of changelist 188619139

PiperOrigin-RevId: 189396001

* Make slim python3 compatible.

* Monir fixes.

* Add TargetAssignment summaries in a separate family.

PiperOrigin-RevId: 189407487

* 1. Setting `family` keyword arg prepends the summary names twice with the same name. Directly adding family suffix to the name gets rid of this problem.
2. Make sure the eval losses have the same name.

PiperOrigin-RevId: 189434618

* Minor fixes to make object detection tf 1.4 compatible.

PiperOrigin-RevId: 189437519

* Call the base of mobilenet_v1 feature extractor under the right arg scope and set batchnorm is_training based on the value passed in the constructor.

PiperOrigin-RevId: 189460890

* Automated g4 rollback of changelist 188409792

PiperOrigin-RevId: 189463882

* Update object detection syncing.

PiperOrigin-RevId: 189601955

* Add an option to warmup learning rate, hold it constant for a certain number of steps and cosine decay it.

PiperOrigin-RevId: 189606169

* Let the proposal feature extractor function in faster_rcnn meta architectures return the activations (end_points).

PiperOrigin-RevId: 189619301

* Fixed bug which caused masks to be mostly zeros (caused by detection_boxes being in absolute coordinates if scale_to_absolute=True.

PiperOrigin-RevId: 189641294

* Open sourcing Mobilenetv2 + SSDLite.

PiperOrigin-RevId: 189654520

* Remove unused files.

001a2a61

08 Mar, 2018 1 commit
- Allow model.py to be extended with custom model building functions. · c21d3c25
  Zhichao Lu authored Mar 05, 2018
```
PiperOrigin-RevId: 187941168
```
  c21d3c25
27 Feb, 2018 1 commit

Merged commit includes the following changes: · 78d5f8f8

Zhichao Lu authored Feb 27, 2018

187187978  by Zhichao Lu:

    Only updating hyperparameters if they have non-null values.

--
187097690  by Zhichao Lu:

    Rewrite some conditions a bit more clearly.

--
187085190  by Zhichao Lu:

    More informative error message.

--
186935376  by Zhichao Lu:

    Added option to evaluator.evaluate to use custom evaluator objects.

--
186808249  by Zhichao Lu:

    Fix documentation re: number of stages.

--
186775014  by Zhichao Lu:

    Change anchor generator interface to return a list of BoxLists containing anchors for different feature map layers.

--
186729028  by Zhichao Lu:

    Minor fixes to object detection.

--
186723716  by Zhichao Lu:

    Fix tf_example_decoder.py initailization issue.

--
186668505  by Zhichao Lu:

    Remove unused import.

--
186475361  by Zhichao Lu:

    Update the box predictor interface to return list of predictions - one from each feature map - instead of stacking them into one large tensor.

--
186410844  by Zhichao Lu:

    Fix PythonPath Dependencies.

--
186365384  by Zhichao Lu:

    Made some of the functions in exporter public so they can be reused.

--
186341438  by Zhichao Lu:

    Re-introducing check that label-map-path must be a valid (non-empty) string prior to overwriting pipeline config.

--
186036984  by Zhichao Lu:

    Adding default hyperparameters and allowing for overriding them via flags.

--
186026006  by Zhichao Lu:

    Strip `eval_` prefix from name argument give to TPUEstimator.evaluate since it adds the same prefix internally.

--
186016042  by Zhichao Lu:

    Add an option to evaluate models on training data.

--
185944986  by Zhichao Lu:

    let _update_label_map_path go through even if the path is empty

--
185860781  by Zhichao Lu:

    Add random normal initializer option to hyperparams builder.

    Scale the regression losses outside of the box encoder by adjusting huber loss delta and regression loss weight.

--
185846325  by Zhichao Lu:

    Add an option to normalize localization loss by the code size(number of box coordinates) in SSD Meta architecture.

--
185761217  by Zhichao Lu:

    Change multiscale_grid_anchor_generator to return anchors in normalized coordinates by default and add option to configure it.

    In SSD meta architecture, TargetAssigner operates in normalized coordinate space (i.e, groundtruth boxes are in normalized coordinates) hence we need the option to generate anchors in normalized coordinates.

--
185747733  by Zhichao Lu:

    Change the smooth L1 localization implementationt to use tf.losses.huber_loss and expose the delta parameter in the proto.

--
185715309  by Zhichao Lu:

    Obviates the need for prepadding on mobilenet v1 and v2 for fully convolutional models.

--
185685695  by Zhichao Lu:

    Fix manual stepping schedule to return first rate when there are no boundaries

--
185621650  by Zhichao Lu:

    Added target assigner proto for configuring negative class weights.

--

PiperOrigin-RevId: 187187978

78d5f8f8

10 Feb, 2018 1 commit

Merged commit includes the following changes: · 1efe98bb

Zhichao Lu authored Feb 09, 2018

185215255  by Zhichao Lu:

    Stop populating image/object/class/text field when generating COCO tf record.

--
185213306  by Zhichao Lu:

    Use the params batch size and not the one from train_config in input_fn

--
185209081  by Zhichao Lu:

    Handle the case when there are no ground-truth masks for an image.

--
185195531  by Zhichao Lu:

    Remove unstack and stack operations on features from third_party/object_detection/model.py.

--
185195017  by Zhichao Lu:

    Matrix multiplication based gather op implementation.

--
185187744  by Zhichao Lu:

    Fix eval_util minor issue.

--
185098733  by Zhichao Lu:

    Internal change

185076656  by Zhichao Lu:

    Increment the amount of boxes for coco17.

--
185074199  by Zhichao Lu:

    Add config for SSD Resnet50 v1 with FPN.

--
185060199  by Zhichao Lu:

    Fix a bug in clear_detections.
    This method set detection_keys to an empty dictionary instead of an empty set. I've refactored so that this method and the constructor use the same code path.

--
185031359  by Zhichao Lu:

    Eval TPU trained models continuously.

--
185016591  by Zhichao Lu:

    Use TPUEstimatorSpec for TPU

--
185013651  by Zhichao Lu:

    Add PreprocessorCache to record and duplicate augmentations.

--
184921763  by Zhichao Lu:

    Minor fixes for object detection.

--
184920610  by Zhichao Lu:

    Adds a model builder test for "embedded_ssd_mobilenet_v1" feature extractor.

--
184919284  by Zhichao Lu:

    Added unit tests for TPU, with optional training / eval.

--
184915910  by Zhichao Lu:

    Update third_party g3 doc with Mask RCNN detection models.

--
184914085  by Zhichao Lu:

    Slight change to WeightSharedConvolutionalBoxPredictor implementation to make things match more closely with RetinaNet.  Specifically we now construct the box encoding and class predictor towers separately rather than having them share weights until penultimate layer.

--
184913786  by Zhichao Lu:

    Plumbs SSD Resnet V1 with FPN models into model builder.

--
184910030  by Zhichao Lu:

    Add coco metrics to evaluator.

--
184897758  by Zhichao Lu:

    Merge changes from github.

--
184888736  by Zhichao Lu:

    Ensure groundtruth_weights are always 1-D.

--
184887256  by Zhichao Lu:

    Introduce an option to add summaries in the model so it can be turned off when necessary.

--
184865559  by Zhichao Lu:

    Updating inputs so that a dictionary of tensors is returned from input_fn. Moving unbatch/unpad to model.py.
    Also removing source_id key from features dictionary, and replacing with an integer hash.

--
184859205  by Zhichao Lu:

    This CL is trying to hide those differences by making the default settings work with the public code.

--
184769779  by Zhichao Lu:

    Pass groundtruth weights into ssd meta architecture all the way to target assigner.

    This will allow training ssd models with padded groundtruth tensors.

--
184767117  by Zhichao Lu:

    * Add `params` arg to make all input fns work with TPUEstimator
    * Add --master
    * Output eval results

--
184766244  by Zhichao Lu:

    Update create_coco_tf_record to include category indices

--
184752937  by Zhichao Lu:

    Create a third_party version of TPU compatible mobilenet_v2_focal_loss coco config.

--
184750174  by Zhichao Lu:

    A few small fixes for multiscale anchor generator and a test.

--
184746581  by Zhichao Lu:

    Update jupyter notebook to show mask if provided by model.

--
184728646  by Zhichao Lu:

    Adding a few more tests to make sure decoding with/without label maps performs as expected.

--
184624154  by Zhichao Lu:

    Add an object detection binary for TPU.

--
184622118  by Zhichao Lu:

    Batch, transform, and unbatch in the tflearn interface.

--
184595064  by Zhichao Lu:

    Add support for training grayscale models.

--
184532026  by Zhichao Lu:

    Change dataset_builder.build to perform optional batching using tf.data.Dataset API

--
184330239  by Zhichao Lu:

    Add augment_input_data and transform_input_data helper functions to third_party/tensorflow_models/object_detection/inputs.py

--
184328681  by Zhichao Lu:

    Use an internal rgb to gray method that can be quantized.

--
184327909  by Zhichao Lu:

    Helper function to return padding shapes to use with Dataset.padded_batch.

--
184326291  by Zhichao Lu:

    Added decode_func for specialized decoding.

--
184314676  by Zhichao Lu:

    Add unstack_batch method to inputs.py.

    This will enable us to convert batched tensors to lists of tensors. This is compatible with OD API that consumes groundtruth batch as a list of tensors.

--
184281269  by Zhichao Lu:

    Internal test target changes.

--
184192851  by Zhichao Lu:

    Adding `Estimator` interface for object detection.

--
184187885  by Zhichao Lu:

    Add config_util functions to help with input pipeline.

    1. function to return expected shapes from the resizer config
    2. function to extract image_resizer_config from model_config.

--
184139892  by Zhichao Lu:

    Adding support for depthwise SSD (ssd-lite) and depthwise box predictions.

--
184089891  by Zhichao Lu:

    Fix third_party faster rcnn resnet101 coco config.

--
184083378  by Zhichao Lu:

    In the case when there is no object/weights field in tf.Example proto, return a default weight of 1.0 for all boxes.

--

PiperOrigin-RevId: 185215255

1efe98bb