Commits · 02a9969e94feb51966f9bacddc1836d811f8ce69 · ModelZoo / ResNet50_tensorflow

01 Aug, 2018 1 commit

Refactor object detection box predictors and fix some issues with model_main. (#4965) · 02a9969e

pkulzc authored Aug 01, 2018

* Merged commit includes the following changes:
206852642 by Zhichao Lu:

Build the balanced_positive_negative_sampler in the model builder for FasterRCNN. Also adds an option to use the static implementation of the sampler.

--
206803260 by Zhichao Lu:

Fixes a misplaced argument in resnet fpn feature extractor.

--
206682736 by Zhichao Lu:

This CL modifies the SSD meta architecture to support both Slim-based and Keras-based box predictors, and begins preparation for Keras box predictor support in the other meta architectures.

Concretely, this CL adds a new `KerasBoxPredictor` base class and makes the meta architectures appropriately call whichever box predictors they are using.

We can switch the non-ssd meta architectures to fully support Keras box predictors once the Keras Convolutional Box Predictor CL is submitted.

--
206669634 by Zhichao Lu:

Adds an alternate method for balanced positive negative sampler using static shapes.

--
206643278 by Zhichao Lu:

This CL adds a Keras layer hyperparameter configuration object to the hyperparams_builder.

It automatically converts from Slim layer hyperparameter configs to Keras layer hyperparameters. Namely, it:
- Builds Keras initializers/regularizers instead of Slim ones
- sets weights_regularizer/initializer to kernel_regularizer/initializer
- converts batchnorm decay to momentum
- converts Slim l2 regularizer weights to the equivalent Keras l2 weights

This will be used in the conversion of object detection feature extractors & box predictors to newer Tensorflow APIs.
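
As a rough illustration of the conversions listed above, here is a minimal sketch in plain Python. The function name `slim_to_keras_hyperparams` and the dictionary keys are hypothetical and are not the hyperparams_builder API. The factor of 0.5 assumes that Slim's l2 regularizer computes `scale * sum(w**2) / 2` while the Keras one computes `l2 * sum(w**2)`. Batch norm decay is assumed to carry over to Keras momentum unchanged.

```python
def slim_to_keras_hyperparams(slim_params):
    """Hypothetical helper: map Slim-style hyperparameters to Keras-style ones."""
    keras_params = {}
    # weights_initializer/regularizer become kernel_initializer/regularizer.
    if "weights_initializer" in slim_params:
        keras_params["kernel_initializer"] = slim_params["weights_initializer"]
    if "weights_l2_scale" in slim_params:
        # Assumption: Slim l2 = scale * sum(w**2) / 2, Keras l2 = l2 * sum(w**2),
        # so the equivalent Keras weight is half the Slim scale.
        keras_params["kernel_l2"] = 0.5 * slim_params["weights_l2_scale"]
    # Assumption: Slim batch norm `decay` and Keras `momentum` name the same
    # moving-average coefficient, so the value carries over unchanged.
    if "batch_norm_decay" in slim_params:
        keras_params["batch_norm_momentum"] = slim_params["batch_norm_decay"]
    return keras_params


converted = slim_to_keras_hyperparams({
    "weights_initializer": "truncated_normal",
    "weights_l2_scale": 0.0004,
    "batch_norm_decay": 0.997,
})
```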

--
206611681 by Zhichao Lu:

Internal changes.

--
206591619 by Zhichao Lu:

Clip to the expected shape when the input tensors are larger than the expected padded static shape.
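
A minimal sketch of the pad-or-clip idea on a plain Python list, assuming a one-dimensional input. The name `pad_or_clip` is hypothetical and used only for illustration; it is not necessarily the function changed by this CL.

```python
def pad_or_clip(values, length, pad_value=0):
    """Return a list of exactly `length` items: clip extras, pad shortfalls."""
    clipped = list(values[:length])  # drop anything beyond the static length
    return clipped + [pad_value] * (length - len(clipped))


padded = pad_or_clip([1, 2], 4)            # shorter input is padded
clipped = pad_or_clip([1, 2, 3, 4, 5], 4)  # longer input is clipped
```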

--
206517644 by Zhichao Lu:

Make MultiscaleGridAnchorGenerator more consistent with MultipleGridAnchorGenerator.

--
206415624 by Zhichao Lu:

Make the hardcoded feature pyramid network (FPN) levels configurable for both SSD
Resnet and SSD Mobilenet.

--
206398204 by Zhichao Lu:

This CL modifies the SSD meta architecture to support both Slim-based and Keras-based feature extractors.

This allows us to begin the conversion of object detection to newer Tensorflow APIs.

--
206213448 by Zhichao Lu:

Adding a method to compute the expected classification loss by background/foreground weighting.

--
206204232 by Zhichao Lu:

Adding the keypoint head to the Mask RCNN pipeline.

--
206200352 by Zhichao Lu:

- Create Faster R-CNN target assigner in the model builder. This allows configuring matchers in Target assigner to use TPU compatible ops (tf.gather in this case) without any change in meta architecture.
- As a positive side effect of the refactoring, we can now re-use a single target assigner for all of the second stage heads in Faster R-CNN.

--
206178206 by Zhichao Lu:

Force ssd feature extractor builder to use keyword arguments so values won't be passed to wrong arguments.
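
To illustrate why keyword arguments help here, the sketch below uses Python 3 keyword-only parameters. The function and parameter names are hypothetical, and the actual builder may enforce keywords differently (for example by passing a dict of kwargs).

```python
def build_feature_extractor(*, depth_multiplier, min_depth, pad_to_multiple):
    """Hypothetical builder; the bare `*` makes every parameter keyword-only."""
    return {
        "depth_multiplier": depth_multiplier,
        "min_depth": min_depth,
        "pad_to_multiple": pad_to_multiple,
    }


config = build_feature_extractor(
    depth_multiplier=1.0, min_depth=16, pad_to_multiple=32)

try:
    # Positional values could silently land in the wrong slot; here they fail.
    build_feature_extractor(1.0, 32, 16)
    positional_call_rejected = False
except TypeError:
    positional_call_rejected = True
```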

--
206168297 by Zhichao Lu:

Updating exporter to use freeze_graph.freeze_graph_with_def_protos rather than a homegrown version.

--
206080748 by Zhichao Lu:

Merge external contributions.

--
206074460 by Zhichao Lu:

Update the preprocessor to apply temperature and softmax to the multiclass scores on read.
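
A small sketch of temperature-scaled softmax, assuming the temperature divides the raw scores before the softmax is taken. The function name is hypothetical and the preprocessor's actual implementation may differ.

```python
import math


def softmax_with_temperature(scores, temperature=1.0):
    """Softmax over scores / temperature (assumed form of the scaling)."""
    scaled = [s / temperature for s in scores]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


sharp = softmax_with_temperature([2.0, 1.0, 0.0], temperature=0.5)
flat = softmax_with_temperature([2.0, 1.0, 0.0], temperature=5.0)
```

Under this assumption, a lower temperature concentrates probability on the top score and a higher temperature flattens the distribution.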

--
205960802 by Zhichao Lu:

Fixing a bug in hierarchical label expansion script.

--
205944686 by Zhichao Lu:

Update exporter to support exporting quantized model.

--
205912529 by Zhichao Lu:

Add a two stage matcher to allow for thresholding by one criterion and then argmaxing on the other.
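
The idea can be sketched on plain nested lists. The conventions here are assumptions made for illustration: rows are groundtruth boxes, columns are anchors, the threshold is inclusive, and -1 marks an unmatched column. The name `two_stage_match` is hypothetical.

```python
def two_stage_match(threshold_scores, argmax_scores, threshold):
    """For each column, pick the row with the best argmax score among rows
    whose threshold score is at least `threshold`; -1 means no match."""
    num_rows = len(threshold_scores)
    num_cols = len(threshold_scores[0]) if num_rows else 0
    matches = []
    for col in range(num_cols):
        # Stage 1: keep only rows that pass the threshold criterion.
        candidates = [row for row in range(num_rows)
                      if threshold_scores[row][col] >= threshold]
        if candidates:
            # Stage 2: argmax over the second criterion.
            matches.append(max(candidates, key=lambda r: argmax_scores[r][col]))
        else:
            matches.append(-1)
    return matches


iou = [[0.6, 0.2, 0.7],
       [0.7, 0.1, 0.3]]
quality = [[0.9, 0.8, 0.1],
           [0.5, 0.9, 0.9]]
matches = two_stage_match(iou, quality, threshold=0.5)
```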

--
205909017 by Zhichao Lu:

Add test for grayscale image_resizer

--
205892801 by Zhichao Lu:

Add flag to decide whether to apply batch norm to conv layers of weight shared box predictor.

--
205824449 by Zhichao Lu:

Make sure that by default the mask rcnn box predictor predicts 2 stages.

--
205730139 by Zhichao Lu:

Updating warning message to be more explicit about variable size mismatch.

--
205696992 by Zhichao Lu:

Remove utils/ops.py's dependency on core/box_list_ops.py. This will allow re-using TPU compatible ops from utils/ops.py in core/box_list_ops.py.

--
205696867 by Zhichao Lu:

Refactoring the mask rcnn predictor so that each head is in a separate file.
This CL lets us add new heads to mask rcnn more easily in the future.

--
205492073 by Zhichao Lu:

Refactor R-FCN box predictor to be TPU compliant.

- Change utils/ops.py:position_sensitive_crop_regions to operate on single image and set of boxes without `box_ind`
- Add a batch version that operates on batches of images and batches of boxes.
- Refactor R-FCN box predictor to use the batched version of position sensitive crop regions.

--
205453567 by Zhichao Lu:

Fix a bug where the inference graph cannot be exported when the write_inference_graph flag is True.

--
205316039 by Zhichao Lu:

Changing input tensor name.

--
205256307 by Zhichao Lu:

Fix model zoo links for quantized model.

--
205164432 by Zhichao Lu:

Fixes eval error when label map contains non-ascii characters.

--
205129842 by Zhichao Lu:

Adds an option to clip the anchors to the window size without filtering the overlapped boxes in Faster-RCNN.
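
A minimal sketch of clipping without filtering, assuming boxes and the window are given as `[ymin, xmin, ymax, xmax]`. The function name is hypothetical. Every anchor is kept, including ones that collapse to zero area after clipping; reading "without filtering" this way is an assumption.

```python
def clip_to_window(boxes, window):
    """Clip [ymin, xmin, ymax, xmax] boxes to `window` without dropping any."""
    win_ymin, win_xmin, win_ymax, win_xmax = window
    clipped = []
    for ymin, xmin, ymax, xmax in boxes:
        clipped.append([
            min(max(ymin, win_ymin), win_ymax),
            min(max(xmin, win_xmin), win_xmax),
            min(max(ymax, win_ymin), win_ymax),
            min(max(xmax, win_xmin), win_xmax),
        ])
    return clipped


anchors = [[-10, -10, 50, 50], [120, 120, 150, 150]]
clipped_anchors = clip_to_window(anchors, window=[0, 0, 100, 100])
```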

--
205094863 by Zhichao Lu:

Update the label map util to allow the option of adding a background class and filling in gaps in the label map. This is useful for multiclass scores, which require a complete label map with an explicit background label.
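
The sketch below shows one way such gap filling could work on a plain name-to-id dict. The function name, the `background` key at id 0, and the use of the id's string form as a placeholder name are all assumptions made for illustration.

```python
def fill_in_gaps_and_background(label_map):
    """Return a copy of a name->id dict with id 0 and any missing ids added."""
    ids = set(label_map.values())
    filled = dict(label_map)
    if 0 not in ids:
        filled["background"] = 0  # explicit background label (assumed id 0)
    for missing_id in range(1, max(ids, default=0)):
        if missing_id not in ids:
            filled[str(missing_id)] = missing_id  # placeholder name
    return filled


complete = fill_in_gaps_and_background({"cat": 1, "dog": 3})
```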

--
204989032 by Zhichao Lu:

Add tf.prof support to exporter.

--
204825267 by Zhichao Lu:

Modify mask rcnn box predictor tests for TPU compatibility.

--
204778749 by Zhichao Lu:

Remove score filtering from postprocessing.py and rely on filtering logic in tf.image.non_max_suppression

--
204775818 by Zhichao Lu:

Python3 fixes for object_detection.

--
204745920 by Zhichao Lu:

Object Detection Dataset visualization tool (documentation).

--
204686993 by Zhichao Lu:

Internal changes.

--
204559667 by Zhichao Lu:

Refactor box_predictor.py into multiple files.
The abstract base class remains in object_detection/core. The other classes have each moved to a separate file in object_detection/predictors.

--
204552847 by Zhichao Lu:

Update blog post link.

--
204508028 by Zhichao Lu:

Bump down the batch size to 1024 to be a bit more tolerant to OOM and double the number of iterations. This job still converges to 20.5 mAP in 3 hours.

PiperOrigin-RevId: 206852642

* Add original post-processing back.

02a9969e

02 Jul, 2018 1 commit

Open Images Challenge 2018 tools, minor fixes and refactors. (#4661) · 32e7d660

pkulzc authored Jul 02, 2018

* Merged commit includes the following changes:
202804536 by Zhichao Lu:

Return tf.data.Dataset from input_fn that goes into the estimator and use PER_HOST_V2 option for tpu input pipeline config.

This change shaves off 100ms per step, reducing total training time by 25 minutes for ssd mobilenet v1 (15k steps to convergence).
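
The 25 minute figure follows directly from the two numbers quoted in the message, as this short check shows (variable names are illustrative only):

```python
ms_saved_per_step = 100        # 100 ms, from the commit message
steps_to_convergence = 15_000  # 15k steps, from the commit message

total_ms_saved = ms_saved_per_step * steps_to_convergence
total_minutes_saved = total_ms_saved / 1000 / 60
```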

--
202769340 by Zhichao Lu:

Adding as_matrix() transformation for image-level labels.

--
202768721 by Zhichao Lu:

Challenge evaluation protocol modification: adding labelmaps creation.

--
202750966 by Zhichao Lu:

Add the explicit names to two output nodes.

--
202732783 by Zhichao Lu:

Enforcing that batch size is 1 for evaluation, and no original images are retained during evaluation when use_tpu=False (to avoid dynamic shapes).

--
202425430 by Zhichao Lu:

Refactor input pipeline to improve performance.

--
202406389 by Zhichao Lu:

Only check the validity of `warmup_learning_rate` if it will be used.
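
A sketch of what a conditional check like this might look like. The function name is hypothetical, and two things are assumed: that the warmup rate is "used" only when `warmup_steps > 0`, and that validity means the base learning rate is not smaller than the warmup rate.

```python
def check_warmup_config(learning_rate_base, warmup_learning_rate, warmup_steps):
    """Hypothetical check: validate warmup_learning_rate only when it is used."""
    if warmup_steps > 0 and learning_rate_base < warmup_learning_rate:
        raise ValueError(
            "learning_rate_base must be at least warmup_learning_rate.")


# No warmup steps: the warmup rate is ignored, so no error is raised.
check_warmup_config(0.01, warmup_learning_rate=0.1, warmup_steps=0)

try:
    check_warmup_config(0.01, warmup_learning_rate=0.1, warmup_steps=100)
    rejected = False
except ValueError:
    rejected = True
```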

--
202330450 by Zhichao Lu:

Adding the description of the flag input_image_label_annotations_csv to add
image-level labels to tf.Example.

--
202029012 by Zhichao Lu:

Enabling displaying relationship name in the final metrics output.

--
202024010 by Zhichao Lu:

Update to the public README.

--
201999677 by Zhichao Lu:

Fixing the way negative labels are handled in VRD evaluation.

--
201962313 by Zhichao Lu:

Fix a bug in resize_to_range.

--
201808488 by Zhichao Lu:

Update ssd_inception_v2_pets.config to use right filename of pets dataset tf records.

--
201779225 by Zhichao Lu:

Update object detection API installation doc

--
201766518 by Zhichao Lu:

Add shell script to create pycocotools package for CMLE.

--
201722377 by Zhichao Lu:

Removes verified_labels field and uses groundtruth_image_classes field instead.

--
201616819 by Zhichao Lu:

Disable eval_on_tpu since eval_metrics is not setup to execute on TPU.
Do not use run_config.task_type to switch tpu mode for EVAL,
since that won't work in unit test.
Expand unit test to verify that the same instantiation of the Estimator can independently disable eval on TPU while training is enabled on TPU.

--
201524716 by Zhichao Lu:

Disable exporting the model to TPU, since inference is not compatible with TPU.
Add GOOGLE_INTERNAL support in object detection copy.bara.sky

--
201453347 by Zhichao Lu:

Fixing bug when evaluating the quantized model.

--
200795826 by Zhichao Lu:

Fixing parsing bug: image-level labels are parsed as tuples instead of numpy
array.

--
200746134 by Zhichao Lu:

Adding image_class_text and image_class_label fields into tf_example_decoder.py

--
200743003 by Zhichao Lu:

Changes to model_main.py and model_tpu_main to enable training and continuous eval.

--
200736324 by Zhichao Lu:

Replace deprecated squeeze_dims argument.

--
200730072 by Zhichao Lu:

Make detections only during predict and eval mode while creating model function

--
200729699 by Zhichao Lu:

Minor correction to internal documentation (definition of Huber loss)

--
200727142 by Zhichao Lu:

Add command line parsing as a set of flags using argparse and add header to the
resulting file.

--
200726169 by Zhichao Lu:

A tutorial on running evaluation for the Open Images Challenge 2018.

--
200665093 by Zhichao Lu:

Cleanup on variables_helper_test.py.

--
200652145 by Zhichao Lu:

Add an option to write (non-frozen) graph when exporting inference graph.

--
200573810 by Zhichao Lu:

Update ssd_mobilenet_v1_coco and ssd_inception_v2_coco download links to point to a newer version.

--
200498014 by Zhichao Lu:

Add test for groundtruth mask resizing.

--
200453245 by Zhichao Lu:

Cleaning up exporting_models.md along with exporting scripts

--
200311747 by Zhichao Lu:

Resize groundtruth mask to match the size of the original image.

--
200287269 by Zhichao Lu:

Adds an option to use a custom MatMul-based crop_and_resize op as an alternative to the TF op in Faster-RCNN.

--
200127859 by Zhichao Lu:

Updating the instructions to run locally with new binary. Also updating pets configs since file path naming has changed.

--
200127044 by Zhichao Lu:

A simpler evaluation util to compute Open Images Challenge
2018 metric (object detection track).

--
200124019 by Zhichao Lu:

Freshening up configuring_jobs.md

--
200086825 by Zhichao Lu:

Make merge_multiple_label_boxes work for ssd model.

--
199843258 by Zhichao Lu:

Allows inconsistent feature channels to be compatible with WeightSharedConvolutionalBoxPredictor.

--
199676082 by Zhichao Lu:

Enable an override for `InputReader.shuffle` for object detection pipelines.

--
199599212 by Zhichao Lu:

Markdown fixes.

--
199535432 by Zhichao Lu:

Pass num_additional_channels to tf.example decoder in predict_input_fn.

--
199399439 by Zhichao Lu:

Adding `num_additional_channels` field to specify how many additional channels to use in the model.

PiperOrigin-RevId: 202804536

* Add original model builder and docs back.

32e7d660

03 Apr, 2018 1 commit

Provide option to perform in-place batch norm updates for ssd feature extractors. · d2c5bfac

Zhichao Lu authored Mar 27, 2018

PiperOrigin-RevId: 190688309

d2c5bfac

01 Feb, 2018 1 commit

Merged commit includes the following changes: · 7a9934df

Zhichao Lu authored Jan 31, 2018

184048729  by Zhichao Lu:

    Modify target_assigner so that it creates regression targets taking keypoints into account.

--
184027183  by Zhichao Lu:

    Resnet V1 FPN based feature extractors for SSD meta architecture in Object Detection V2 API.

--
184004730  by Zhichao Lu:

    Expose a lever to override the configured mask_type.

--
183933113  by Zhichao Lu:

    Weight shared convolutional box predictor as described in https://arxiv.org/abs/1708.02002

--
183929669  by Zhichao Lu:

    Expanding box list operations for future data augmentations.

--
183916792  by Zhichao Lu:

    Fix unrecognized assertion function in tests.

--
183906851  by Zhichao Lu:

    - Change ssd meta architecture to use regression weights to compute loss normalizer.

--
183871003  by Zhichao Lu:

    Fix config_util_test wrong dependency.

--
183782120  by Zhichao Lu:

    Add __init__ file to third_party directories.

--
183779109  by Zhichao Lu:

    Setup regular version s...

7a9934df

27 Oct, 2017 1 commit

update protos. · 9adf0242

Vivek Rathod authored Oct 27, 2017

9adf0242

21 Sep, 2017 1 commit

Move the research models into a research subfolder (#2430) · f87a58cd

Neal Wu authored Sep 21, 2017

f87a58cd

15 Jun, 2017 1 commit

Add Tensorflow Object Detection API. (#1561) · a4944a57

derekjchow authored Jun 14, 2017

For details see our paper:
"Speed/accuracy trade-offs for modern convolutional object detectors."
Huang J, Rathod V, Sun C, Zhu M, Korattikara A, Fathi A, Fischer I,
Wojna Z, Song Y, Guadarrama S, Murphy K, CVPR 2017
https://arxiv.org/abs/1611.10012

a4944a57