  1. 28 Apr, 2022 1 commit
  2. 27 Apr, 2022 1 commit
  3. 26 Apr, 2022 1 commit
  4. 13 Oct, 2021 1 commit
  5. 24 Jun, 2021 1 commit
  6. 02 Apr, 2021 2 commits
  7. 14 Sep, 2020 1 commit
  8. 03 Sep, 2020 1 commit
  9. 17 Jul, 2020 3 commits
  10. 13 Jul, 2020 2 commits
  11. 10 Jul, 2020 2 commits
  12. 26 Jun, 2020 1 commit
    • Merged commit includes the following changes: (#8741) · 6ef140dc
      vivek rathod authored
      318545448  by jonathanhuang:
      
          Modifies visualization code in TF2 evaluation loop so that we don't write out image summaries to disk for every single image.  This change will reduce summary file sizes by ~2 orders of magnitude on average and speed up evaluation cycles (20 minutes per COCO eval cycle vs 2 hours for RetinaNet).
      
      --
      318514741  by sbeery:
      
          Adding link to the blog post
      
      --
      
      PiperOrigin-RevId: 318545448
  13. 18 Jun, 2020 1 commit
  14. 26 May, 2020 1 commit
    • Release MobileDet code and model, and require tf_slim installation for OD API. (#8562) · 451906e4
      pkulzc authored
      
      
      * Merged commit includes the following changes:
      311933687  by Sergio Guadarrama:
      
          Removes spurious use of tf.compat.v2, which results in spurious tf.compat.v1.compat.v2. Adds basic test to nasnet_utils.
          Replaces all remaining import tensorflow as tf with import tensorflow.compat.v1 as tf
      
      --
      311766063  by Sergio Guadarrama:
      
          Removes explicit tf.compat.v1 in all call sites (we already import tf.compat.v1, so this code was doing tf.compat.v1.compat.v1). The existing code worked in the latest version of TensorFlow, 2.2 (and in 1.15), but not in 1.14 or 2.0.0a; this CL fixes it.
      
      --
      311624958  by Sergio Guadarrama:
      
          Updates README that doesn't render properly in github documentation
      
      --
      310980959  by Sergio Guadarrama:
      
          Moves research_models/slim off tf.contrib.slim/layers/framework to tf_slim
      
      --
      310263156  by Sergio Guadarrama:
      
          Adds model breakdown for MobilenetV3
      
      --
      308640516  by Sergio Guadarrama:
      
          Internal change
      
      308244396  by Sergio Guadarrama:
      
          GroupNormalization support for MobilenetV3.
      
      --
      307475800  by Sergio Guadarrama:
      
          Internal change
      
      --
      302077708  by Sergio Guadarrama:
      
          Remove `disable_tf2` behavior from slim py_library targets
      
      --
      301208453  by Sergio Guadarrama:
      
          Automated refactoring to make code Python 3 compatible.
      
      --
      300816672  by Sergio Guadarrama:
      
          Internal change
      
      299433840  by Sergio Guadarrama:
      
          Internal change
      
      299221609  by Sergio Guadarrama:
      
          Explicitly disable Tensorflow v2 behaviors for all TF1.x binaries and tests
      
      --
      299179617  by Sergio Guadarrama:
      
          Internal change
      
      299040784  by Sergio Guadarrama:
      
          Internal change
      
      299036699  by Sergio Guadarrama:
      
          Internal change
      
      298736510  by Sergio Guadarrama:
      
          Internal change
      
      298732599  by Sergio Guadarrama:
      
          Internal change
      
      298729507  by Sergio Guadarrama:
      
          Internal change
      
      298253328  by Sergio Guadarrama:
      
          Internal change
      
      297788346  by Sergio Guadarrama:
      
          Internal change
      
      297785278  by Sergio Guadarrama:
      
          Internal change
      
      297783127  by Sergio Guadarrama:
      
          Internal change
      
      297725870  by Sergio Guadarrama:
      
          Internal change
      
      297721811  by Sergio Guadarrama:
      
          Internal change
      
      297711347  by Sergio Guadarrama:
      
          Internal change
      
      297708059  by Sergio Guadarrama:
      
          Internal change
      
      297701831  by Sergio Guadarrama:
      
          Internal change
      
      297700038  by Sergio Guadarrama:
      
          Internal change
      
      297670468  by Sergio Guadarrama:
      
          Internal change.
      
      --
      297350326  by Sergio Guadarrama:
      
          Explicitly replace "import tensorflow" with "tensorflow.compat.v1" for TF2.x migration
      
      --
      297201668  by Sergio Guadarrama:
      
          Explicitly replace "import tensorflow" with "tensorflow.compat.v1" for TF2.x migration
      
      --
      294483372  by Sergio Guadarrama:
      
          Internal change
      
      PiperOrigin-RevId: 311933687
      
      * Merged commit includes the following changes:
      312578615  by Menglong Zhu:
      
          Modify the LSTM feature extractors to be python 3 compatible.
      
      --
      311264357  by Menglong Zhu:
      
          Removes contrib.slim
      
      --
      308957207  by Menglong Zhu:
      
          Automated refactoring to make code Python 3 compatible.
      
      --
      306976470  by yongzhe:
      
          Internal change
      
      306777559  by Menglong Zhu:
      
          Internal change
      
      --
      299232507  by lzyuan:
      
          Internal update.
      
      --
      299221735  by lzyuan:
      
          Add small epsilon on max_range for quantize_op to prevent range collapse.
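A minimal sketch of why a collapsed range is a problem, for illustration only. The commit does not show the actual quantize_op, so the function name `quantize`, its parameters, and the epsilon value below are assumptions. When `min_range == max_range` the quantization scale is zero and the division fails; nudging `max_range` up by a small epsilon keeps the scale positive.

```python
def quantize(values, min_range, max_range, num_bits=8, epsilon=1e-6):
    """Uniformly quantize floats to integers in [0, 2**num_bits - 1].

    Hypothetical helper: not the OD API quantize_op.
    """
    # Assumed fix: widen the range slightly so it can never collapse to zero.
    max_range = max_range + epsilon
    levels = (1 << num_bits) - 1
    scale = (max_range - min_range) / levels  # > 0 thanks to epsilon
    quantized = []
    for value in values:
        clipped = min(max(value, min_range), max_range)
        quantized.append(round((clipped - min_range) / scale))
    return quantized
```

Without the epsilon, `quantize([1.0, 1.0], 1.0, 1.0)` would divide by zero; with it, the call returns a list of zeros.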
      
      --
      
      PiperOrigin-RevId: 312578615
      
      * Merged commit includes the following changes:
      310447280  by lzc:
      
          Internal changes.
      
      --
      
      PiperOrigin-RevId: 310447280
      Co-authored-by: Sergio Guadarrama <sguada@google.com>
      Co-authored-by: Menglong Zhu <menglong@google.com>
  15. 12 May, 2020 1 commit
    • Open source MnasFPN and minor fixes to OD API (#8484) · 8518d053
      pkulzc authored
      310447280  by lzc:
      
          Internal change
      
      310420845  by Zhichao Lu:
      
          Open source the internal Context RCNN code.
      
      --
      310362339  by Zhichao Lu:
      
          Internal change
      
      310259448  by lzc:
      
          Update required TF version for OD API.
      
      --
      310252159  by Zhichao Lu:
      
          Port patch_ops_test to TF1/TF2 as TPUs.
      
      --
      310247180  by Zhichao Lu:
      
          Ignore keypoint heatmap loss in the regions/bounding boxes with target keypoint
          class but no valid keypoint annotations.
      
      --
      310178294  by Zhichao Lu:
      
          Opensource MnasFPN
          https://arxiv.org/abs/1912.01106
      
      --
      310094222  by lzc:
      
          Internal changes.
      
      --
      310085250  by lzc:
      
          Internal Change.
      
      --
      310016447  by huizhongc:
      
          Remove unrecognized classes from labeled_classes.
      
      --
      310009470  by rathodv:
      
          Mark batcher.py as TF1 only.
      
      --
      310001984  by rathodv:
      
          Update core/preprocessor.py to be compatible with TF1/TF2.
      
      --
      309455035  by Zhi...
  16. 24 Apr, 2020 1 commit
  17. 13 Apr, 2020 1 commit
    • Updated README files of research models (#8390) · d466d4e6
      Jaeyoun Kim authored
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Create README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      No Maintenance Intended
      
      * Update README.md
      
      TensorFlow Requirement: 1.x
      
      * Update README.md
      
      TensorFlow Requirement: 1.x
      
      * Update README.md
      
      TensorFlow Requirement: 1.x
      
      * Update README.md
      
      TensorFlow Requirement: 1.x
      
      * Update README.md
      
      TensorFlow Requirement: 1.x
      
      * Update README.md
      
      TensorFlow Requirement: 1.x
      
      * Update README.md
      
      TensorFlow Requirement: 1.x
      
      * Update README.md
      
      TensorFlow Requirement: 1.x
      
      * Update README.md
      
      TensorFlow Requirement: 1.x
  18. 13 Nov, 2019 1 commit
    • Merged commit includes the following changes: (#7800) · b968a6ce
      Mark Sandler authored
      280142968  by Zhichao Lu:
      
          Opensource MobilenetEdgeTPU + ssdlite into third-party object detection APIs on EdgeTPU.
      
      --
      280134001  by Zhichao Lu:
      
          Adds MobilenetEdgeTpu + ssdlite into internal object detection APIs on EdgeTPU.
      
      --
      278941778  by Zhichao Lu:
      
          Add support for fixed input shapes for 'encoded_image_string_tensor' and 'tf_example' inputs.
      
      --
      278933274  by Zhichao Lu:
      
          Adding foolproof check to avoid using 1x1 depthwise conv op.
      
      --
      278762192  by Zhichao Lu:
      
          Ensure correct number of iterations after training resumes.
      
      --
      278746440  by Zhichao Lu:
      
          Internal change.
      
      --
      278006953  by Zhichao Lu:
      
          Internal changes to tf.contrib symbols
      
      --
      278006330  by Zhichao Lu:
      
          Internal changes to tf.contrib symbols
      
      --
      277593959  by Zhichao Lu:
      
            Make the ssd_feature_extractor_test.py PY3 compatible. The "six.zip" will use "itertools.izip" in Python 2 and "zip" in Python 3....
  19. 17 Oct, 2019 1 commit
    • Release MobileNet V3 models and SSDLite models with MobileNet V3 backbone. (#7678) · 0ba83cf0
      pkulzc authored
      * Merged commit includes the following changes:
      275131829  by Sergio Guadarrama:
      
          Updates mobilenet/README.md to be GitHub compatible, adds V2+ reference to mobilenet_v1.md file, and fixes invalid markdown
      
      --
      274908068  by Sergio Guadarrama:
      
          Opensource MobilenetV3 detection models.
      
      --
      274697808  by Sergio Guadarrama:
      
          Fixed cases where tf.TensorShape was constructed with float dimensions
      
          This is a prerequisite for making TensorShape and Dimension more strict
          about the types of their arguments.
      
      --
      273577462  by Sergio Guadarrama:
      
          Fixing `conv_defs['defaults']` override issue.
      
      --
      272801298  by Sergio Guadarrama:
      
          Adds links to trained models for Mobilenet V3, adds a version of minimalistic mobilenet-v3 to the definitions.
      
      --
      268928503  by Sergio Guadarrama:
      
          Mobilenet v2 with group normalization.
      
      --
      263492735  by Sergio Guadarrama:
      
          Internal change
      
      260037126  by Sergio Guadarrama:
      
          Adds an option of using a custom depthwise operation in `expanded_conv`.
      
      --
      259997001  by Sergio Guadarrama:
      
          Explicitly mark Python binaries/tests with python_version = "PY2".
      
      --
      252697685  by Sergio Guadarrama:
      
          Internal change
      
      251918746  by Sergio Guadarrama:
      
          Internal change
      
      251909704  by Sergio Guadarrama:
      
          Mobilenet V3 backbone implementation.
      
      --
      247510236  by Sergio Guadarrama:
      
          Internal change
      
      246196802  by Sergio Guadarrama:
      
          Internal change
      
      246014539  by Sergio Guadarrama:
      
          Internal change
      
      245891435  by Sergio Guadarrama:
      
          Internal change
      
      245834925  by Sergio Guadarrama:
      
          n/a
      
      --
      
      PiperOrigin-RevId: 275131829
      
      * Merged commit includes the following changes:
      274959989  by Zhichao Lu:
      
          Update detection model zoo with MobilenetV3 SSD candidates.
      
      --
      274908068  by Zhichao Lu:
      
          Opensource MobilenetV3 detection models.
      
      --
      274695889  by richardmunoz:
      
          RandomPatchGaussian preprocessing step
      
          This step can be used during model training to randomly apply gaussian noise to a random image patch. Example addition to an Object Detection API pipeline config:
      
          train_config {
            ...
            data_augmentation_options {
              random_patch_gaussian {
                random_coef: 0.5
                min_patch_size: 1
                max_patch_size: 250
                min_gaussian_stddev: 0.0
                max_gaussian_stddev: 1.0
              }
            }
            ...
          }
      
      --
      274257872  by lzc:
      
          Internal change.
      
      --
      274114689  by Zhichao Lu:
      
          Pass native_resize flag to other FPN variants.
      
      --
      274112308  by lzc:
      
          Internal change.
      
      --
      274090763  by richardmunoz:
      
          Util function for getting a patch mask on an image for use with the Object Detection API
      
      --
      274069806  by Zhichao Lu:
      
          Adding functions which will help compute predictions and losses for CenterNet.
      
      --
      273860828  by lzc:
      
          Internal change.
      
      --
      273380069  by richardmunoz:
      
          RandomImageDownscaleToTargetPixels preprocessing step
      
          This step can be used during model training to randomly downscale an image to a random target number of pixels. If the image does not contain more than the target number of pixels, then downscaling is skipped. Example addition to an Object Detection API pipeline config:
      
          train_config {
            ...
            data_augmentation_options {
              random_downscale_to_target_pixels {
                random_coef: 0.5
                min_target_pixels: 300000
                max_target_pixels: 500000
              }
            }
            ...
          }
      
      --
      272987602  by Zhichao Lu:
      
          Avoid -inf when empty box list is passed.
      
      --
      272525836  by Zhichao Lu:
      
          Cleanup repeated resizing code in meta archs.
      
      --
      272458667  by richardmunoz:
      
          RandomJpegQuality preprocessing step
      
          This step can be used during model training to randomly encode the image into a jpeg with a random quality level. Example addition to an Object Detection API pipeline config:
      
          train_config {
            ...
            data_augmentation_options {
              random_jpeg_quality {
                random_coef: 0.5
                min_jpeg_quality: 80
                max_jpeg_quality: 100
              }
            }
            ...
          }
      
      --
      271412717  by Zhichao Lu:
      
          Enables TPU training with the V2 eager + tf.function Object Detection training loops.
      
      --
      270744153  by Zhichao Lu:
      
          Adding the offset and size target assigners for CenterNet.
      
      --
      269916081  by Zhichao Lu:
      
          Include basic installation in Object Detection API tutorial.
          Also:
           - Use TF2.0
           - Use saved_model
      
      --
      269376056  by Zhichao Lu:
      
          Fix to variable loading in RetinaNet w/ custom loops (makes the code rely a little less on the exact name scopes that are generated).
      
      --
      269256251  by lzc:
      
          Add use_partitioned_nms field to config and update post_processing_builder to honor that flag when building nms function.
      
      --
      268865295  by Zhichao Lu:
      
          Adding functionality for importing and merging back internal state of the metric.
      
      --
      268640984  by Zhichao Lu:
      
          Fix computation of gaussian sigma value to create CenterNet heatmap target.
      
      --
      267475576  by Zhichao Lu:
      
          Fix for exporter trying to export non-existent exponential moving averages.
      
      --
      267286768  by Zhichao Lu:
      
          Update mixed-precision policy.
      
      --
      266166879  by Zhichao Lu:
      
          Internal change
      
      265860884  by Zhichao Lu:
      
          Apply floor function to center coordinates when creating heatmap for CenterNet target.
      
      --
      265702749  by Zhichao Lu:
      
          Internal change
      
      --
      264241949  by ronnyvotel:
      
          Updating Faster R-CNN 'final_anchors' to be in normalized coordinates.
      
      --
      264175192  by lzc:
      
          Update model_fn to only read hparams if it is not None.
      
      --
      264159328  by Zhichao Lu:
      
          Modify nearest neighbor upsampling to eliminate a multiply operation. For quantized models, the multiply operation gets unnecessarily quantized and reduces accuracy (simple stacking would work in place of the broadcast op which doesn't require quantization). Also removes an unnecessary reshape op.
      
      --
      263668306  by Zhichao Lu:
      
          Add the option to use dynamic map_fn for batch NMS
      
      --
      263031163  by Zhichao Lu:
      
          Mark outside compilation for NMS as optional.
      
      --
      263024916  by Zhichao Lu:
      
          Add an ExperimentalModel meta arch for experimenting with new model types.
      
      --
      262655894  by Zhichao Lu:
      
          Add the center heatmap target assigner for CenterNet
      
      --
      262431036  by Zhichao Lu:
      
          Adding add_eval_dict to allow for evaluation on model_v2
      
      --
      262035351  by ronnyvotel:
      
          Removing any non-Tensor predictions from the third stage of Mask R-CNN.
      
      --
      261953416  by Zhichao Lu:
      
          Internal change.
      
      --
      261834966  by Zhichao Lu:
      
          Fix the NMS OOM issue on TPU by forcing NMS to run outside of TPU.
      
      --
      261775941  by Zhichao Lu:
      
          Make Keras InputLayer compatible with both TF 1.x and TF 2.0.
      
      --
      261775633  by Zhichao Lu:
      
          Visualize additional channels with ground-truth bounding boxes.
      
      --
      261768117  by lzc:
      
          Internal change.
      
      --
      261766773  by ronnyvotel:
      
          Exposing `return_raw_detections_during_predict` in Faster R-CNN Proto.
      
      --
      260975089  by ronnyvotel:
      
          Moving calculation of batched prediction tensor names after all tensors in prediction dictionary are created.
      
      --
      259816913  by ronnyvotel:
      
          Adding raw detection boxes and feature map indices to SSD
      
      --
      259791955  by Zhichao Lu:
      
          Added a flag to control the use of partitioned_non_max_suppression.
      
      --
      259580475  by Zhichao Lu:
      
          Tweak quantization-aware training re-writer to support NasFpn model architecture.
      
      --
      259579943  by rathodv:
      
          Add a meta target assigner proto and builders in OD API.
      
      --
      259577741  by Zhichao Lu:
      
          Internal change.
      
      --
      259366315  by lzc:
      
          Internal change.
      
      --
      259344310  by ronnyvotel:
      
          Updating faster rcnn so that raw_detection_boxes from predict() are in normalized coordinates.
      
      --
      259338670  by Zhichao Lu:
      
          Add support for use_native_resize_op to more feature extractors. Use dynamic shapes when static shapes are not available.
      
      --
      259083543  by ronnyvotel:
      
          Updating/fixing documentation.
      
      --
      259078937  by rathodv:
      
          Add prediction fields for tensors returned from detection_model.predict.
      
      --
      259044601  by Zhichao Lu:
      
          Add protocol buffer and builders for temperature scaling calibration.
      
      --
      259036770  by lzc:
      
          Internal changes.
      
      --
      259006223  by ronnyvotel:
      
          Adding detection anchor indices to Faster R-CNN Config. This is useful when one wishes to associate final detections and the anchors (or pre-nms boxes) from which they originated.
      
      --
      258872501  by Zhichao Lu:
      
          Run the training pipeline of ssd + resnet_v1_50 + fpn with a checkpoint.
      
      --
      258840686  by ronnyvotel:
      
          Adding standard outputs to DetectionModel.predict(). This CL only updates Faster R-CNN. Other meta architectures will be updated in future CLs.
      
      --
      258672969  by lzc:
      
          Internal change.
      
      --
      258649494  by lzc:
      
          Internal changes.
      
      --
      258630321  by ronnyvotel:
      
          Fixing documentation in shape_utils.flatten_dimensions().
      
      --
      258468145  by Zhichao Lu:
      
          Add additional output tensors parameter to Postprocess op.
      
      --
      258099219  by Zhichao Lu:
      
          Internal changes
      
      --
      
      PiperOrigin-RevId: 274959989
  20. 15 Jul, 2019 1 commit
    • Object detection changes: (#7208) · fe748d4a
      pkulzc authored
      257914648  by lzc:
      
          Internal changes
      
      --
      257525973  by Zhichao Lu:
      
          Fixes bug that silently prevents checkpoints from loading when training w/ eager + functions. Also sets up scripts to run training.
      
      --
      257296614  by Zhichao Lu:
      
          Adding detection_features to model outputs
      
      --
      257234565  by Zhichao Lu:
      
          Fix wrong order of `classes_with_max_scores` in class-agnostic NMS caused by
          sorting in partitioned-NMS.
      
      --
      257232002  by ronnyvotel:
      
          Supporting `filter_nonoverlapping` option in np_box_list_ops.clip_to_window().
      
      --
      257198282  by Zhichao Lu:
      
          Adding the focal loss and l1 loss from the Objects as Points paper.
      
      --
      257089535  by Zhichao Lu:
      
          Create Keras based ssd + resnetv1 + fpn.
      
      --
      257087407  by Zhichao Lu:
      
          Make object_detection/data_decoders Python3-compatible.
      
      --
      257004582  by Zhichao Lu:
      
          Updates _decode_raw_data_into_masks_and_boxes to the latest binary masks-to-string encoding format.
      
      --
      257002124  by Zhichao Lu:
      
          Make object_detection/utils Python3-compatible, except json_utils.
      
          The patching trick used in json_utils is not going to work in Python 3.
      
      --
      256795056  by lzc:
      
          Add a detection_anchor_indices field to detection outputs.
      
      --
      256477542  by Zhichao Lu:
      
          Make object_detection/core Python3-compatible.
      
      --
      256387593  by Zhichao Lu:
      
          Edit class_id_function_approximations builder to skip class ids not present in label map.
      
      --
      256259039  by Zhichao Lu:
      
          Move NMS to TPU for FasterRCNN.
      
      --
      256071360  by rathodv:
      
          When multiclass_scores is empty, add one-hot encoding of groundtruth_classes as multiclass scores so that data_augmentation ops that expect the presence of multiclass_scores don't have to individually handle this case.
      
          Also copy input tensor_dict to out_tensor_dict first to avoid inplace modification.
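
The fallback described above can be sketched in plain Python, for illustration only. The function name `add_fallback_multiclass_scores`, the use of lists instead of tensors, and zero-indexed class ids are assumptions, not the OD API code.

```python
def add_fallback_multiclass_scores(tensor_dict, num_classes):
    """Return a copy of tensor_dict whose multiclass_scores is never empty.

    Hypothetical helper: lists stand in for tensors, class ids are assumed
    to be zero-indexed.
    """
    # Copy first so the caller's dictionary is not modified in place.
    out_tensor_dict = dict(tensor_dict)
    if not out_tensor_dict.get("multiclass_scores"):
        # One row per groundtruth box, with 1.0 at the box's class id.
        out_tensor_dict["multiclass_scores"] = [
            [1.0 if class_id == k else 0.0 for k in range(num_classes)]
            for class_id in out_tensor_dict["groundtruth_classes"]
        ]
    return out_tensor_dict
```

Downstream augmentation code can then read `multiclass_scores` unconditionally instead of handling the empty case itself.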
      
      --
      256023645  by Zhichao Lu:
      
          Adds the first WIP iterations of TensorFlow v2 eager + functions style custom training & evaluation loops.
      
      --
      255980623  by Zhichao Lu:
      
          Adds a new data augmentation operation "remap_labels" which remaps a set of labels to a new label.
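
A minimal sketch of the remapping idea, for illustration only. The function signature and parameter names are assumptions; the actual preprocessor op works on tensors and may differ.

```python
def remap_labels(labels, original_labels, new_label):
    """Replace every label found in original_labels with new_label.

    Hypothetical pure-Python stand-in for the augmentation op.
    """
    to_remap = set(original_labels)
    return [new_label if label in to_remap else label for label in labels]
```

For example, `remap_labels([1, 2, 3, 2], original_labels=[2, 3], new_label=7)` leaves label 1 alone and maps the rest to 7.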
      
      --
      255753259  by Zhichao Lu:
      
          Announcement of the released evaluation tutorial for Open Images Challenge
          2019.
      
      --
      255698776  by lzc:
      
          Fix rewrite_nn_resize_op function which was broken by tf forward compatibility movement.
      
      --
      255623150  by Zhichao Lu:
      
          Add Keras-based ResnetV1 models.
      
      --
      255504992  by Zhichao Lu:
      
          Fixing the typo in specifying label expansion for ground truth segmentation
          file.
      
      --
      255470768  by Zhichao Lu:
      
          1. Fixing Python bug with parsed arguments.
          2. Adding capability to parse relevant columns from CSV header.
          3. Fixing bug with duplicated labels expansion.
      
      --
      255462432  by Zhichao Lu:
      
          Adds a new data augmentation operation "drop_label_probabilistically" which drops a given label with the given probability. This supports experiments on training in the presence of label noise.
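
A minimal sketch of probabilistic label dropping, for illustration only. The signature, the parallel `boxes`/`labels` lists, and the optional `rng` argument are assumptions rather than the OD API interface.

```python
import random


def drop_label_probabilistically(boxes, labels, dropped_label,
                                 drop_probability, rng=None):
    """Drop each box carrying dropped_label with the given probability.

    Hypothetical helper: boxes and labels are parallel Python lists.
    """
    if rng is None:
        rng = random.Random()
    kept_boxes, kept_labels = [], []
    for box, label in zip(boxes, labels):
        # rng.random() is in [0, 1), so probability 1.0 always drops
        # and probability 0.0 never drops.
        if label == dropped_label and rng.random() < drop_probability:
            continue
        kept_boxes.append(box)
        kept_labels.append(label)
    return kept_boxes, kept_labels
```

Passing a seeded `random.Random` makes the simulated label noise reproducible across runs.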
      
      --
      255441632  by rathodv:
      
          Fallback on groundtruth classes when multiclass_scores tensor is empty.
      
      --
      255434899  by Zhichao Lu:
      
          Ensuring the evaluation binary can run even with big files by synchronizing
          processing of ground truth and predictions: this way, ground truth is not stored but used
          immediately for evaluation. For ground truth with object masks, this allows running
          evaluations on relatively large sets.
      
      --
      255337855  by lzc:
      
          Internal change.
      
      --
      255308908  by Zhichao Lu:
      
          Add comment to clarify usage of calibration parameters proto.
      
      --
      255266371  by Zhichao Lu:
      
          Ensuring correct processing of the case when no groundtruth masks are provided
          for an image.
      
      --
      255236648  by Zhichao Lu:
      
          Refactor model_builder in faster_rcnn.py to a util_map, so that it can be overwritten.
      
      --
      255093285  by Zhichao Lu:
      
          Updating capability to subsample data during evaluation
      
      --
      255081222  by rathodv:
      
          Convert groundtruth masks to be of type float32 before they are used in the loss function.

          When using mixed precision training, masks are represented using bfloat16 tensors in the input pipeline for performance reasons. We need to convert them to float32 before using them in the loss function.
      
      --
      254788436  by Zhichao Lu:
      
          Add forward_compatible to non_max_suppression_with_scores to make it
          compatible with older TensorFlow versions.
      
      --
      254442362  by Zhichao Lu:
      
          Add num_layer field to ssd feature extractor proto.
      
      --
      253911582  by jonathanhuang:
      
          Plumbs Soft-NMS options (using the new tf.image.non_max_suppression_with_scores op) into the TF Object Detection API.  It adds a `soft_nms_sigma` field to the postprocessing proto file and plumbs this through to both the multiclass and class_agnostic versions of NMS. Note that there is no effect on behavior of NMS when soft_nms_sigma=0 (which it is set to by default).
      
          See also "Soft-NMS -- Improving Object Detection With One Line of Code" by Bodla et al (https://arxiv.org/abs/1704.04503)
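
A minimal pure-Python sketch of the Gaussian rescoring from the Bodla et al. paper, for illustration only. It is not the implementation behind tf.image.non_max_suppression_with_scores; the function names, the (ymin, xmin, ymax, xmax) box layout, and the default thresholds are assumptions. With sigma set to 0 the sketch falls back to ordinary hard NMS, which mirrors the "no effect when soft_nms_sigma=0" note above.

```python
import math


def iou(a, b):
    """Intersection over union of two (ymin, xmin, ymax, xmax) boxes."""
    inter_h = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_w = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_h * inter_w
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0 else 0.0


def soft_nms(boxes, scores, iou_threshold=0.5, sigma=0.0, score_threshold=0.0):
    """Return [(index, final_score)] in selection order."""
    scores = list(scores)
    candidates = list(range(len(boxes)))
    selected = []
    while candidates:
        best = max(candidates, key=lambda i: scores[i])
        if scores[best] <= score_threshold:
            break
        selected.append((best, scores[best]))
        candidates.remove(best)
        survivors = []
        for i in candidates:
            overlap = iou(boxes[best], boxes[i])
            if sigma > 0.0:
                # Soft-NMS: decay the score instead of discarding the box.
                scores[i] *= math.exp(-(overlap ** 2) / sigma)
                survivors.append(i)
            elif overlap <= iou_threshold:
                # Hard NMS: keep only boxes that do not overlap too much.
                survivors.append(i)
        candidates = survivors
    return selected
```

With a positive sigma, heavily overlapping boxes are kept but ranked lower, so a later score threshold decides whether they survive.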
      
      --
      253703949  by Zhichao Lu:
      
          Internal test fixes.
      
      --
      253151266  by Zhichao Lu:
      
          Fix the op type check for FusedBatchNorm, given that we introduced
          FusedBatchNormV3 in a previous change.
      
      --
      252718956  by Zhichao Lu:
      
          Customize activation function to enable relu6 instead of relu for saliency
          prediction model seastarization
      
      --
      252158593  by Zhichao Lu:
      
          Make object_detection/core Python3-compatible.
      
      --
      252150717  by Zhichao Lu:
      
          Make object_detection/core Python3-compatible.
      
      --
      251967048  by Zhichao Lu:
      
          Make GraphRewriter proto extensible.
      
      --
      251950039  by Zhichao Lu:
      
          Remove experimental_export_device_assignment from TPUEstimator.export_savedmodel(), so as to remove rewrite_for_inference().
      
          As a replacement, the export_savedmodel() V2 API supports device_assignment, where the user calls tpu.rewrite in model_fn and passes in device_assignment there.
      
      --
      251890697  by rathodv:
      
          Updated docstring to include new output nodes.
      
      --
      251662894  by Zhichao Lu:
      
          Add autoaugment augmentation option to object detection API codebase. This
          is an available option in preprocessor.py.
      
          The intended usage of autoaugment is to be done along with random flipping and
          cropping for best results.
      
      --
      251532908  by Zhichao Lu:
      
          Add TrainingDataType enum to track whether class-specific or agnostic data was used to fit the calibration function.
      
          This is useful, since classes with few observations may require a calibration function fit on all classes.
      
      --
      251511339  by Zhichao Lu:
      
          Add multiclass isotonic regression to the calibration builder.
      
      --
      251317769  by pengchong:
      
          Internal Change.
      
      --
      250729989  by Zhichao Lu:
      
          Fixing bug in gt statistics count in case of mask and box annotations.
      
      --
      250729627  by Zhichao Lu:
      
          Label expansion for segmentation.
      
      --
      250724905  by Zhichao Lu:
      
          Fix use_depthwise in fpn and test it with fpnlite on ssd + mobilenet v2.
      
      --
      250670379  by Zhichao Lu:
      
          Internal change
      
      --
      250630364  by lzc:
      
          Fix detection_model_zoo footnotes
      
      --
      250560654  by Zhichao Lu:
      
          Fix static shape issue in matmul_crop_and_resize.
      
      --
      250534857  by Zhichao Lu:
      
          Edit class agnostic calibration function docstring to more accurately describe the function's outputs.
      
      --
      250533277  by Zhichao Lu:
      
          Edit the multiclass messages to use class ids instead of labels.
      
      --
      
      PiperOrigin-RevId: 257914648
  21. 22 May, 2019 1 commit
    • Add TPU SavedModel exporter and refactor OD code (#6737) · 80444539
      Zhuoran Liu authored
      247226201  by ronnyvotel:
      
          Updating the visualization tools to accept unique_ids for color coding.
      
      --
      247067830  by Zhichao Lu:
      
          Add box_encodings_clip_range options for the convolutional box predictor (for TPU compatibility).
      
      --
      246888475  by Zhichao Lu:
      
          Remove unused _update_eval_steps function.
      
      --
      246163259  by lzc:
      
          Add a gather op that can handle ignore indices (which are "-1"s in this case).
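
          A minimal sketch of the idea in plain Python rather than TensorFlow ops. The name `gather_with_ignore` and the `fill_value` parameter are hypothetical and not taken from the change; the only behavior drawn from the description is that an index of -1 is skipped instead of being looked up.

```python
def gather_with_ignore(values, indices, fill_value=0):
    """Gather values[i] for each index i, emitting fill_value where i == -1.

    Hypothetical helper for illustration only. Without the explicit check,
    plain Python indexing with -1 would silently return the last element.
    """
    return [fill_value if i == -1 else values[i] for i in indices]


gathered = gather_with_ignore([10, 20, 30], [2, -1, 0])
```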
      
      --
      246084944  by Zhichao Lu:
      
          Keras based implementation for SSD + MobilenetV2 + FPN.
      
      --
      245544227  by rathodv:
      
          Add batch_get_targets method to target assigner module to gather any groundtruth tensors based on the results of target assigner.
      
      --
      245540854  by rathodv:
      
          Update target assigner to return match tensor instead of a match object.
      
      --
      245434441  by Zhichao Lu:
      
          Add README for tpu_exporters package.
      
      --
      245381834  by lzc:
      
          Internal change.
      
      --
      245298983  by Zh...
  22. 07 Mar, 2019 1 commit
    • Merged commit includes the following changes: (#6315) · 05584085
      pkulzc authored
      236813471  by lzc:
      
          Internal change.
      
      --
      236507310  by lzc:
      
          Fix preprocess.random_resize_method config type issue. The target height and width are passed as "size" to tf.image.resize_images, which only accepts integers.
      
      --
      236409989  by Zhichao Lu:
      
          Config export_to_tpu from function parameter instead of HParams for TPU inference.
      
      --
      236403186  by Zhichao Lu:
      
          Make graph file names optional arguments.
      
      --
      236237072  by Zhichao Lu:
      
          Minor bugfix for keyword args.
      
      --
      236209602  by Zhichao Lu:
      
          Add support for PartitionedVariable to get_variables_available_in_checkpoint.
      
      --
      235828658  by Zhichao Lu:
      
          Automatically stop evaluation jobs when training is finished.
      
      --
      235817964  by Zhichao Lu:
      
          Add an optional process_metrics_fn callback to eval_util, it gets called
          with evaluation results once each evaluation is complete.
      
      --
      235788721  by lzc:
      
          Fix yml file tf runtime...
  23. 21 Sep, 2018 1 commit
    • Release iNaturalist Species-trained models, refactor of evaluation, box predictor for object detection. (#5289) · 99256cf4
      pkulzc authored
      
      * Merged commit includes the following changes:
      212389173  by Zhichao Lu:
      
          1. Replace tf.boolean_mask with tf.where
      
      --
      212282646  by Zhichao Lu:
      
          1. Fix a typo in model_builder.py and add a test to cover it.
      
      --
      212142989  by Zhichao Lu:
      
          Only resize masks in meta architecture if it has not already been resized in the input pipeline.
      
      --
      212136935  by Zhichao Lu:
      
          Choose matmul or native crop_and_resize in the model builder instead of faster r-cnn meta architecture.
      
      --
      211907984  by Zhichao Lu:
      
          Make eval input reader repeated field and update config util to handle this field.
      
      --
      211858098  by Zhichao Lu:
      
          Change the implementation of merge_boxes_with_multiple_labels.
      
      --
      211843915  by Zhichao Lu:
      
          Add Mobilenet v2 + FPN support.
      
      --
      211655076  by Zhichao Lu:
      
          Bug fix for generic keys in config overrides
      
          In generic configuration overrides, we had a duplicate entry for train_input_config and we were missing the eval_input_config and eval_config.
      
          This change also introduces testing for all config overrides.
      
      --
      211157501  by Zhichao Lu:
      
          Make the locally-modified conv defs a copy.
      
          So that it doesn't modify MobileNet conv defs globally for other code that
          transitively imports this package.
      
      --
      211112813  by Zhichao Lu:
      
          Refactoring visualization tools for Estimator's eval_metric_ops. This will make it easier for future models to take advantage of a single interface and mechanics.
      
      --
      211109571  by Zhichao Lu:
      
          A test decorator.
      
      --
      210747685  by Zhichao Lu:
      
          For FPN, when use_depthwise is set to true, use slightly modified mobilenet v1 config.
      
      --
      210723882  by Zhichao Lu:
      
          Integrating the losses mask into the meta architectures. When providing groundtruth, one can optionally specify annotation information (i.e. which images are labeled vs. unlabeled). For any image that is unlabeled, there is no loss accumulation.
      
      --
      210673675  by Zhichao Lu:
      
          Internal change.
      
      --
      210546590  by Zhichao Lu:
      
          Internal change.
      
      --
      210529752  by Zhichao Lu:
      
          Support batched inputs with ops.matmul_crop_and_resize.
      
          With this change the new inputs are images of shape [batch, height, width, depth] and boxes of shape [batch, num_boxes, 4]. The output tensor is of the shape [batch, num_boxes, crop_height, crop_width, depth].
      
      --
      210485912  by Zhichao Lu:
      
          Fix TensorFlow version check in object_detection_tutorial.ipynb
      
      --
      210484076  by Zhichao Lu:
      
          Reduce TPU memory required for single image matmul_crop_and_resize.
      
          Using tf.einsum eliminates intermediate tensors, tiling and expansion. For an image of size [40, 40, 1024] and boxes of shape [300, 4], HBM memory usage goes down from 3.52G to 1.67G.
      
      --
      210468361  by Zhichao Lu:
      
          Remove PositiveAnchorLossCDF/NegativeAnchorLossCDF to resolve the "Main thread is not in main loop" error in local training.
      
      --
      210100253  by Zhichao Lu:
      
          Pooling pyramid feature maps: add option to replace max pool with convolution layers.
      
      --
      209995842  by Zhichao Lu:
      
          Fix a bug which prevents variable sharing in Faster RCNN.
      
      --
      209965526  by Zhichao Lu:
      
          Add support for enabling export_to_tpu through the estimator.
      
      --
      209946440  by Zhichao Lu:
      
          Replace deprecated tf.train.Supervisor with tf.train.MonitoredSession. MonitoredSession also takes away the hassle of starting queue runners.
      
      --
      209888003  by Zhichao Lu:
      
          Implement function to handle data where source_id is not set.
      
          If the field source_id is found to be the empty string for any image during runtime, it will be replaced with a random string. This avoids hash collisions on datasets where many examples do not have source_id set. Those hash collisions have unintended side effects and may lead to bugs in the detection pipeline.
      
      --
      209842134  by Zhichao Lu:
      
          Converting loss mask into multiplier, rather than using it as a boolean mask (which changes tensor shape). This is necessary, since other utilities (e.g. hard example miner) require a loss matrix with the same dimensions as the original prediction tensor.
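
          A small illustration of why a multiplier keeps the shape intact, written in plain Python rather than with tensors. The function name `apply_loss_mask` and the nested-list layout are assumptions made for this sketch, not the code from the change.

```python
def apply_loss_mask(losses, is_labeled):
    """Zero out the losses of unlabeled images while keeping every row.

    losses: one row of per-anchor losses per image, i.e. [batch][num_anchors].
    is_labeled: one boolean per image.
    Hypothetical helper: multiplying by 0.0 or 1.0 leaves the matrix shape
    unchanged, whereas filtering rows with a boolean mask would shrink it.
    """
    return [
        [loss * (1.0 if keep else 0.0) for loss in row]
        for row, keep in zip(losses, is_labeled)
    ]


masked = apply_loss_mask([[0.5, 1.0], [2.0, 3.0]], [True, False])
```

          The result still has one row per image, so a downstream utility that expects the original dimensions can consume it directly.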
      
      --
      209768066  by Zhichao Lu:
      
          Adding ability to remove loss computation from specific images in a batch, via an optional boolean mask.
      
      --
      209722556  by Zhichao Lu:
      
          Remove dead code.
      
          (_USE_C_API was flipped to True by default in TensorFlow 1.8)
      
      --
      209701861  by Zhichao Lu:
      
          This CL cleans-up some tf.Example creation snippets, by reusing the convenient tf.train.Feature building functions in dataset_util.
      
      --
      209697893  by Zhichao Lu:
      
          Do not overwrite num_epoch for eval input. This leads to errors in some cases.
      
      --
      209694652  by Zhichao Lu:
      
          Sample boxes by jittering around the currently given boxes.
      
      --
      209550300  by Zhichao Lu:
      
          `create_category_index_from_labelmap()` function now accepts `use_display_name` parameter.
          Also added create_categories_from_labelmap function for convenience
      
      --
      209490273  by Zhichao Lu:
      
          Check result_dict type before accessing image_id via key.
      
      --
      209442529  by Zhichao Lu:
      
          Introducing the capability to sample examples for evaluation. This makes it easy to specify one full epoch of evaluation, or a subset (e.g. sample 1 of every N examples).
      
      --
      208941150  by Zhichao Lu:
      
          Adding the capability of exporting the results in json format.
      
      --
      208888798  by Zhichao Lu:
      
          Fixes wrong dictionary key for num_det_boxes_per_image.
      
      --
      208873549  by Zhichao Lu:
      
          Reduce the number of HLO ops created by matmul_crop_and_resize.
      
          Do not unroll along the channels dimension. Instead, transpose the input image dimensions, apply tf.matmul and transpose back.
      
          The number of HLO instructions for 1024 channels drops from 12368 to 110.
      
      --
      208844315  by Zhichao Lu:
      
          Add an option to use tf.image.non_max_suppression_padded in SSD post-process
      
      --
      208731380  by Zhichao Lu:
      
          Add field in box_predictor config to enable mask prediction and update builders accordingly.
      
      --
      208699405  by Zhichao Lu:
      
          This CL creates a keras-based multi-resolution feature map extractor.
      
      --
      208557208  by Zhichao Lu:
      
          Add TPU tests for Faster R-CNN Meta arch.
      
          * Tests that two_stage_predict and total_loss tests run successfully on TPU.
          * Small mods to multiclass_non_max_suppression to preserve static shapes.
      
      --
      208499278  by Zhichao Lu:
      
          This CL makes sure the Keras convolutional box predictor & head layers apply activation layers *after* normalization (as opposed to before).
      
      --
      208391694  by Zhichao Lu:
      
          Updating visualization tool to produce multiple evaluation images.
      
      --
      208275961  by Zhichao Lu:
      
          This CL adds a Keras version of the Convolutional Box Predictor, as well as more general infrastructure for making Keras Prediction heads & Keras box predictors.
      
      --
      208275585  by Zhichao Lu:
      
          This CL enables the Keras layer hyperparameter object to build a dedicated activation layer, and to disable activation by default in the op layer construction kwargs.
      
          This is necessary because in most cases the normalization layer must be applied before the activation layer. So, in Keras models we must set the convolution activation in a dedicated layer after normalization is applied, rather than setting it in the convolution layer construction args.
      
      --
      208263792  by Zhichao Lu:
      
          Add a new SSD mask meta arch that can predict masks for SSD models.
          Changes including:
           - overwrite loss function to add mask loss computation.
           - update ssd_meta_arch to handle masks if predicted in predict and postprocessing.
      
      --
      208000218  by Zhichao Lu:
      
          Make FasterRCNN choose static shape operations only in training mode.
      
      --
      207997797  by Zhichao Lu:
      
          Add static boolean_mask op to box_list_ops.py and use that in faster_rcnn_meta_arch.py to support use_static_shapes option.
      
      --
      207993460  by Zhichao Lu:
      
          Include FGVC detection models in model zoo.
      
      --
      207971213  by Zhichao Lu:
      
          Remove the restriction to run tf.nn.top_k op on CPU
      
      --
      207961187  by Zhichao Lu:
      
          Build the first stage NMS function in the model builder and pass it to FasterRCNN meta arch.
      
      --
      207960608  by Zhichao Lu:
      
          Internal Change.
      
      --
      207927015  by Zhichao Lu:
      
          Have an option to use the TPU compatible NMS op cl/206673787, in the batch_multiclass_non_max_suppression function. On setting pad_to_max_output_size to true, the output nmsed boxes are padded to be of length max_size_per_class.
      
          This can be used in first stage Region Proposal Network in FasterRCNN model by setting the first_stage_nms_pad_to_max_proposals field to true in config proto.
      
      --
      207809668  by Zhichao Lu:
      
          Add option to use depthwise separable conv instead of conv2d in FPN and WeightSharedBoxPredictor. More specifically, there are two related configs:
          - SsdFeatureExtractor.use_depthwise
          - WeightSharedConvolutionalBoxPredictor.use_depthwise
      
      --
      207808651  by Zhichao Lu:
      
          Fix the static balanced positive negative sampler's TPU tests
      
      --
      207798658  by Zhichao Lu:
      
          Fixes a post-refactoring bug where the pre-prediction convolution layers in the convolutional box predictor are ignored.
      
      --
      207796470  by Zhichao Lu:
      
          Make slim endpoints visible in FasterRCNNMetaArch.
      
      --
      207787053  by Zhichao Lu:
      
          Refactor ssd_meta_arch so that the target assigner instance is passed into the SSDMetaArch constructor rather than constructed inside.
      
      --
      
      PiperOrigin-RevId: 212389173
      
      * Fix detection model zoo typo.
      
      * Modify tf example decoder to handle label maps with either `display_name` or `name` fields seamlessly.
      
      Currently, tf example decoder uses only `name` field to look up ids for class text field present in the data. This change uses both `display_name` and `name` fields in the label map to fetch ids for class text.
      
      PiperOrigin-RevId: 212672223
      
      * Modify create_coco_tf_record tool to write out class text instead of class labels.
      
      PiperOrigin-RevId: 212679112
      
      * Fix detection model zoo typo.
      
      PiperOrigin-RevId: 212715692
      
      * Adding the following two optional flags to WeightSharedConvolutionalBoxHead:
      1) In the box head, apply clipping to box encodings.
      2) In the class head, apply sigmoid to class predictions at inference time.
      
      PiperOrigin-RevId: 212723242
      
      * Support class confidences in merge boxes with multiple labels.
      
      PiperOrigin-RevId: 212884998
      
      * Creates multiple eval specs for object detection.
      
      PiperOrigin-RevId: 212894556
      
      * Set batch_norm on last layer in Mask Head to None.
      
      PiperOrigin-RevId: 213030087
      
      * Enable bfloat16 training for object detection models.
      
      PiperOrigin-RevId: 213053547
      
      * Skip padding op when unnecessary.
      
      PiperOrigin-RevId: 213065869
      
      * Modify `Matchers` to use groundtruth weights before performing matching.
      
      Groundtruth weights tensor is used to indicate padding in groundtruth box tensor. It is handled in `TargetAssigner` by creating appropriate classification and regression target weights based on the groundtruth box each anchor matches to. However, options such as `force_match_all_rows` in `ArgmaxMatcher` force certain anchors to match to groundtruth boxes that are just paddings thereby reducing the number of anchors that could otherwise match to real groundtruth boxes.
      
      For single-stage models like SSD the effect is negligible, as there are two orders of magnitude more anchors than padded groundtruth boxes. But for Faster R-CNN and Mask R-CNN, where there are only 300 anchors in the second stage, a significant number of these match to groundtruth paddings, which reduces the number of anchors regressing to real groundtruth boxes and degrades performance severely.
      
      Therefore, this change introduces an additional boolean argument `valid_rows` to `Matcher.match` methods, and the implementations now ignore such padded groundtruth boxes during matching.
      
      PiperOrigin-RevId: 213345395
      
      * Add release note for iNaturalist Species trained models.
      
      PiperOrigin-RevId: 213347179
      
      * Fix the bug of uninitialized gt_is_crowd_list variable.
      
      PiperOrigin-RevId: 213364858
      
      * ...text exposed to open source public git repo...
      
      PiperOrigin-RevId: 213554260
  24. 01 Aug, 2018 1 commit
    • Refactor object detection box predictors and fix some issues with model_main. (#4965) · 02a9969e
      pkulzc authored
      * Merged commit includes the following changes:
      206852642  by Zhichao Lu:
      
          Build the balanced_positive_negative_sampler in the model builder for FasterRCNN. Also adds an option to use the static implementation of the sampler.
      
      --
      206803260  by Zhichao Lu:
      
          Fixes a misplaced argument in resnet fpn feature extractor.
      
      --
      206682736  by Zhichao Lu:
      
          This CL modifies the SSD meta architecture to support both Slim-based and Keras-based box predictors, and begins preparation for Keras box predictor support in the other meta architectures.
      
          Concretely, this CL adds a new `KerasBoxPredictor` base class and makes the meta architectures appropriately call whichever box predictors they are using.
      
          We can switch the non-ssd meta architectures to fully support Keras box predictors once the Keras Convolutional Box Predictor CL is submitted.
      
      --
      206669634  by Zhichao Lu:
      
          Adds an alternate m...
  25. 13 Jul, 2018 4 commits
    • Update blog link. · e2d46371
      pkulzc authored
    • Update README with blogpost link. · 5cd07c09
      pkulzc authored
    • Merged commit includes the following changes: · 85dd5fa4
      Zhichao Lu authored
      204489224  by Zhichao Lu:
      
          Modify ssd mobilenet v1 fpn config to be a bit more tolerant to OOM failure by bumping down the batch size to 64 and doubling the number of iterations to 25k. It now converges in 2.5 hours.
      
      --
      204488942  by Zhichao Lu:
      
          Internal change
      
      --
      204480631  by Zhichao Lu:
      
          This CL makes sure that the num_steps parameter is not updated to 0 if the num_steps field is not mentioned in the config.
      
          The default behavior for the number of training steps is infinite (train forever). The default value of num_steps in train.proto is 0 (for training indefinitely). However, the estimator/training function expects num_steps to be set to None to train indefinitely.
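
          The mapping described above can be sketched as follows. The helper name `resolve_train_steps` is hypothetical and this is not the code from the change; it only illustrates translating the config default of 0 into the None that the training function expects.

```python
def resolve_train_steps(config_num_steps):
    """Translate the config value into what the training function expects.

    Hypothetical helper: a value of 0 (the config default, meaning "train
    indefinitely") becomes None; any positive step count is passed through.
    """
    return config_num_steps if config_num_steps > 0 else None


unset_steps = resolve_train_steps(0)
explicit_steps = resolve_train_steps(25000)
```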
      
      --
      204437217  by Zhichao Lu:
      
          Create a Docker image to support TensorFlow Lite / Object Detection blog post.
      
      --
      204317570  by Zhichao Lu:
      
          Internal change
      
      PiperOrigin-RevId: 204489224
    • Object detection Internal Changes. (#4757) · 70255908
      pkulzc authored
      * Merged commit includes the following changes:
      204316992  by Zhichao Lu:
      
          Update docs to prepare inputs
      
      --
      204309254  by Zhichao Lu:
      
          Update running_pets.md to use new binaries and correct a few things in running_on_cloud.md
      
      --
      204306734  by Zhichao Lu:
      
          Move old binaries into legacy folder and add deprecation notice.
      
      --
      204267757  by Zhichao Lu:
      
          Fixing a problem in VRD evaluation with missing ground truth annotations for
          images that do not contain objects from 62 groundtruth classes.
      
      --
      204167430  by Zhichao Lu:
      
          This fixes a flaky losses test failure.
      
      --
      203670721  by Zhichao Lu:
      
          Internal change.
      
      --
      203569388  by Zhichao Lu:
      
          Internal change
      
      --
      203546580  by Zhichao Lu:
      
          * Expand TPU compatibility g3doc with config snippets
          * Change mscoco dataset path in sample configs to the sharded versions
      
      --
      203325694  by Zhichao Lu:
      
          Make merge_multiple_label_boxes work for model_main code path.
      
      --
      203305655  by Zhichao Lu:
      
          Remove the 1x1 conv layer before pooling in MobileNet-v1-PPN feature extractor.
      
      --
      203139608  by Zhichao Lu:
      
          - Support exponential_decay with burnin learning rate schedule.
          - Add the minimum learning rate option.
          - Make the exponential decay start only after the burnin steps.
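
          A rough sketch of a schedule with these three properties, in plain Python. The function name, parameter names, the constant burn-in rate and the staircase-style decay are all assumptions made for illustration; the actual implementation may differ in each of these.

```python
def learning_rate_at(step, base_lr, decay_steps, decay_factor,
                     burnin_lr, burnin_steps, min_lr=0.0):
    """Exponential decay that starts only after a burn-in phase.

    Hypothetical schedule for illustration:
      - during the first burnin_steps steps, use the constant burnin_lr;
      - afterwards, decay base_lr by decay_factor once every decay_steps
        steps, counting from the end of burn-in;
      - never return a rate below min_lr.
    """
    if step < burnin_steps:
        return burnin_lr
    decays_applied = (step - burnin_steps) // decay_steps
    return max(base_lr * decay_factor ** decays_applied, min_lr)


lr_during_burnin = learning_rate_at(0, 0.1, 10, 0.5, 0.01, 5)
lr_after_one_decay = learning_rate_at(15, 0.1, 10, 0.5, 0.01, 5)
```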
      
      --
      203068703  by Zhichao Lu:
      
          Modify create_coco_tf_record.py to output sharded files.
      
      --
      203025308  by Zhichao Lu:
      
          Add an option to share the prediction tower in WeightSharedBoxPredictor.
      
      --
      203024942  by Zhichao Lu:
      
          Move ssd mobilenet v1 ppn configs to third party.
      
      --
      202901259  by Zhichao Lu:
      
          Delete obsolete ssd mobilenet v1 focal loss configs and update pets dataset path
      
      --
      202894154  by Zhichao Lu:
      
          Move all TPU compatible ssd mobilenet v1 coco14/pet configs to third party.
      
      --
      202861774  by Zhichao Lu:
      
          Move Retinanet (SSD + FPN + Shared box predictor) configs to third_party.
      
      --
      
      PiperOrigin-RevId: 204316992
      
      * Add original files back.
  26. 02 Jul, 2018 1 commit
    • Open Images Challenge 2018 tools, minor fixes and refactors. (#4661) · 32e7d660
      pkulzc authored
      * Merged commit includes the following changes:
      202804536  by Zhichao Lu:
      
          Return tf.data.Dataset from input_fn that goes into the estimator and use PER_HOST_V2 option for tpu input pipeline config.
      
          This change shaves off 100ms per step resulting in 25 minutes of total reduced training time for ssd mobilenet v1 (15k steps to convergence).
      
      --
      202769340  by Zhichao Lu:
      
          Adding as_matrix() transformation for image-level labels.
      
      --
      202768721  by Zhichao Lu:
      
          Challenge evaluation protocol modification: adding labelmaps creation.
      
      --
      202750966  by Zhichao Lu:
      
          Add the explicit names to two output nodes.
      
      --
      202732783  by Zhichao Lu:
      
          Enforcing that batch size is 1 for evaluation, and no original images are retained during evaluation when use_tpu=False (to avoid dynamic shapes).
      
      --
      202425430  by Zhichao Lu:
      
          Refactor input pipeline to improve performance.
      
      --
      202406389  by Zhichao Lu:
      
          Only check the validity of `warmup_learning_rate` if it will be used.
      
      --
      202330450  by Zhichao Lu:
      
          Adding the description of the flag input_image_label_annotations_csv to add
            image-level labels to tf.Example.
      
      --
      202029012  by Zhichao Lu:
      
          Enabling displaying relationship name in the final metrics output.
      
      --
      202024010  by Zhichao Lu:
      
          Update to the public README.
      
      --
      201999677  by Zhichao Lu:
      
          Fixing the way negative labels are handled in VRD evaluation.
      
      --
      201962313  by Zhichao Lu:
      
          Fix a bug in resize_to_range.
      
      --
      201808488  by Zhichao Lu:
      
          Update ssd_inception_v2_pets.config to use right filename of pets dataset tf records.
      
      --
      201779225  by Zhichao Lu:
      
          Update object detection API installation doc
      
      --
      201766518  by Zhichao Lu:
      
          Add shell script to create pycocotools package for CMLE.
      
      --
      201722377  by Zhichao Lu:
      
          Removes verified_labels field and uses groundtruth_image_classes field instead.
      
      --
      201616819  by Zhichao Lu:
      
          Disable eval_on_tpu since eval_metrics is not setup to execute on TPU.
          Do not use run_config.task_type to switch tpu mode for EVAL,
          since that won't work in unit test.
          Expand unit test to verify that the same instantiation of the Estimator can independently disable eval on TPU whereas training is enabled on TPU.
      
      --
      201524716  by Zhichao Lu:
      
          Disable exporting the model to TPU, since inference is not compatible with TPU.
          Add GOOGLE_INTERNAL support in object detection copy.bara.sky
      
      --
      201453347  by Zhichao Lu:
      
          Fixing bug when evaluating the quantized model.
      
      --
      200795826  by Zhichao Lu:
      
          Fixing parsing bug: image-level labels are parsed as tuples instead of numpy
          arrays.
      
      --
      200746134  by Zhichao Lu:
      
          Adding image_class_text and image_class_label fields into tf_example_decoder.py
      
      --
      200743003  by Zhichao Lu:
      
          Changes to model_main.py and model_tpu_main to enable training and continuous eval.
      
      --
      200736324  by Zhichao Lu:
      
          Replace deprecated squeeze_dims argument.
      
      --
      200730072  by Zhichao Lu:
      
          Make detections only during predict and eval mode while creating model function
      
      --
      200729699  by Zhichao Lu:
      
          Minor correction to internal documentation (definition of Huber loss)
      
      --
      200727142  by Zhichao Lu:
      
          Add command line parsing as a set of flags using argparse and add header to the
          resulting file.
      
      --
      200726169  by Zhichao Lu:
      
          A tutorial on running evaluation for the Open Images Challenge 2018.
      
      --
      200665093  by Zhichao Lu:
      
          Cleanup on variables_helper_test.py.
      
      --
      200652145  by Zhichao Lu:
      
          Add an option to write (non-frozen) graph when exporting inference graph.
      
      --
      200573810  by Zhichao Lu:
      
          Update ssd_mobilenet_v1_coco and ssd_inception_v2_coco download links to point to a newer version.
      
      --
      200498014  by Zhichao Lu:
      
          Add test for groundtruth mask resizing.
      
      --
      200453245  by Zhichao Lu:
      
          Cleaning up exporting_models.md along with exporting scripts
      
      --
      200311747  by Zhichao Lu:
      
          Resize groundtruth mask to match the size of the original image.
      
      --
      200287269  by Zhichao Lu:
      
          Add an option to use a custom MatMul-based crop_and_resize op as an alternative to the TF op in Faster-RCNN
      
      --
      200127859  by Zhichao Lu:
      
          Updating the instructions to run locally with new binary. Also updating pets configs since file path naming has changed.
      
      --
      200127044  by Zhichao Lu:
      
          A simpler evaluation util to compute Open Images Challenge
          2018 metric (object detection track).
      
      --
      200124019  by Zhichao Lu:
      
          Freshening up configuring_jobs.md
      
      --
      200086825  by Zhichao Lu:
      
          Make merge_multiple_label_boxes work for ssd model.
      
      --
      199843258  by Zhichao Lu:
      
          Allows inconsistent feature channels to be compatible with WeightSharedConvolutionalBoxPredictor.
      
      --
      199676082  by Zhichao Lu:
      
          Enable an override for `InputReader.shuffle` for object detection pipelines.
      
      --
      199599212  by Zhichao Lu:
      
          Markdown fixes.
      
      --
      199535432  by Zhichao Lu:
      
          Pass num_additional_channels to tf.example decoder in predict_input_fn.
      
      --
      199399439  by Zhichao Lu:
      
          Adding `num_additional_channels` field to specify how many additional channels to use in the model.
      
      --
      
      PiperOrigin-RevId: 202804536
      
      * Add original model builder and docs back.
  27. 03 May, 2018 1 commit
    • Merged commit includes the following changes: · 63054210
      Zhichao Lu authored
      195269567  by Zhichao Lu:
      
          Removing image summaries during train mode.
      
      --
      195147413  by Zhichao Lu:
      
          SSDLite config for mobilenet v2.
      
      --
      194883585  by Zhichao Lu:
      
          Simplify TPU compatible nearest neighbor upsampling using reshape and broadcasting.
      
      --
      194851009  by Zhichao Lu:
      
          Include ava v2.1 detection models in model zoo.
      
      --
      194292198  by Zhichao Lu:
      
          Add option to evaluate any checkpoint (without requiring write access to that directory and overwriting any existing logs there).
      
      --
      194122420  by Zhichao Lu:
      
          The num_gt_boxes_per_image and num_det_boxes_per_image values were incorrect.
          They should not use the expanded dimension.
      
      --
      193974479  by Zhichao Lu:
      
          Fixing a bug in the coco evaluator.
      
      --
      193959861  by Zhichao Lu:
      
          Read the default batch size from config file.
      
      --
      193737238  by Zhichao Lu:
      
          Fix data augmentation functions.
      
      --
      193576336  by Zhichao Lu:
      
          Add support for training keypoints.
      
      --
      193409179  by Zhichao Lu:
      
          Update protobuf requirements to 3+ in installation docs.
      
      --
      193382651  by Zhichao Lu:
      
          Updating coco evaluation metrics to allow for a batch of image info, rather than a single image.
      
      --
      193244778  by Zhichao Lu:
      
          Remove deprecated batch_norm_trainable field from ssd mobilenet v2 config
      
      --
      193228972  by Zhichao Lu:
      
          Make sure the final layers are also resized proportional to conv_depth_ratio.
      
      --
      193204364  by Zhichao Lu:
      
          Do not add batch norm parameters to final conv2d ops that predict boxes encodings and class scores in weight shared conv box predictor.
      
          This allows us to set proper bias and force initial predictions to be background when using focal loss.
      
      --
      193137342  by Zhichao Lu:
      
          Add a util function to visualize value histogram as a tf.summary.image.
      
      --
      193119411  by Zhichao Lu:
      
          Adding support for reading in logits as groundtruth labels and applying an optional temperature (scaling) before softmax in support of distillation.
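
As a rough illustration of temperature scaling before softmax, here is a minimal NumPy sketch. It assumes the common distillation convention of dividing the logits by the temperature; the entry does not say whether the commit divides or multiplies, and the function name `softmax_with_temperature` is hypothetical.

```python
import numpy as np


def softmax_with_temperature(logits, temperature=1.0):
    """Softmax over the last axis after scaling logits (hypothetical helper).

    Assumption: logits are divided by the temperature, so values above 1
    flatten the distribution and values below 1 sharpen it.
    """
    scaled = np.asarray(logits, dtype=np.float64) / temperature
    # Subtract the row maximum for numerical stability; the result is unchanged.
    scaled = scaled - scaled.max(axis=-1, keepdims=True)
    exp = np.exp(scaled)
    return exp / exp.sum(axis=-1, keepdims=True)
```

Under this convention, a temperature above 1 yields softer targets, which is the usual reason to expose the parameter when training a student model on teacher logits.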
      
      --
      193087707  by Zhichao Lu:
      
          Post-process now works again in train mode.
      
      --
      193067658  by Zhichao Lu:
      
          Fix flakiness in testSSDRandomCropWithMultiClassScores due to randomness.
      
      --
      192922089  by Zhichao Lu:
      
          Add option to set dropout for classification net in weight shared box predictor.
      
      --
      192850747  by Zhichao Lu:
      
          Remove inaccurate caveat from proto file.
      
      --
      192837477  by Zhichao Lu:
      
          Extend to accept different ratios of conv channels.
      
      --
      192813444  by Zhichao Lu:
      
          Adding option for one_box_for_all_classes to the box_predictor
      
      --
      192624207  by Zhichao Lu:
      
          Update trainer to allow reading multiclass scores.
      
      --
      192583425  by Zhichao Lu:
      
          Add implementation of the Visual Relations Detection evaluation metric (per-image evaluation).
      
      --
      192529600  by Zhichao Lu:
      
          Modify the ssd meta arch to allow the option of not adding an implicit background class.
      
      --
      192512429  by Zhichao Lu:
      
          Refactor model_tpu_main.py files and move continuous eval loop into model_lib.py
      
      --
      192494267  by Zhichao Lu:
      
          Update create_pascal_tf_record.py and create_pet_tf_record.py
      
      --
      192485456  by Zhichao Lu:
      
          Enforcing that all eval metric ops have valid python strings.
      
      --
      192472546  by Zhichao Lu:
      
          Set regularize_depthwise to true in mobilenet_v1_argscope.
      
      --
      192421843  by Zhichao Lu:
      
          Refactoring of Mask-RCNN to put all mask prediction code in third stage.
      
      --
      192320460  by Zhichao Lu:
      
          Returning eval_on_train_input_fn from create_estimator_and_inputs(), rather than using train_input_fn in EVAL mode (which will still have data augmentation).
      
      --
      192226678  by Zhichao Lu:
      
          Access TPUEstimator and CrossShardOptimizer from tf namespace.
      
      --
      192195514  by Zhichao Lu:
      
          Fix test that was flaky due to randomness
      
      --
      192166224  by Zhichao Lu:
      
          Minor fixes to match git repo.
      
      --
      192147130  by Zhichao Lu:
      
          Use shape utils for assertion in feature extractor.
      
      --
      192132440  by Zhichao Lu:
      
          Class agnostic masks for mask_rcnn
      
      --
      192006190  by Zhichao Lu:
      
          Add learning rate summary in EVAL mode in model.py
      
      --
      192004845  by Zhichao Lu:
      
          Migrating away from Experiment class, as it is now deprecated. Also, refactoring into a separate model library and binaries.
      
      --
      191957195  by Zhichao Lu:
      
          Add classification_loss and localization_loss metrics for TPU jobs.
      
      --
      191932855  by Zhichao Lu:
      
          Add an option to skip the last striding in mobilenet. The modified network has nominal output stride 16 instead of 32.
      
      --
      191787921  by Zhichao Lu:
      
          Add option to override base feature extractor hyperparams in SSD models. This would allow us to use the same set of hyperparams for the complete feature extractor (base + new layers) if desired.
      
      --
      191743097  by Zhichao Lu:
      
          Adding an attribute to SSD model to indicate which fields in prediction dictionary have a batch dimension. This will be useful for future video models.
      
      --
      191668425  by Zhichao Lu:
      
          Internal change.
      
      --
      191649512  by Zhichao Lu:
      
          Introduce two parameters in ssd.proto (freeze_batchnorm and inplace_batchnorm_update) and set up slim arg_scopes in ssd_meta_arch.py so that they apply to all batchnorm ops in the predict() method.
      
          This centralizes the control of freezing and doing inplace batchnorm updates.
      
      --
      191620303  by Zhichao Lu:
      
          Modifications to the preprocessor to support multiclass scores
      
      --
      
      PiperOrigin-RevId: 195269567
      63054210
  28. 03 Apr, 2018 4 commits
  29. 08 Mar, 2018 1 commit
    • Zhichao Lu's avatar
      Add FAQ to object detection and replace... · ec16e472
      Zhichao Lu authored
      Add FAQ to object detection and replace tf.contrib.slim.tfexample_decoder.BackupHandler with object_detection.data_decoders.tf_example_decoder.BackupHandler.
      
      PiperOrigin-RevId: 188191275
      ec16e472